January 8, 2009

New tool enables powerful data analysis

(PhysOrg.com) -- A powerful computing tool that allows scientists to extract features and patterns from enormously large and complex sets of raw data has been developed by scientists at University of California, Davis, and Lawrence Livermore National Laboratory. The tool - a set of problem-solving calculations known as an algorithm - is compact enough to run on computers with as little as two gigabytes of memory.

The team that developed this algorithm has already used it to probe a slew of phenomena represented by billions of data points, including analyzing and creating images of flame surfaces; searching for clusters and voids in a virtual universe experiment; and identifying and tracking pockets of fluid in a simulated mixing of two fluids.

"What we've developed is a workable system of handling any data in any dimension," said Attila Gyulassy, who led the five-year development effort while pursuing a PhD in computer science at UC Davis. "We expect this algorithm will become an integral part of a scientist's toolbox to answer questions about data."

A paper describing the new algorithm was published in the November-December issue of IEEE Transactions on Visualization and Computer Graphics.

Mixing of Fluids — This image of an early moment in the simulated mixing of two fluids was created by researchers using a powerful new algorithm they developed to extract features and patterns from massive data sets. In the image, blue and red spheres and the lines between them represent the branching of pockets of fluid. Image: Attila Gyulassi/UC Davis copyright UC Regents

Computers are widely used to perform simulations of real-world phenomena and to capture results of physical experiments and observations, storing this information as collections of numbers. But as the size of these data sets has burgeoned, hand-in-hand with computer capacity, analysis has grown increasingly difficult.

A mathematical tool to extract and visualize useful features from data sets has existed for nearly 40 years - in theory. Called the Morse-Smale complex, it partitions sets by similarity of features and encodes them into mathematical terms. But working with the Morse-Smale complex is not easy. "It's a powerful language. But a cost of that, is that using it meaningfully for practical applications is very difficult," Gyulassy said.

Gyulassy's algorithm divides data sets into parcels of cells, then analyzes each parcel separately using the Morse-Smale complex. Results of those computations are then merged together. As new parcels are created from merged parcels, they are analyzed and merged yet again. At each step, data that do not need to be stored in memory are discarded, drastically reducing the computing power required to run the calculations.

One of Gyulassy's tests of the algorithm was to use it to analyze and track the formation and movement of pockets of fluid in the simulated mixing of two fluids: one dense, one light. The complexity of this data set is so vast - it consists of more than one billion data points on a three-dimensional grid - it challenges even supercomputers, Gyulassy said. Yet the new algorithm with its streamlining features was able to perform the analysis on a laptop computer with just two gigabytes of memory. Although Gyulassy had to wait nearly 24 hours for the little machine to complete its calculations, at the end of this process he could pull up images in mere seconds to illustrate phenomena he was interested in, such as the branching of fluid pockets in the mixture.

Two main factors are driving the need for analysis of large data sets, said co-author Bernd Hamann: a surge in the use of powerful computers that can produce huge amounts of data, and an upswing in affordability and availability of sensing devices that researchers deploy in the field and lab to collect a profusion of data.

"Our data files are becoming larger and larger, while the scientist has less and less time to understand them," said Hamann, a professor of computer science and associate vice chancellor for research at UC Davis. "But what are the data good for if we don't have the means of applying mathematically sound and computationally efficient computer analysis tools to look for what is captured in them?"

Gyulassy is currently developing software that will allow others to put the algorithm to use. He expects the learning curve to be steep for this open-source product, "but if you just learn the minimal amount about what a Morse-Smale complex is," he said, "it will be pretty intuitive."

Source: University of California - Davis

Citation: New tool enables powerful data analysis (2009, January 8) retrieved 23 April 2024 from https://phys.org/news/2009-01-tool-enables-powerful-analysis.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

A new bioimaging method for speeding up and simplifying chemicals identification in tissues

0 shares

Feedback to editors

Advances in understanding the evolution of stomach loss in agastric fishes

1 minute ago

New toolkit makes molecular dynamics simulations more accessible

2 minutes ago

New small molecule helps scientists study regeneration

2 minutes ago

Breaking boundaries in tiny labs: New technology using sound waves has implications for nanoparticle manipulation

2 minutes ago

Novel method could explore gluon saturation at the future electron-ion collider

2 minutes ago

Study shows it's not too late to save the West Antarctic Ice Sheet

8 minutes ago

A first glimpse at our galaxy's magnetic field in 3D

9 minutes ago

A new method for enzymatic synthesis of potential RNA therapeutics

15 minutes ago

A universal framework for spatial biology

19 minutes ago

Bacteria for climate-neutral chemicals of the future

23 minutes ago

Load comments (4)

New tool enables powerful data analysis

Advances in understanding the evolution of stomach loss in agastric fishes

New toolkit makes molecular dynamics simulations more accessible

New small molecule helps scientists study regeneration

Breaking boundaries in tiny labs: New technology using sound waves has implications for nanoparticle manipulation

Novel method could explore gluon saturation at the future electron-ion collider

Study shows it's not too late to save the West Antarctic Ice Sheet

A first glimpse at our galaxy's magnetic field in 3D

A new method for enzymatic synthesis of potential RNA therapeutics

A universal framework for spatial biology

Bacteria for climate-neutral chemicals of the future

Relevant PhysicsForums posts

Passing variables in FORTRAN

My Website For Creating Interactive Visuals Linked To Equations

Number of Multiplications in the FFT Algorithm

Error logging in: onLoginSuccess is not a function

Latest Notable AI accomplishments

Building a homemade Long Short Term Memory with FSMs

A new bioimaging method for speeding up and simplifying chemicals identification in tissues

AI algorithm keeps a mile-long particle accelerator healthy

'Computer says no': More employers are using AI to recruit, increasing the risk of discrimination

Searching for axions with the ATLAS detector

Integrating AI with super-resolution microscopy for advancements in cellular biology

Exploring how the convergence of automation and AI reshapes organic chemistry research

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

New tool enables powerful data analysis

Advances in understanding the evolution of stomach loss in agastric fishes

New toolkit makes molecular dynamics simulations more accessible

New small molecule helps scientists study regeneration

Breaking boundaries in tiny labs: New technology using sound waves has implications for nanoparticle manipulation

Novel method could explore gluon saturation at the future electron-ion collider

Study shows it's not too late to save the West Antarctic Ice Sheet

A first glimpse at our galaxy's magnetic field in 3D

A new method for enzymatic synthesis of potential RNA therapeutics

A universal framework for spatial biology

Bacteria for climate-neutral chemicals of the future

Relevant PhysicsForums posts

Related Stories

A new bioimaging method for speeding up and simplifying chemicals identification in tissues

AI algorithm keeps a mile-long particle accelerator healthy

'Computer says no': More employers are using AI to recruit, increasing the risk of discrimination

Searching for axions with the ATLAS detector

Integrating AI with super-resolution microscopy for advancements in cellular biology

Exploring how the convergence of automation and AI reshapes organic chemistry research

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience