IU 'Twister' software improves Google's MapReduce for large-scale scientific data analysis

Mar 16, 2010

(PhysOrg.com) -- "Twister," a new software tool released by Indiana University, supports faster execution of many data mining applications implemented as MapReduce programs. Developed by researchers from the Pervasive Technology Institute at IU, the tool extends the functionality of MapReduce, a distributed programming technique patented by Google for large-scale data processing in datacenter environments.

Twister allows MapReduce to achieve higher performance, perform faster data transfers, and reduce the time it takes to process vast sets of data for data mining and machine learning applications.

"MapReduce is an exceptionally valuable tool for finding meaning in very large scientific data sets," said Xiaohong "Judy" Qiu, Associate Director of the Community Grids Lab within the PTI Digital Science Center and lead on the project (Service Aggregated Linked Sequential Activities, or SALSA) that produced Twister. "Twister makes MapReduce even more powerful for data-intensive disciplines such as physics, chemistry and the medical and life sciences."

Applications that currently use Twister include K-means clustering, Google's PageRank, breadth-first graph search, matrix multiplication, and multidimensional scaling. Twister also builds on the SALSA team's work related to commercial MapReduce runtimes, including Microsoft's Dryad software and the open source Hadoop software. SALSA project work is funded in part by an award from Microsoft.

"Twister is especially effective for applications with iterative MapReduce computations," said Jaliya Ekanayake, lead developer on the Twister project. "The architecture is based on pub/sub messaging that enables it to perform faster data transfers, minimizing the overhead of the runtime. Also, the support for long-running processes improves the efficiency of the runtime for many iterative MapReduce computations."
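
To make the idea of an iterative MapReduce computation concrete, here is a minimal sketch of K-means clustering, one of the applications listed above, written as a repeated map and reduce step. It is an illustration only and does not use Twister's actual API: the function names (map_assign, reduce_update, iterative_kmeans), the one-dimensional data, and the convergence tolerance are all assumptions made for this example. Everything runs sequentially in one process, so the pub/sub messaging layer is not modeled. What the sketch does show is the pattern Ekanayake describes: the data partitions are loaded once and reused on every pass, and only the small set of centroids changes between iterations.

```python
from collections import defaultdict


def map_assign(points, centroids):
    """Map step: for one data partition, sum the points nearest each centroid."""
    partial = defaultdict(lambda: [0.0, 0])  # centroid index -> [sum, count]
    for x in points:
        nearest = min(range(len(centroids)), key=lambda i: abs(x - centroids[i]))
        partial[nearest][0] += x
        partial[nearest][1] += 1
    return partial


def reduce_update(partials, centroids):
    """Reduce step: merge the per-partition sums and compute new centroids."""
    totals = defaultdict(lambda: [0.0, 0])
    for partial in partials:
        for idx, (total, count) in partial.items():
            totals[idx][0] += total
            totals[idx][1] += count
    # A centroid with no assigned points keeps its previous value.
    return [
        totals[i][0] / totals[i][1] if totals[i][1] else old
        for i, old in enumerate(centroids)
    ]


def iterative_kmeans(partitions, centroids, max_iters=50, tol=1e-9):
    """Repeat map and reduce until the centroids stop moving.

    The partitions stay in memory across iterations; only the centroids
    are passed into each new round.
    """
    for _ in range(max_iters):
        partials = [map_assign(p, centroids) for p in partitions]
        new_centroids = reduce_update(partials, centroids)
        if max(abs(a - b) for a, b in zip(new_centroids, centroids)) < tol:
            return new_centroids
        centroids = new_centroids
    return centroids


# Two hypothetical partitions of one-dimensional data, two starting centroids.
partitions = [[1.0, 2.0, 3.0], [10.0, 11.0, 12.0]]
result = iterative_kmeans(partitions, [0.0, 5.0])
print(result)
```

In a runtime that restarts its map tasks on every pass, the partitions would be read again each iteration; keeping them resident in long-running tasks is the kind of saving the quote refers to.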

To access papers on Twister or to learn more about the software, please visit www.iterativemapreduce.org.

To watch a video about Twister, please visit pti.iu.edu/video/twister.


