Data Travels Six Times Faster in the Clouds

Feb 26, 2009
Sector is cloud computing system designed for data intensive computing. Sector is designed to run on racks of comodity computers (such as pictured). The racks may be located within a single data center or across several geographically distributed data centers. Credit: Michal Sabala, NCDM, University of Illinois at Chicago

(PhysOrg.com) -- The National Center for Data Mining (NCDM) at the University of Chicago at Illinois established a cloud computing system that can quickly compile data from widely geographically distributed data centers across high performance networks. NCDM used the Open Cloud Testbed, managed by the Open Cloud Consortium, to demonstrate the "Sector System" at the annual meeting of the American Association for the Advancement of Science conference earlier this month in Chicago.

"We demonstrated that our system is six times faster than competing technology," said Robert Grossman, NCDM director and Open Data Group managing partner. "Without the requirement of costly and combersome data transfer from various locations to one central location, this opens the way to exciting collaborative scientific discovery."

This is a diagram of Phase 1 of the Open Cloud Testbed. Phase 1 of the Open Cloud Testbed consists of 4 racks located at the University of Illinois at Chicago, the StarLight Facility in Chicago, Johns Hopkins University in Baltimore, Maryland, and Calit2 at the University of California at San Diego. The 4 racks are connected by a 10 Gb/s network, provided by the Cisco C-Wave and regional high performance networks at each of the locations. The Open Cloud Testbed is managed by the Open Cloud Consortium. Credit: Open Cloud Consortium

Grossman and his team demonstrated using a common benchmark called Terasort. They found there was less than a 5 percent performance penalty when Terasort was run across the four data centers distributed across the country compared to running the entire computation within one data center. Prior to the Sector System, such computations were rarely done, as performance penalties were as high as 30 percent.

"With the Sector System, data intensive computing can scale not only to a data center, but for the first time, across data centers," said Grossman." This enables locating data centers in areas in which power and cooling is cost-effective."

The Open Cloud Testbed consists of racks of computers located at the University of Illinois at Chicago, the StarLight facility in Chicago, Johns Hopkins University in Baltimore, Maryland, and the University of California at San Diego, all connected by a wide area 10 Gb/s network, and all running a variety of cloud computing services, including cloud storage services and cloud computing services. The technology that makes this possible uses an open architecture design, specifically the open source sector system developed by the NCDM (sector.sf.net).

Although cloud computing is becoming common, processing data by clouds today is almost always done within a single data center. Generally, data intensive computing across geographically distributed data centers is avoided due to the difficulties and cost of moving large amounts of data over long distances. Sector employs an alternative network protocol called UDT designed to swiftly and smoothly transfer data.

According to Joe Mambretti, director of the International Center of Advanced Internet Research at Northwestern University and co-director of the Open Cloud Testbed, "These innovative technologies provide unique capabilities that will enable new generations of applications that can make discoveries involving large volumes of highly distributed data."

Provided by National Science Foundation

Explore further: Communication-optimal algorithms for contracting distributed tensors

add to favorites email to friend print save as pdf

Related Stories

Saving seeds the right way can save the world's plants

11 minutes ago

Exotic pests, shrinking ranges and a changing climate threaten some of the world's most rare and ecologically important plants, and so conservationists establish seed collections to save the seeds in banks ...

Evidence of a local hot bubble carved by a supernova

36 minutes ago

I spent this past weekend backpacking in Rocky Mountain National Park, where although the snow-swept peaks and the dangerously close wildlife were staggering, the night sky stood in triumph. Without a fire, ...

Nature inspires a greener way to make colorful plastics

36 minutes ago

Long before humans figured out how to create colors, nature had already perfected the process—think stunning, bright butterfly wings of many different hues, for example. Now scientists are tapping into ...

F1000Research brings static research figures to life

1 hour ago

F1000Research today published new research from Bjorn Brembs, professor of neurogenetics at the Institute of Zoology, Universitaet Regensburg, in Germany, with a proof-of-concept figure allowing readers and reviewers to run ...

Recommended for you

Microsoft challenging US on overseas data

14 minutes ago

In a case closely watched by the tech sector, Microsoft will challenge Thursday a US court order requiring it to give prosecutors electronic mail content associated with an overseas server.

Facebook's Internet.org expands in Zambia

14 minutes ago

(AP)—Facebook's Internet.org project is taking another step toward its goal of bringing the Internet to people who are not yet online with an app launching Thursday in Zambia.

Sony surprises with first quarter profit

27 minutes ago

(AP)—Sony reported a surprise eightfold jump in quarterly profit as sales got a perk from a cheap yen and its bottom line was helped by gains from buildings and its stake in a video-game maker.

Samsung profit falls as smartphone sales slow

42 minutes ago

(AP)—Samsung Electronics Co. reported a bigger-than-expected fall in second quarter profit on Thursday and said it was uncertain if earnings from its handset business would improve in the current quarter.

User comments : 0