PetaCache: Use that Memory

Mar 07, 2006
SLAC's Computerraum

For decades, high energy experimental physicists have struggled with a fundamental problem: they simply have too much data to analyze quickly and in its entirety.

BaBar researchers routinely wait nine months for computers to sift through large datasets, searching for interesting events and setting these aside for later analysis. This “data skimming” alone constantly uses about 50 percent of BaBar's computing power. And that’s before a researcher can even start analyzing her or his data. Preparing data from CERN's Large Hadron Collider (LHC) will only take longer.

Recognizing this widespread limitation, a team at SLAC is developing the PetaCache project, a new way of thinking about data access and storage. With new computer software and more efficient types of memory, PetaCache may significantly increase the speed of data analysis.

"PetaCache may help scientists change the way they think about exploring new ideas," said PetaCache project manager Randal Melen. "It will allow a physicist with a sudden new idea, an 'I wonder if…' moment, to quickly begin exploring that new idea."

Before the early 1990s, researchers analyzed much of their data from magnetic tape, having their computers spool through miles of it to find interesting events. As disk drives got larger and cheaper, and with the rise of computer clusters, much more of the data could be kept on disk. Yet these disks still required mechanical movement, limiting the speed at which researchers could begin accessing data. Computer technology has made great strides in raising the rate at which data can be moved, called bandwidth, but the time to get the first byte of data, called latency, has been much slower to improve. "PetaCache, then, is really about improving the latency of testing new ideas," said Melen.
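The difference between bandwidth and latency can be made concrete with a simple cost model, in which each random read costs one latency plus the transfer time for the bytes read. The sketch below is illustrative only: the latency and bandwidth figures, the event size, and the function name `access_time` are assumptions chosen for the example, not measurements from BaBar or PetaCache.

```python
def access_time(n_reads, bytes_per_read, latency_s, bandwidth_bytes_per_s):
    """Total time for n_reads independent random reads.

    Simple model: every read pays the full latency once, then
    transfers its bytes at the sustained bandwidth.
    """
    return n_reads * (latency_s + bytes_per_read / bandwidth_bytes_per_s)


# Assumed, order-of-magnitude figures for illustration only.
DISK = {"latency_s": 5e-3, "bandwidth_bytes_per_s": 50e6}    # seek-dominated
DRAM = {"latency_s": 100e-9, "bandwidth_bytes_per_s": 1e9}   # no moving parts

n_reads = 1_000_000   # hypothetical number of scattered events to fetch
event_size = 2_000    # hypothetical bytes per event

disk_time = access_time(n_reads, event_size, **DISK)
dram_time = access_time(n_reads, event_size, **DRAM)

print(f"disk: {disk_time:.0f} s, memory: {dram_time:.1f} s")
```

Under these assumed numbers, nearly all of the disk time is latency rather than transfer, which is why faster bandwidth alone does little for an analysis that jumps between many small, scattered events.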

To do this, PetaCache uses several types of memory, not disks. Although memory is much faster at getting this first byte of data, in the past it has been too expensive to buy in the quantities necessary to record and analyze the massive amounts of data taken at particle accelerators. Today, DRAM (Dynamic Random Access Memory) and flash memory are more affordable, and flash memory is expected to continue to drop in price as it is used more and more in consumer electronics such as digital cameras, iPod-like devices, and cell phones. If successful, the PetaCache project will allow researchers to use both DRAM and flash memory on a large scale.

The prototype PetaCache system comprises 64 server computers housed in two racks, each server with 16 gigabytes of DRAM, for a total of one terabyte of memory. This large yet fragmented amount of memory is linked together with SCALLA (Structured Cluster Architecture for Low Latency Access), a computer program developed by SCCS Software Developer Andy Hanushevsky. SCALLA moves data from data servers to batch systems running physics analysis software with the lowest possible latencies. This load-balancing, self-organizing software distributes data across many data servers efficiently, making the individual machines appear as one huge chunk of memory to SCALLA-aware physics applications.
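As a rough illustration of how many small memories can be presented as one pool, the sketch below adds up the prototype's capacity and maps file names to servers with a fixed hash. This is not SCALLA's algorithm, which the article describes as load-balancing and self-organizing; the static hash is a simplified stand-in, and `server_for` and the file name are hypothetical.

```python
import hashlib

SERVERS = 64              # servers in the prototype
DRAM_PER_SERVER_GB = 16   # gigabytes of DRAM per server

# Aggregate memory across the cluster: 64 * 16 GB, about one terabyte.
total_gb = SERVERS * DRAM_PER_SERVER_GB


def server_for(filename, n_servers=SERVERS):
    """Pick a server index for a file name with a stable hash.

    Simplified stand-in for data placement, NOT how SCALLA works:
    a real system would also rebalance and track server load.
    """
    digest = hashlib.sha256(filename.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % n_servers


# Hypothetical file name, for illustration only.
index = server_for("run1234/events.root")
print(f"{total_gb} GB total; file held by server {index}")
```

A client using such a mapping never needs to know which machine holds a given file, which is the sense in which the separate servers look like a single large memory.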

"The software makes good use of common hardware, so you don't have to make huge expenditures for great computing power," said Hanushevsky.

Right now, SLAC’s prototype system has one terabyte (1,000 gigabytes) of DRAM. With their next machine, the PetaCache team hopes to use mainly less expensive flash memory, which, according to SCCS Director Richard Mount, "holds future promise of cost-effective memory-based data-analysis systems."

This second-generation prototype will aim at a few tens of terabytes of flash memory, which would make the system useful to researchers at BaBar and the Large Synoptic Survey Telescope (LSST). In the next decade, the PetaCache team hopes to expand the system to a petabyte (1,000 terabytes), roughly the scale needed to be useful at the LHC.

"Over the next few years, this type of memory technology will become much more common, from BaBar to the LHC to banks and airline reservation systems," said Research Director Emeritus David Leith. "They all benefit from being able to work from memory."

Source: Stanford Linear Accelerator Center, by Kelen Tuttle
