Sorting millions of snapshots from the Linac Coherent Light Source

August 31, 2011 By Glennda Chui

Sorting millions of snapshots from the LCLS

Randomly selected representatives from (a) nanorice snapshots; (b) Mimivirus snapshots; (c) miscellaneous snapshots; (d) blank snapshots; and (e) saturated snapshots, each from a separate bin of images sorted by the new, automatic method. Credit: Chun Hong Yoon, et al.

The great thing about SLAC’s Linac Coherent Light Source is that it churns out incredible volumes of data about things no one has ever seen before, such as snapshots of individual viruses.

The hard thing is: What to do with all that data? Of the several million snapshots scientists might get in a single, 10-hour shift of zapping samples with a powerful X-ray , fewer than 1 in 100 will contain the information they’re looking for.

This is the problem that Abbas Ourmazd and colleagues Peter Schwander and Chun Hong Yoon of the University of Wisconsin-Milwaukee are working on. They’re members of an 80-plus person collaboration, including SLAC scientists from the LCLS and the PULSE Institute for Ultrafast Energy Science, who performed groundbreaking experiments on large viruses and tiny protein nanocrystals with the world’s most powerful X-ray laser in December 2009.

In an Aug. 12 report in Optics Express, the team outlines a method for automatically sorting those millions of snapshots so most of the good shots end up in just one bin. Further analysis of the data in that bin should yield a 3-D image of the original object, Ourmazd said, whether it’s a virus or a tiny grain of “nanorice” that’s used as a test subject in these studies.

“Computer analysis – making sense of all this beautiful data coming out of the LCLS – has turned out to be as difficult and as important as the data acquisition,” Ourmazd said in an interview.

He describes the problem this way:

“Say, for example, you make a solution full of virus and you spit little droplets of the solution into the X-ray laser beam” with an injector that sprays them in a fine mist. ”Each droplet can be empty, or it can have one virus particle in it, or it can have multiple particles in it,” Ourmazd said. “Also, the solution is not necessarily pure, so you can have different kinds of particles being spat out individually or in combination.

“In addition, the X-ray intensity varies from shot to shot. Even when it’s the same, the laser beam may miss the particle, hit it square on, or just hit some of it and not the rest. What you would like to do is take these snapshots and from them extract the droplets that contain a single copy of a single virus which has been nicely hit.”

When the X-ray pulse hits the virus, it forms a diffraction pattern in a detector, and thousands of those patterns can be combined to get the structure of the virus. The challenge is to do this automatically, without direct human supervision; without bias; with high precision; and really fast.
Ourmazd, Schwander and Yoon are part of a theory group that is developing algorithms for analyzing the terabytes of data coming out of the LCLS. Their approach combines mathematics, scattering physics and information theory. Ourmazd is a distinguished professor of physics; Schwander, a senior scientist specializing in physics and informatics; and Yoon, a postdoctoral researcher and electrical engineer.

The simplest way to think about their method, Ourmazd said, is to start with the idea that experimental data is correlated – it hangs together, somehow – while background noise is random. You can think of the correlated data for each snapshot of an object, whether it’s a full-on hit or a glancing blow, as lying on a surface. “The interesting information content is in the shape and in the wrinkles of the surface,” he said. The question is, what is the characteristic wrinkle pattern and shape that distinguish one type of object from another, and good shots from mediocre ones?

The algorithm found these patterns of correlation and used them to sort 7,214 snapshots of nanorice and viruses. The results showed 90 percent agreement with sorting done manually by a human expert. The researchers estimate that a million snapshots can be sorted in less than 10 hours with this technique.

“You go to the data with no preconceived notions. All the information is in the wrinkles and the shape of the data set,” Ourmazd said. “It’s a wonderful combination of mathematics and information theory.”

Once you’ve done that, he said, you should be able to open the bin that contains full-on images of nanorice grains or viruses, taken from all possible angles as the tiny particles tumbled through the X-ray beam, and reconstruct their structure in 3-D.

Ourmazd said the team is already applying this approach to data from LCLS experiments, images taken with cryo-electron microscopy and free-electron laser studies of changes in individual molecules.

“We feel like kids in a toy store,” he said, “totally confused by the choices and overwhelmed with pleasure.”

Provided by SLAC National Accelerator Laboratory search and more info website

Filter


Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

El_Nose
Aug 31, 2011

Rank: not rated yet
now that is beautiful CS project
Rank not rated yet
Relevant PhysicsForums posts
  • Water flow question
    created3 hours ago
  • [Drift velocity] Factors affecting velocity
    created5 hours ago
  • does cold gasoline have less energy
    created6 hours ago
  • distribution of molecules throughout the atmosphere
    created8 hours ago
  • The Global Positioning System !
    created9 hours ago
  • A Question relating Power
    created10 hours ago
  • More from Physics Forums - General Physics

More news stories

Is a classical electrodynamics law incompatible with special relativity?

(Phys.org) -- The laws of classical electromagnetism that were developed in the 19th century are the same laws that scientists use today. They include Maxwell’s four equations along with the Lorentz la ...

Physics / General Physics

created May 24, 2012 | popularity 4.7 / 5 (17) | comments 43 | with audio podcast feature

Landmark calculation clears the way to answering how matter is formed

(Phys.org) -- An international collaboration of scientists, including Thomas Blum, associate professor of physics, is reporting in landmark detail the decay process of a subatomic particle called a kaon – ...

Physics / General Physics

created May 25, 2012 | popularity 4.3 / 5 (22) | comments 50 | with audio podcast

Lying in wait for WIMPs: Researchers seek to dramatically increase sensitivity of Large Underground Xenon detector

Although it's invisible, dark matter accounts for at least 80 percent of the matter in the universe. No one knows what it is, but most scientists would bet on weakly interacting massive particles, or WIMPs.

Physics / General Physics

created May 23, 2012 | popularity 4 / 5 (7) | comments 15 | with audio podcast

Hawaii lab turns laser-powered bubbles into microrobots

(Phys.org) -- A team of scientists from the University of Hawaii are working on microrobots created from bubbles of air in a saline solution. The bubbles take on their title of “robots” as a laser ...

Physics / General Physics

created May 23, 2012 | popularity 5 / 5 (4) | comments 2 | with audio podcast weblog

Sound increases the efficiency of boiling

Scientists at the Georgia Institute of Technology achieved a 17-percent increase in boiling efficiency by using an acoustic field to enhance heat transfer. The acoustic field does this by efficiently removing vapor bubbles ...

Physics / Soft Matter

created May 24, 2012 | popularity 5 / 5 (2) | comments 2


Change in developmental timing was crucial in the evolutionary shift from dinosaurs to birds: study

At first glance, it's hard to see how a common house sparrow and a Tyrannosaurus Rex might have anything in common. After all, one is a bird that weighs less than an ounce, and the other is a dinosaur that ...

Computer model used to pinpoint prime materials for efficient carbon capture

When power plants begin capturing their carbon emissions to reduce greenhouse gases – and to most in the electric power industry, it's a question of when, not if – it will be an expensive undertaking.

'Unzipped' carbon nanotubes could help energize fuel cells, batteries

Multi-walled carbon nanotubes riddled with defects and impurities on the outside could replace some of the expensive platinum catalysts used in fuel cells and metal-air batteries, according to scientists at ...

T cells 'hunt' parasites like animal predators seek prey, study shows

By pairing an intimate knowledge of immune-system function with a deep understanding of statistical physics, a cross-disciplinary team at the University of Pennsylvania has arrived at a surprising finding: T cells use a movement ...

Manufacturing genes to attack flu virus

An international research team has manufactured a new protein that can combat deadly flu epidemics.

Yale study concludes public apathy over climate change unrelated to science literacy

Are members of the public divided about climate change because they don't understand the science behind it? If Americans knew more basic science and were more proficient in technical reasoning, would public consensus match ...