Data-mining for crystal 'gold' at SLAC's X-ray laser

March 17, 2014 by Glenn Roberts Jr., SLAC National Accelerator Laboratory
Nicholas Sauter, middle, a computer staff scientist at Lawrence Berkeley National Laboratory, points to a monitor during an experiment this month at SLAC's Linac Coherent Light Source X-ray laser. Sauter helped to develop a new data-analysis tool for X-ray crystallography experiments at LCLS. Also pictured are Hartawan Laksmono, left, of the Stanford Pulse Institute at SLAC, and Jan Kern of Berkeley Lab. Credit: Fabricio Sousa/SLAC

A new tool for analyzing mountains of data from SLAC's Linac Coherent Lightsource (LCLS) X-ray laser can produce high-quality images of important proteins using fewer samples. Scientists hope to use it to  reveal the structures and functions of proteins that have proven elusive, as well as mine data from past experiments for new information.

"Such analytical tools might be as important to LCLS experiments as better detectors, sample-delivery systems and other instruments," said Uwe Bergmann, director of LCLS and a member of a research collaboration that has tested the . "Continued improvements in methods like this will be critical for very precious samples – particularly when time or the amount of sample is limited."

The software package, known as the Computational Crystallography Toolbox for X-ray Free-electron Lasers or cctbx.xfel, was developed as a part of an international project to study proteins involved in oxygen-producting stages in photosynthesis, but can be applied to other protein studies as well. It should be especially helpful in analyzing proteins that are difficult to crystallize in large quantities for experiments, including many relevant to fighting disease. The software toolbox is freely available online, and users can get help online or via email.

Detailed in a paper published in the March 16 edition of Nature Methods, the new tool is designed to glean more information from protein samples based on a customized, improved analysis of LCLS X-ray images.

The software finds new ways to precisely match LCLS data with Bragg's Law, the 101-year-old discovery that describes the mathematics of how X-rays project the molecular blueprints of tiny crystallized samples onto a detector. It does so by factoring in painstaking measurements of the surfaces of LCLS X-ray detectors.

This infographic shows the extreme volume of data generated in experiments at SLAC's Linac Coherent Light Source X-ray laser. In one example, an experiment focused on stages of photosynthesis collected 114 terabytes of data in five days, or enough to fill 2,335 standard Blue-ray video discs. Credit: Terry Anderson/SLAC

"We're trying to really accurately measure the geometry of the detectors – to know where the measurements are being made to the level of microns," said Nicholas Sauter, a computer staff scientist at Lawrence Berkeley National Laboratory who led the software development effort with Berkeley Lab senior scientist Paul Adams. This detailed mapping of the detectors provides a more accurate analysis of LCLS X-ray images.

The software also analyzes spots in the X-ray images that other tools reject or overlook, such as streaked, curved, dim or fuzzy features, increasing the number of usable images. "In addition, it is designed to resolve sharper details of the atomic structure," Sauter said.

The developers adapted the software from LABELIT, a tool Sauter released a decade ago to analyze data from synchrotrons, the most widely used X-ray facilities for studying crystallized biological samples.

X-ray free-electron lasers such as LCLS, with ultrashort X-ray pulses that are millions of times brighter than synchrotron X-rays, are proving a powerful new force in solving molecular mysteries that synchrotrons cannot, but they bring a new set of scientific challenges.

At synchrotrons, scientists typically study frozen crystals one at a time, rotating each one slowly and taking multiple X-ray images.

Members of a research team that performed a photosynthesis-related experiment at SLAC's Linac Coherent Light Source. The team has used a data-analysis software tool developed for X-ray crystallography experiments at LCLS. From left to right: Aaron Brewster, Junko Yano, Roberto Alonso-Mori, Jan Kern, Sheraz Gul, Franklin Fuller, Nicholas Sauter, Thomas Kroll, Rosalie Tran and Johannes Messinger. Credit: Fabricio Sousa/SLAC

LCLS can study smaller crystals and under more natural conditions, but it requires a much larger number of crystals, which are typically suspended in a liquid or gel and jetted into the path of the X-rays. Because the crystals are tumbling randomly when the X-ray snapshots are taken and only one image can be taken of each crystal, scientists must gather tens of thousands of high-quality images to get a complete picture of a protein structure. A recent experiment at LCLS collected enough data to fill about 2,335 standard Blu-ray video discs, Sauter said.

Junko Yano, a staff scientist at Berkeley Lab whose research team includes Berkeley Lab senior scientist Vittal Yachandra, has used the new data-analysis tool to study the molecular machinery at work in photosynthesis. She said even in cases where it is easy to produce crystals and generate a lot of data, the software could improve the resolution of protein structures by capturing more details from the highest-quality crystals.

"With many biological systems we may not have this luxury of easily producing a lot of crystals," she added, "so this will help us to minimize the amount of samples we need to collect high-quality data, both at LCLS and at other free-electron X-ray laser facilities that are coming on line."

Explore further: X-ray laser at SLAC maps important drug target

More information: "Accurate macromolecular structures using minimal measurements from X-ray free-electron lasers." Johan Hattne, Nathaniel Echols et al., Nature Methods, 16 March 2014 (10.1038/nmeth.2887)

Related Stories

X-ray laser at SLAC maps important drug target

December 31, 2013

( —Researchers have used one of the brightest X-ray sources on the planet to map the 3-D structure of an important cellular gatekeeper known as a G protein-coupled receptor, or GPCR, in a more natural state than ...

A new way to tune X-ray laser pulses

March 11, 2014

( —A new system at SLAC National Accelerator Laboratory's X-ray laser narrows a rainbow spectrum of X-ray colors to a more intense band of light, creating a much more powerful way to view fine details in samples ...

Ribosome research takes shape

August 29, 2013

In a new state-of-the-art lab at SLAC National Accelerator Laboratory, components of ribosomes – tiny biological machines that make new proteins and play a vital role in gene expression and antibiotic treatments – form ...

LAMP: A new tool turns on at SLAC's X-ray laser

January 17, 2014

( —A 2-ton instrument the size of a compact car, now available at SLAC's X-ray laser, makes it possible to capture more detailed images of atoms, molecules, nanoscale features of solids, and individual particles ...

Recommended for you

Reaching new heights in laser-accelerated ion energy

February 20, 2018

A laser-driven ion acceleration scheme, developed in research led at the University of Strathclyde, could lead to compact ion sources for established and innovative applications in science, medicine and industry.

MEMS chips get metatlenses

February 20, 2018

Lens technologies have advanced across all scales, from digital cameras and high bandwidth in fiber optics to the LIGO lab instruments. Now, a new lens technology that could be produced using standard computer-chip technology ...

Using organoids to understand how the brain wrinkles

February 20, 2018

A team of researchers working at the Weizmann Institute of Science has found that organoids can be used to better understand how the human brain wrinkles as it develops. In their paper published in the journal Nature Physics, ...


Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.