New model of protein folding helps researchers handle flood of genomic data
March 22, 2011 by Larry Hardesty
Graphic: Christine Daniloff
All living tissue is made from proteins, and all proteins are made from a combination of the same 20 chemical building blocks, called amino acids. The difference between the proteins that make up bone, blood, hair and eyeballs is largely one of shape.
Genes are the recipes for stringing together amino acids into proteins, but the way in which those strings fold back on themselves determines their shape. So understanding genes roles in disease requires understanding how proteins fold.
In a series of recent papers, researchers at MITs Computer Science and Artificial Intelligence Laboratory have demonstrated a promising new technique for modeling such protein folding. While not as accurate as some existing techniques, it is much more computationally efficient. Sophisticated, atom-by-atom simulations that run on hundreds of thousands of computers might take months to model a few milliseconds of protein folding. The researchers new technique can model the same process in minutes on a single laptop.
Speed is of the essence as the amount of unprocessed genomic data proliferates. Theres the Broad 1,000 Genomes project, theres X many species that have been sequenced now, and the sequence data is just vastly outpacing the speed with which you could apply some of these other techniques, says Charles ODonnell, a PhD student in the Department of Electrical Engineering and Computer Science who helped develop the new approach. If you want to make sense of all this high-throughput data thats coming from this great biotech innovation, then you need something quick.
Other quick methods of simulating protein folding exist, but the MIT researchers appears to be more accurate. There is still much we dont know about the actual structure of proteins, ODonnell cautions, so that makes assessing the quality of computational methods difficult. But at the 19th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB) in July, the MIT researchers will present a paper demonstrating that for a class of proteins known as amyloids, their techniques predictions match the currently available data with 81 percent accuracy, whereas high-efficiency techniques previously managed 42 percent at best.
Computational modeling of protein folding has been an active research area for decades, but it hasnt been entirely clear whether it was going to be useful or not, says Susan Lindquist, an MIT professor of biology, recent recipient of the National Medal of Science, and, along with CSAILs Bonnie Berger, one of ODonnells faculty advisors. I think that this paper helps realize that goal.
Quantity over quality
When a protein folds, amino acids far from each other on the protein strand are brought close together, and chemical bonds form between them. That folding, however, brings other amino acids into proximity with each other, and those acids could exert either an attractive or a repulsive force on each other. Predicting a proteins shape is a matter of figuring out which regions of the strand could have affinities for each other, and whether bringing those regions together would cause unsupportable tensions elsewhere.
Atom-by-atom simulations can model amino acids interactions very precisely, but because theyre so computationally complex, theyve generally been restricted to protein strands with only a couple dozen amino acids, whereas full proteins can comprise hundreds or even thousands. Making the simulations computationally efficient means sacrificing information about the amino acids interactions, and most previous attempts have tried to strike a balance between accuracy of representation and simplicity of description.
MIT computer science professors Bonnie Berger and Srini Devadas, ODonnell and Jérôme Waldispuhl, a former MIT math instructor whos now an assistant professor at McGill, adopted a somewhat different approach. They employ what they describe as a coarse representation of a proteins chemical properties, but that allows them to generate a huge number of candidate shapes. Their algorithm then looks for the features that occur most frequently across all the candidates, which it then synthesizes into a small group of likely structures.
Working with collaborators at McGill, Boston College, and the MIT Department of Biology, theyve applied the technique to several different problems. The paper theyre presenting in July describes their amyloid shape-prediction results, but at the 15th Annual International Conference on Research in Computational Molecular Biology on March 28, theyll present another paper describing the precise sequence of steps by which different types of proteins mainly so-called beta-sheet proteins fold. There, Waldispuhl explains, the trick is that each step in the folding pathway is itself a different shape in the library of candidates, and the algorithm finds a pathway through them. Two years ago at the same conference, the researchers presented an earlier result in which they used their technique to explain the commonalities between proteins with different sequences of amino acids that nonetheless played the same role in certain biological systems, implying that they had structural similarities.
Protein folding continues to be wide-open problem with desperate need of more rigorous mathematical, statistical and computer-science approaches, says Sorin Istrail, a professor of computer science at Brown University who specializes in computational biology. What distinguishes the MIT researchers work, he says, is its rigorously mathematical results. The world needs to do what Bonnie and Charlie are doing, Istrail says, taking one aspect of the problem and building rigorous methods for that particular component.
This story is republished courtesy of MIT News (http://web.mit.edu/newsoffice/), a popular site that covers news about MIT research, innovation and teaching.
Provided by
Massachusetts Institute of Technology
-
From lemons to lemonade: Reaction uses carbon dioxide to make carbon-based semiconductor,
32 comments
-
Thioridazine kills cancer stem cells in human while avoiding toxic side-effects of conventional cancer treatments,
3 comments
-
SpaceX private rocket blasts off for space station (Update),
42 comments
-
Climate scientists say they have solved riddle of rising sea,
30 comments
-
Research team claims to have found evidence Lake Cheko is impact crater for Tunguska Event,
18 comments
-
Ideas to mitigate risk of 911 calls being misdirected
May 24, 2012
-
Live scribe pen?
May 10, 2012
-
Shallow water flow simulation
May 07, 2012
-
Tablet for taking notes?
May 05, 2012
-
Best fit tablet for me?
May 05, 2012
-
Measure of Informaton
May 04, 2012
- More from Physics Forums - Computing & Technology
More news stories
SpotterRF debuts Radar Backpack Kit (w/ Video)
(Phys.org) -- SpotterRF has announced a special radar backpack kit designed to enhance situational awareness for soldiers on the ground. The company says its special radar is designed for warfighters as part ...
Probability of contamination from severe nuclear reactor accidents is higher than expected: study
Catastrophic nuclear accidents such as the core meltdowns in Chernobyl and Fukushima are more likely to happen than previously assumed. Based on the operating hours of all civil nuclear reactors and the number ...
Technology / Energy & Green Tech
May 22, 2012 |
3.6 / 5 (21) |
54
|
Delphi gasoline-injection engine technique rivals hybrid's edge
(Phys.org) -- Running a diesel like engine on gasoline is something Delphi is doing in notable fashion. They claim they are on to a promising way to enjoy an engine that gives the vehicle owner high efficiency ...
HyperSolar shows dirty water no barrier to power world
(Phys.org) -- The Santa Barbara, California, company, HyperSolar, is set to transparently share the ups and downs of its research experiences toward the companys ultimate vision, successfully producing ...
Tesla to launch electric sedan in US on June 22
Tesla Motors said Tuesday it would begin deliveries of "the world's first premium electric sedan" on June 22, slightly ahead of schedule.
Technology / Energy & Green Tech
May 22, 2012 |
4.5 / 5 (11) |
18
Dell tablet leak: 10.1-inch display, two-battery choice
(Phys.org) -- Headline after headline talks about vendors tablets in the wings as likely number-one contenders for the iPad. Such claims have justifiably been taken with a grain of salt, considering ...
Scientist: Evolution debate will soon be history
(AP) -- Richard Leakey predicts skepticism over evolution will soon be history. Not that the avowed atheist has any doubts himself.
SpaceX capsule has 'new car' smell, astronauts say (Update)
SpaceX's Dragon cargo vessel smells like a new car, said astronauts at the International Space Station after opening the hatches Saturday following the spacecraft's landmark mission to the orbiting lab.
Thousands of shellfish found dead in Peru
Thousands of crustaceans were found dead off the coast of Lima following the mystery mass death of dolphins and pelicans, the Peruvian Navy said Friday.
Keep food safety in mind this memorial day weekend
(HealthDay) -- Picnics, parades and cookouts are as much a part of Memorial Day weekend as tributes to the United States' war veterans.
Astronomers seize last chance in lifetime for Venus Transit
Astronomers are gearing for one the rarest events in the Solar System: an alignment of Earth, Venus and the Sun that will not be seen for another 105 years.
Mar 22, 2011
Rank: 1 / 5 (9)
Just think of the highly improbable location of different parts of that folded protein - and just how those atoms interact with others at exactly those locations.
It requires one to suspend logic in order to believe in the miraculous powers of evolution to first create life and then to expect that a human being will eventually arise a few billion years later - all from a single-celled ancestor.
Please don't tell me evolution is not responsible for the creation of life [or that the theory of evolution does not include the origin of life]. If you don't believe in a creator you are left only with random spontaneous physical processes as the originator.
Mar 22, 2011
Rank: not rated yet
Could this process be used to filter out unviable combinations so that the more accurate combutation intensive processes could be used on candidates with more potential?
Mar 22, 2011
Rank: not rated yet
Mar 22, 2011
Rank: 5 / 5 (1)
Translation - I don't understand chemistry, physics, thermodynamics, biology or evolution.
Absolutely correct. So why do you believe in miraculous powers?
Why not, it's a fact.
It doesn't.
See my first, translated, reply above. Bye, bye.