Improved method for protein sequence comparisons is faster, more accurate, sensitive
Lightning fast and yet highly sensitive: HHblits is a new software tool for protein research that promises to significantly improve the functional analysis of proteins. A team of computational biologists led by Dr. Johannes Söding of LMU's Genzentrum has developed a new sequence search method that identifies proteins with similar sequences in databases. It is faster than previous methods and can discover twice as many evolutionarily related proteins. Conclusions about the protein under analysis can then be drawn from the functional and structural properties of the proteins identified.
"Our method will expand the scope and power of sequence analysis, which will in turn facilitate the experimental elucidation of the structure and function of many proteins," says Söding, who is also a member of the Center for Integrated Protein Science Munich (CiPSM).
Proteins are involved in nearly all biochemical processes of life. The functions a protein performs depend largely on the sequence of its amino acid building blocks, of which there are 20 types, and on the three-dimensional structure into which this sequence folds. From the similarity of protein sequences, bioinformatic methods can predict their evolutionary relatedness, which in turn implies similar structure and functions. Proteins to be studied are therefore routinely subjected to a sequence search, in which their sequence is compared with millions of sequences in publicly accessible databases that store the sequences of known proteins together with annotations of their structures and biological functions. The structure and functions of the protein of interest can then be inferred from those of the proteins with similar sequences. "This kind of sequence analysis is a fundamental tool in the field of bioinformatics," explains Söding.
Sequence search programs assess sequence similarity by computing pairwise alignments: the two amino acid sequences are arranged one above the other in such a way that identical or similar amino acids are, as far as possible, paired up in the same columns. "Perhaps even more important than the search for pairwise sequence similarities is the assembly of so-called multiple sequence alignments; in this case one searches for similar sequences in many related proteins and arranges them into a matrix, in which each sequence fills a row and similar amino acids end up in the same columns," says Söding. Because the functions and structure of evolutionarily related proteins are generally conserved - i.e. preserved even when the sequence is altered by mutations during the course of evolution - multiple sequence alignments form the basis for predicting the structure and molecular functions of uncharacterized proteins.
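To make the idea of a pairwise alignment concrete, here is a minimal sketch in Python. It uses the textbook Needleman-Wunsch global alignment with a deliberately simple scoring scheme (the `match`, `mismatch` and `gap` values are illustrative assumptions). It is not the algorithm used by PSI-BLAST or HHblits, which rely on more elaborate scoring and search strategies.

```python
def align_pair(a, b, match=1, mismatch=-1, gap=-2):
    """Globally align two sequences (Needleman-Wunsch, toy scoring)."""
    n, m = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + s,  # pair a[i-1] with b[j-1]
                score[i - 1][j] + gap,    # a[i-1] opposite a gap
                score[i][j - 1] + gap,    # b[j-1] opposite a gap
            )

    # Trace back from the bottom-right corner to recover the two rows.
    top, bottom = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            s = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + s:
                top.append(a[i - 1])
                bottom.append(b[j - 1])
                i, j = i - 1, j - 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            top.append(a[i - 1])
            bottom.append("-")
            i -= 1
        else:
            top.append("-")
            bottom.append(b[j - 1])
            j -= 1
    return score[n][m], "".join(reversed(top)), "".join(reversed(bottom))


if __name__ == "__main__":
    total, row_a, row_b = align_pair("ACDE", "ADE")
    print(total)
    print(row_a)
    print(row_b)
```

The two returned strings have equal length, so printing one above the other shows which residues share a column and where a gap (`-`) was inserted.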
For the past 15 years, the program PSI-BLAST has been the most popular tool for the comparison of protein sequences, as it combines speed with high sensitivity and precision. Now Söding's team has designed a method, called HHblits, which clearly surpasses PSI-BLAST in all aspects of performance. This improvement is largely due to two factors. First, the researchers convert both the sequence of interest and the sequences in the database to be searched into so-called Hidden Markov Models (HMMs). HMMs are statistical models that incorporate the mutation probabilities determined from sequence alignments, so this step increases the sensitivity and precision of the subsequent similarity search. Second, the team has developed a filtering procedure that reduces the amount of data that needs to be searched without appreciable loss of sensitivity. The trick is first to assemble similar sequences from the database into multiple sequence alignments. Each alignment column is then labeled with one of 219 "letters", such that columns with similar amino acid composition are represented by the same letter. "By translating the multiple sequence alignments into sequences composed of these 219 letters, we can replace the time-consuming pairwise comparison of HMMs by the comparison of simple sequences", says Söding. This reduces the search time 2500-fold. Söding emphasizes that "HHblits allows to predict the function and structure of proteins more often and more accurately than was previously possible." His group is already working on further improvements to the method, for example by incorporating information on the three-dimensional structures of proteins.
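The column-labeling step can be illustrated with a small Python sketch. Everything specific in it is an assumption made for illustration: the three toy prototypes stand in for the 219 letters mentioned above, and the squared-distance measure is a stand-in, since the article does not say how column similarity is measured. It is not the HHblits implementation.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def prototype(weights):
    """Build a 20-entry frequency vector from a {residue: weight} dict."""
    return [weights.get(aa, 0.0) for aa in AMINO_ACIDS]


# Hypothetical three-letter alphabet; the real method uses 219 letters.
TOY_PROTOTYPES = [
    prototype({"L": 0.5, "I": 0.3, "V": 0.2}),  # letter 0: hydrophobic
    prototype({"D": 0.5, "E": 0.5}),            # letter 1: acidic
    prototype({"G": 1.0}),                      # letter 2: glycine
]


def column_profile(column):
    """Relative amino acid frequencies in one alignment column (gaps ignored)."""
    counts = Counter(c for c in column if c in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        return [1.0 / len(AMINO_ACIDS)] * len(AMINO_ACIDS)
    return [counts[aa] / total for aa in AMINO_ACIDS]


def nearest_letter(profile, prototypes):
    """Index of the prototype closest to the profile (squared distance)."""
    def sq_dist(p, q):
        return sum((x - y) ** 2 for x, y in zip(p, q))
    return min(range(len(prototypes)), key=lambda k: sq_dist(profile, prototypes[k]))


def encode_alignment(msa, prototypes):
    """Translate a multiple sequence alignment into one letter per column."""
    return [nearest_letter(column_profile(col), prototypes) for col in zip(*msa)]


if __name__ == "__main__":
    msa = ["LDG", "IEG", "LDG", "V-G"]  # rows are aligned sequences
    print(encode_alignment(msa, TOY_PROTOTYPES))
```

Once every alignment is reduced to such a string of letters, two alignments can be compared with ordinary sequence comparison, which is the shortcut the quote above describes.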
More information: HHblits: Lightning-fast iterative protein sequence searching by HMM-HMM alignment. M. Remmert, A. Biegert, A. Hauser, J. Söding. Nature Methods, 25.12.2011
Journal reference:
Nature Methods
Provided by Ludwig-Maximilians-Universität München