December 20, 2007

New gene prediction method capitalizes on multiple genomes

Researchers at Stanford University report in the online open access journal, Genome Biology, a new approach to computationally predicting the locations and structures of protein-coding genes in a genome. Gene finding remains an important problem in biology as scientists are still far from fully mapping the set of human genes.

Furthermore, gene maps for other vertebrates, including important model organisms such as mouse, are much more incomplete than the human annotation. The new technique, known as CONTRAST (CONditionally TRAined Search for Transcripts), works by comparing a genome of interest to the genomes of several related species.

CONTRAST exploits the fact that the functional role protein-coding genes play a specific part within a cell and are therefore subjected to characteristic evolutionary pressures. For example, mutations that alter an important part of a protein's structure are likely to be deleterious and thus selected against. On the other hand, mutations that preserve a protein's amino acid sequence are normally well tolerated. Thus, protein-coding genes can be identified by searching a genome for regions that show evidence such patterns of selection. However, learning to recognize such patterns when more than two species are compared has proved difficult.

Previous systems for gene prediction were able to effectively make use of one additional 'informant' genome. For example, when searching for human genes, taking into account information from the mouse genome led to a substantial increase in accuracy. But, no system was able to leverage additional informant genomes to improve upon state-of-the-art performance using mouse alone, although it was expected that adding informants would make patterns of selection clearer.

CONTRAST solves this problem by learning to recognize the signature of protein-coding gene selection in a fundamentally different way from previous approaches. Instead of constructing a model of sequence evolution, CONTRAST directly 'learns' which features of a genomic alignment are most useful for recognizing genes. This approach leads to overall higher levels of accuracy and is able to extract useful information from several informant sequences.

In a test on the human genome, CONTRAST exactly predicted the full structure of 59% of the genes in the test set, compared with the previous best result of 36%. Its exact exon sensitivity of 93%, compared with a previous best of 84%, translates into many thousands of exons correctly predicted by CONTRAST but missed by previous methods. Importantly, CONTRAST's accuracy using a combination of eleven informant genomes was significantly higher than its accuracy using any single informant. The substantial advance in predictive accuracy represented by CONTRAST will further efforts to complete protein-coding gene maps for human and other organisms.

Further information about existing gene-prediction methods and the advance CONTRAST brings to the field can be found in a minireview by Paul Flicek, which accompanies the article by Batzoglou and colleagues.

Source: BioMed Central

Citation: New gene prediction method capitalizes on multiple genomes (2007, December 20) retrieved 21 September 2024 from https://phys.org/news/2007-12-gene-method-capitalizes-multiple-genomes.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

New calculation approach allows more accurate predictions of how atoms ionize when impacted by high-energy electrons

0 shares

Feedback to editors

'Pirate birds' force other seabirds to regurgitate fish meals. Their thieving ways could spread lethal avian flu

3 hours ago

Even the heaviest particles experience the usual quantum weirdness, new experiment shows

3 hours ago

New method developed to relocate misplaced proteins in cells

4 hours ago

New biosensor illuminates physiological signals in living animals

4 hours ago

New tool to help decision makers navigate possible futures of the Colorado River

5 hours ago

Many people in the Pacific lack access to adequate toilets—and climate change makes things worse

5 hours ago

Saturday Citations: Football metaphors in physics; vets treat adorable baby rhino's broken leg

8 hours ago

New data science tool greatly speeds up molecular analysis of our environment

Sep 20, 2024

AI tools help uncover enzyme mechanisms for lasso peptides

Sep 20, 2024

Light momentum turns pure silicon from an indirect to a direct bandgap semiconductor

Sep 20, 2024

Load comments (0)

New gene prediction method capitalizes on multiple genomes

'Pirate birds' force other seabirds to regurgitate fish meals. Their thieving ways could spread lethal avian flu

Even the heaviest particles experience the usual quantum weirdness, new experiment shows

New method developed to relocate misplaced proteins in cells

New biosensor illuminates physiological signals in living animals

New tool to help decision makers navigate possible futures of the Colorado River

Many people in the Pacific lack access to adequate toilets—and climate change makes things worse

Saturday Citations: Football metaphors in physics; vets treat adorable baby rhino's broken leg

New data science tool greatly speeds up molecular analysis of our environment

AI tools help uncover enzyme mechanisms for lasso peptides

Light momentum turns pure silicon from an indirect to a direct bandgap semiconductor

New calculation approach allows more accurate predictions of how atoms ionize when impacted by high-energy electrons

New drone imagery reveals 97% of coral dead at a Lizard Island reef after last summer's mass bleaching

How forest fires also have an impact on lakes

Stress testing pension funds—researchers present technique based on hidden Markov regime switching model

New tool enables faster, more cost-effective genome editing of traits to improve agriculture sustainability

Leading-edge model predicts impact of river plants on flood level

'Pirate birds' force other seabirds to regurgitate fish meals. Their thieving ways could spread lethal avian flu

Even the heaviest particles experience the usual quantum weirdness, new experiment shows

Saturday Citations: Football metaphors in physics; vets treat adorable baby rhino's broken leg

New method developed to relocate misplaced proteins in cells

New biosensor illuminates physiological signals in living animals

New tool to help decision makers navigate possible futures of the Colorado River

Medical Xpress

Tech Xplore

Science X

New gene prediction method capitalizes on multiple genomes

'Pirate birds' force other seabirds to regurgitate fish meals. Their thieving ways could spread lethal avian flu

Even the heaviest particles experience the usual quantum weirdness, new experiment shows

New method developed to relocate misplaced proteins in cells

New biosensor illuminates physiological signals in living animals

New tool to help decision makers navigate possible futures of the Colorado River

Many people in the Pacific lack access to adequate toilets—and climate change makes things worse

Saturday Citations: Football metaphors in physics; vets treat adorable baby rhino's broken leg

New data science tool greatly speeds up molecular analysis of our environment

AI tools help uncover enzyme mechanisms for lasso peptides

Light momentum turns pure silicon from an indirect to a direct bandgap semiconductor

Related Stories

New calculation approach allows more accurate predictions of how atoms ionize when impacted by high-energy electrons

New drone imagery reveals 97% of coral dead at a Lizard Island reef after last summer's mass bleaching

How forest fires also have an impact on lakes

Stress testing pension funds—researchers present technique based on hidden Markov regime switching model

New tool enables faster, more cost-effective genome editing of traits to improve agriculture sustainability

Leading-edge model predicts impact of river plants on flood level

Recommended for you

'Pirate birds' force other seabirds to regurgitate fish meals. Their thieving ways could spread lethal avian flu

Even the heaviest particles experience the usual quantum weirdness, new experiment shows

Saturday Citations: Football metaphors in physics; vets treat adorable baby rhino's broken leg

New method developed to relocate misplaced proteins in cells

New biosensor illuminates physiological signals in living animals

New tool to help decision makers navigate possible futures of the Colorado River

Newsletter sign up

Donate and enjoy an ad-free experience