January 19, 2007

Learning the language of gene expression

Researchers have taken a major step towards understanding the language of gene regulation in the fruitfly Drosophila and they expect the technique to be rapidly applicable to understanding the effects of genome variation in humans.

The new research, published today in PLoS Computational Biology, is a major advance in using computers to detect the regions in DNA that control the activity of genes. Studies on single genes have shown that variation in gene regulation can be important in disease. The new program, called NestedMICA, allows researchers to find many regulatory regions, which will become a new focus for disease understanding.

The team, from the Wellcome Trust Sanger Institute and The University of Manchester, took slices of genome sequence from next to each Drosophila gene - where the highest concentration of regulatory signals are thought to lie - and fed them into the new computer program that looks for patterns shared between the sequences. The search process is similar to looking for words in a sentence where the vocabulary of the language is unknown

"Most words in the language of gene regulation can be spelled more than one way," explained Dr Thomas Down, first author on the report. "In English, you might see people writing either 'analyse' or 'analyze'. In genomes, such variation - or even bigger differences - seems to be normal.

"So we can't just count words, we need to recognize alternative spellings."

The team, which includes Dr Casey Bergman from Manchester's Faculty of Life Sciences, has so far found 120 'words' - distinct examples of regions that might regulate genes. About 30 of these were known from many years of studying how individual Drosophila genes are controlled, but most are novel. This is a major step towards understanding the language of gene regulation in an important model organism, and proof of principle of a new technology that will speed the study of regulatory elements in the human genome. Drosophila is a well-studied organism and shares 48% of its 14,000 genes with humans.

Research emerging in the past few months suggests that variation in the sequence of regulatory regions will affect susceptibility to many diseases. A few cases are already known - one form of thalassaemia is caused by a regulatory sequence variant - but knowledge of regulatory elements in the human genome is limited: scientists have only scratched the surface.

Systematic annotation of regulatory regions in the human genome will be very important if researchers are going to understand the effects of all sequence variation.

Dr Tim Hubbard, senior author on the report explained: "While others have tried to identify these control regions before, they have had to try to align lots of sequences. Our new method doesn't depend on alignment, an advantage because the new program is robust to rapidly evolving sequences.

"The new method also doesn't require prior knowledge from, say, looking at known examples, and can search for hundreds of different motifs at once."

As science should, the work makes predictions that the team is testing. Using a set of excellent, publicly available data on gene activity from the University of California-Berkeley and Lawrence Berkeley National Laboratory, they have predicted what some of the newly discovered sequences might mean in the language of gene regulation.

Computer analysis can accelerate the search for important regions in genomes, but the authors emphasize that computer predictions must always be examined experimentally. The findings in Drosophila by the new program have been validated by examining findings against results from experimental imaging.

The results of the research, a set of Drosophila sequence motifs, are freely available from a database at the Sanger Institute. Like many tools developed at the Sanger Institute, NestedMICA is open source software, freely available for anyone to download, run and modify.

Source: University of Manchester

Citation: Learning the language of gene expression (2007, January 19) retrieved 21 September 2024 from https://phys.org/news/2007-01-language-gene.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

From takeoff to flight, the wiring of a fly's nervous system is mapped

0 shares

Feedback to editors

New data science tool greatly speeds up molecular analysis of our environment

9 hours ago

AI tools help uncover enzyme mechanisms for lasso peptides

9 hours ago

Light momentum turns pure silicon from an indirect to a direct bandgap semiconductor

10 hours ago

Study reveals large ocean heat storage efficiency during the last deglaciation

10 hours ago

Citizen science collaboration yields precise data on exoplanet WASP-77 A b

10 hours ago

A possible explanation for the 'missing plastic problem': New detection technique finds microplastics in coral skeletons

11 hours ago

Genome sequence analysis identifies new driver of antimicrobial resistance

12 hours ago

Analysis of heterostructures for spintronics shows how two desired quantum-physical effects reinforce each other

12 hours ago

Evolved in the lab, found in nature: Uncovering hidden pH sensing abilities in microbial cultures

12 hours ago

Harnessing exosomes and hydrogels for advanced diabetic wound healing

12 hours ago

Load comments (0)

Learning the language of gene expression

New data science tool greatly speeds up molecular analysis of our environment

AI tools help uncover enzyme mechanisms for lasso peptides

Light momentum turns pure silicon from an indirect to a direct bandgap semiconductor

Study reveals large ocean heat storage efficiency during the last deglaciation

Citizen science collaboration yields precise data on exoplanet WASP-77 A b

A possible explanation for the 'missing plastic problem': New detection technique finds microplastics in coral skeletons

Genome sequence analysis identifies new driver of antimicrobial resistance

Analysis of heterostructures for spintronics shows how two desired quantum-physical effects reinforce each other

Evolved in the lab, found in nature: Uncovering hidden pH sensing abilities in microbial cultures

Harnessing exosomes and hydrogels for advanced diabetic wound healing

From takeoff to flight, the wiring of a fly's nervous system is mapped

The horrifying human cost of big sporting events

New drone imagery reveals 97% of coral dead at a Lizard Island reef after last summer's mass bleaching

The more medals Canadian athletes win, the fewer Canadians participate in organized sport

Industrial fleets operating in the Indian Ocean turn off monitoring systems, fail reporting obligations

Researchers improve measurement of gene expression in single cells

Mysterious Pacific Ocean sounds identified as a type of whale—a new AI app helps track them

Black garden ants modify the structure of their nests to mitigate fungal infection spread

Scientists propose a new method to search for dark matter using LIGO

Observers detect intraday variability of blazar 1ES 1426+42.8

AI tools help uncover enzyme mechanisms for lasso peptides

New data science tool greatly speeds up molecular analysis of our environment

Medical Xpress

Tech Xplore

Science X

Learning the language of gene expression

New data science tool greatly speeds up molecular analysis of our environment

AI tools help uncover enzyme mechanisms for lasso peptides

Light momentum turns pure silicon from an indirect to a direct bandgap semiconductor

Study reveals large ocean heat storage efficiency during the last deglaciation

Citizen science collaboration yields precise data on exoplanet WASP-77 A b

A possible explanation for the 'missing plastic problem': New detection technique finds microplastics in coral skeletons

Genome sequence analysis identifies new driver of antimicrobial resistance

Analysis of heterostructures for spintronics shows how two desired quantum-physical effects reinforce each other

Evolved in the lab, found in nature: Uncovering hidden pH sensing abilities in microbial cultures

Harnessing exosomes and hydrogels for advanced diabetic wound healing

Related Stories

From takeoff to flight, the wiring of a fly's nervous system is mapped

The horrifying human cost of big sporting events

New drone imagery reveals 97% of coral dead at a Lizard Island reef after last summer's mass bleaching

The more medals Canadian athletes win, the fewer Canadians participate in organized sport

Industrial fleets operating in the Indian Ocean turn off monitoring systems, fail reporting obligations

Researchers improve measurement of gene expression in single cells

Recommended for you

Mysterious Pacific Ocean sounds identified as a type of whale—a new AI app helps track them

Black garden ants modify the structure of their nests to mitigate fungal infection spread

Scientists propose a new method to search for dark matter using LIGO

Observers detect intraday variability of blazar 1ES 1426+42.8

AI tools help uncover enzyme mechanisms for lasso peptides

New data science tool greatly speeds up molecular analysis of our environment

Newsletter sign up

Donate and enjoy an ad-free experience