January 20, 2015

Harnessing data from Nature's great evolutionary experiment

There are 3 billion letters in the human genome, and scientists have endlessly debated how many of them serve a functional purpose. There are those letters that encode genes, our hereditary information, and those that provide instructions about how cells can use the genes. But those sequences are written with a comparative few of the vast number of DNA letters. Scientists have long debated how much of, or even if, the rest of our genome does anything, some going so far as to designate the part not devoted to encoding proteins as "junk DNA."

In work published today in Nature Genetics, researchers at Cold Spring Harbor Laboratory (CSHL) have developed a new computational method to identify which letters in the human genome are functionally important. Their computer program, called fitCons, harnesses the power of evolution, comparing changes in DNA letters across not just related species, but also between multiple individuals in a single species. The results provide a surprising picture of just how little of our genome has been "conserved" by Nature not only across species over eons of time, but also over the more recent time period during which humans differentiated from one another.

"In model organisms, like yeast or flies, scientists often generate mutations to determine which letters in a DNA sequence are needed for a particular gene to function," explains CSHL Professor Adam Siepel. "We can't do that with humans. But when you think about it, Nature has been doing a similar experiment on a very large scale as species evolve. Mutations occur across the genome at random, but important letters are retained by natural selection, while the rest are free to change with no adverse consequence to the organism."

It was this idea that became the basis of their analysis, but it alone wasn't enough. "Massive research consortia, like the ENCODE Project, have provided the scientific community with a trove of information about genomic function over the last few years," says Siepel. "Other groups have sequenced large numbers of humans and nonhuman primates. For the first time, these big data sets give us both a broad and exceptionally detailed picture of both biochemical activity along the genome and how DNA sequences have changed over time."

Siepel's team began by sorting ENCODE consortium data based on combinations of biochemical markers that indicate the type of activity at each position. "We didn't just use sequence patterns. ENCODE provided us with information about where along the full genome DNA is read and how it is modified with biochemical tags," says Brad Gulko, a Ph.D. student in Computer Science at Cornell University and lead author on the new paper. The combinations of these tags revealed several hundred different classes of sites within the genome each having a potentially different role in genomic activity.

The researchers then turned to their previously developed computational method, called INSIGHT, to analyze how much the sequences in these classes had varied over both short and long periods of evolutionary time. "Usually, this, kind of analysis is done comparing different species - like humans, dogs, and mice - which means researchers are looking at changes that occurred over relatively long time periods," explains Siepel. But the INSIGHT model considers the changes among dozens of human individuals and close relatives, such as the chimpanzee, which provides a picture of evolution over much shorter time frames.

The scientists found that, at most, only about 7% of the letters in the human genome are functionally important. "We were impressed with how low that number is," says Siepel. "Some analyses of the ENCODE data alone have argued that upwards of 80% of the genome is functional, but our evolutionary analysis suggests that isn't the case." He added, "other researchers have estimated that similarly small fractions of the genome have been conserved over long time evolutionary periods, but our analysis indicates that the much larger ENCODE-based estimates can't be explained by gains of new functional sequences on the human lineage. We think most of the sequences designated as 'biochemically active' by ENCODE are probably not evolutionarily important in humans."

According to Siepel, this analysis will allow researchers to isolate functionally important sequences in diseases much more rapidly. Most genome-wide studies implicate massive regions, containing tens of thousands of letters, associated with disease. "Our analysis helps to pinpoint which letters in these sequences are likely to be functional because they are both biochemically active and have been preserved by evolution." says Siepel. "This provides a powerful resource as scientists work to understand the genetic basis of disease."

More information: "A method for calculating probabilities of fitness consequences for point mutations across the human genome" appears online in Nature Genetics on January 19, 2015. The authors are: Brad Gulko, Melissa Hubisz, Ilan Gronau, and Adam Siepel. The paper can be obtained online at: dx.doi.org/10.1038/ng.3196

Journal information: Nature Genetics

Provided by Cold Spring Harbor Laboratory

Citation: Harnessing data from Nature's great evolutionary experiment (2015, January 20) retrieved 19 July 2024 from https://phys.org/news/2015-01-harnessing-nature-great-evolutionary.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Study suggests a unified model for how DNA is read, offering insight into how genes evolve

694 shares

Feedback to editors

Harnessing data from Nature's great evolutionary experiment

Early riser: The sun is already starting its next solar cycle—despite being halfway through its current one

Of ants and trees: 'Evolutionary déjà vu' in the tropical rainforest

New AI approach accelerates targeted materials discovery and sets the stage for self-driving experiments

Studies find China-based emissions of three potent climate-warming greenhouse gases have spiked in past decade

New technique to diagnose cancer metastasis uses origami nanoprobes

Q&A: Creators of first-ever hurricane evacuation order database say it may hold keys to future readiness

Study reveals key gene protecting plants from harmful metals in soil

NASA's Curiosity rover discovers a surprise in a Martian rock

Groundcherry gets genetic upgrades: Turning a garden curiosity into an agricultural powerhouse

Using AI to scrutinize and validate theories on animal evolution

Relevant PhysicsForums posts

Innovative ideas and technologies to help folks with disabilities

Understanding COVID Quarantine Guidance

New and Interesting Publications Relevant to the Origin of Life

The Cass Report (UK)

Medical tape cut off blood flow to fetus?

Is meat broth really nutritious?

Study suggests a unified model for how DNA is read, offering insight into how genes evolve

How much of your DNA is functional?

Genetic switches play big role in human evolution

Variation in expression of thousands of genes kept under tight constraint in mice, humans

8.2 percent of our DNA is 'functional'

Protein coding 'junk genes' may be linked to cancer

Groundcherry gets genetic upgrades: Turning a garden curiosity into an agricultural powerhouse

Studies explore converting wastewater to fertilizer with fungal treatment

Dynamic view of opioid receptor could refine pain relief

Gene silencing tool has a need for speed: Research provides deeper insight into RNAi tool design

Smart soil can water and feed itself

Microbes found to destroy certain 'forever chemicals' by cleaving stubborn fluorine-to-carbon bonds

Medical Xpress

Tech Xplore

Science X

Harnessing data from Nature's great evolutionary experiment

Early riser: The sun is already starting its next solar cycle—despite being halfway through its current one

Of ants and trees: 'Evolutionary déjà vu' in the tropical rainforest

New AI approach accelerates targeted materials discovery and sets the stage for self-driving experiments

Studies find China-based emissions of three potent climate-warming greenhouse gases have spiked in past decade

New technique to diagnose cancer metastasis uses origami nanoprobes

Q&A: Creators of first-ever hurricane evacuation order database say it may hold keys to future readiness

Study reveals key gene protecting plants from harmful metals in soil

NASA's Curiosity rover discovers a surprise in a Martian rock

Groundcherry gets genetic upgrades: Turning a garden curiosity into an agricultural powerhouse

Using AI to scrutinize and validate theories on animal evolution

Relevant PhysicsForums posts

Related Stories

Study suggests a unified model for how DNA is read, offering insight into how genes evolve

How much of your DNA is functional?

Genetic switches play big role in human evolution

Variation in expression of thousands of genes kept under tight constraint in mice, humans

8.2 percent of our DNA is 'functional'

Protein coding 'junk genes' may be linked to cancer

Recommended for you

Groundcherry gets genetic upgrades: Turning a garden curiosity into an agricultural powerhouse

Studies explore converting wastewater to fertilizer with fungal treatment

Dynamic view of opioid receptor could refine pain relief

Gene silencing tool has a need for speed: Research provides deeper insight into RNAi tool design

Smart soil can water and feed itself

Microbes found to destroy certain 'forever chemicals' by cleaving stubborn fluorine-to-carbon bonds

Newsletter sign up

Donate and enjoy an ad-free experience