September 4, 2008

Future of biology rests in harnessing data avalanche

(PhysOrg.com) -- Like most sciences, biology is inundated with data. However, a group of researchers warns in a Nature feature that the avalanche of biological information is at the point where the discipline may be unable to reach its full potential without improvements for curating data into on-line databases. The commentary appears in the September 4, issue of the journal and outlines specific remedies to harness the information overload.

By July 2008, data-extractors or curators had indexed over 18 million articles in PubMed and sequences of over 260,000 organisms into GenBank. Both are examples of databases where biological information is stored for public access. Data curation is very labor intensive.

“There is a lack of standardization or consistency in the way scientists report their findings in different journals,” remarked corresponding author Sue Rhee of the Carnegie Institution’s Department of Plant Biology and principal investigator of The Arabidopsis Information Resource (TAIR). “In some cases the researchers don’t even specify the species of a gene under study. That leaves biocurators, who have advance degrees in biology, and expertise with databases and scripting languages, to read the full text and transfer the essence of the information into specific fields in the database. They spend a lot of time just figuring out the basics. And that leaves a lot of room for error.”

Curation is not just a data organization tool. Such input has become essential to biological research. The authors note that eleven different databases had ¾ of a million visitors who viewed 20 million pages in just one month. And with inference programs that feed on the curated data, researchers can now tap into other work that relates to theirs and use that data in their own experiments—a huge advancement that is accelerating the pace of biology. “With this vast universe of information, the whole nature of experimentation is changing,” continued Rhee. “But the field is being held back with the curation backlog.”

The group of authors outlined a series of solutions to the problem. The first is to have authors input their data directly into databases upon acceptance in refereed journals. This step has already begun with Plant Physiology and TAIR. When a manuscript in accepted, researchers now fill in a web form about Arabidopsis genes. Second, the commentators urge the biological community to adopt standard reporting formats that are universally agreed upon. And third, curation needs to be elevated by academic institutions and funding agencies. There should also be incentives for researchers to curate their own data, such as increases in academic recognition, career advancement, and funding. They additionally suggest that “community annotation” could be modeled after large-scale astronomy projects like the Sloan Digital Sky Survey, or the Galaxy Zoo, where 80,000 astronomers and interested amateurs classified one million galaxies in less than three weeks.

“The effort and cost required to curate the data is small compared with the cost of carrying out the research in the first place, yet this additional step adds tremendously to the value of the research results to society,” commented Eva Huala, director of TAIR.

Wolf Frommer, acting director of Carnegie’s Department of Plant Biology noted that “advances in our understanding of biology will affect our food supply, our health-care system, the development of remedies for climate change, and many other aspects of daily life. Basic and applied research have to go hand in hand with curation of databases so that humanity can adapt to the quickly changing world as fast as possible.”

Provided by Carnegie Institution

Citation: Future of biology rests in harnessing data avalanche (2008, September 4) retrieved 4 May 2024 from https://phys.org/news/2008-09-future-biology-rests-harnessing-avalanche.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Researchers define new class of regulatory element in DNA

0 shares

Feedback to editors

Nanotech opens door to future of insulin medication

5 hours ago

How evolving landscapes impacted First Peoples' early migration patterns into Australia

8 hours ago

Saturday Citations: Parrots on the internet; a map of human wakefulness; the most useless rare-earth element

9 hours ago

When injecting pure spin into chiral materials, direction matters

14 hours ago

New quantum sensing scheme could lead to enhanced high-precision nanoscopic techniques

14 hours ago

Boeing's Starliner finally ready for first crewed mission

14 hours ago

Hungry, hungry white dwarfs: Solving the puzzle of stellar metal pollution

May 3, 2024

How E. coli get the power to cause urinary tract infections

May 3, 2024

Male or female? Scientists discover the genetic mechanism that determines sex development in butterflies

May 3, 2024

New study is first to use statistical physics to corroborate 1940s social balance theory

May 3, 2024

Load comments (1)

Future of biology rests in harnessing data avalanche

Nanotech opens door to future of insulin medication

How evolving landscapes impacted First Peoples' early migration patterns into Australia

Saturday Citations: Parrots on the internet; a map of human wakefulness; the most useless rare-earth element

When injecting pure spin into chiral materials, direction matters

New quantum sensing scheme could lead to enhanced high-precision nanoscopic techniques

Boeing's Starliner finally ready for first crewed mission

Hungry, hungry white dwarfs: Solving the puzzle of stellar metal pollution

How E. coli get the power to cause urinary tract infections

Male or female? Scientists discover the genetic mechanism that determines sex development in butterflies

New study is first to use statistical physics to corroborate 1940s social balance theory

Researchers define new class of regulatory element in DNA

Examining the role of social media likes in follower network evolution

Florida's garden equipment helps it lead the nation in this type of pollution

Attoscience unveils a light-matter hybrid phase in graphite reminiscent of superconductivity

Paleontologist discovers rare soft tissue in fossil of crab

New way to charge batteries harnesses the power of 'indefinite causal order'

Nanotech opens door to future of insulin medication

How evolving landscapes impacted First Peoples' early migration patterns into Australia

Saturday Citations: Parrots on the internet; a map of human wakefulness; the most useless rare-earth element

Boeing's Starliner finally ready for first crewed mission

When injecting pure spin into chiral materials, direction matters

New quantum sensing scheme could lead to enhanced high-precision nanoscopic techniques

Medical Xpress

Tech Xplore

Science X

Future of biology rests in harnessing data avalanche

Nanotech opens door to future of insulin medication

How evolving landscapes impacted First Peoples' early migration patterns into Australia

Saturday Citations: Parrots on the internet; a map of human wakefulness; the most useless rare-earth element

When injecting pure spin into chiral materials, direction matters

New quantum sensing scheme could lead to enhanced high-precision nanoscopic techniques

Boeing's Starliner finally ready for first crewed mission

Hungry, hungry white dwarfs: Solving the puzzle of stellar metal pollution

How E. coli get the power to cause urinary tract infections

Male or female? Scientists discover the genetic mechanism that determines sex development in butterflies

New study is first to use statistical physics to corroborate 1940s social balance theory

Related Stories

Researchers define new class of regulatory element in DNA

Examining the role of social media likes in follower network evolution

Florida's garden equipment helps it lead the nation in this type of pollution

Attoscience unveils a light-matter hybrid phase in graphite reminiscent of superconductivity

Paleontologist discovers rare soft tissue in fossil of crab

New way to charge batteries harnesses the power of 'indefinite causal order'

Recommended for you

Nanotech opens door to future of insulin medication

How evolving landscapes impacted First Peoples' early migration patterns into Australia

Saturday Citations: Parrots on the internet; a map of human wakefulness; the most useless rare-earth element

Boeing's Starliner finally ready for first crewed mission

When injecting pure spin into chiral materials, direction matters

New quantum sensing scheme could lead to enhanced high-precision nanoscopic techniques

Newsletter sign up

Donate and enjoy an ad-free experience