New curation tool a boon for genetic biologists

June 21, 2011

With the BeeSpace Navigator, University of Illinois researchers have created both a curation tool for genetic biologists and a new approach to searching for information.

The project was a collaboration between researchers at the Institute for Genomic and the department of computer science. Led by Bruce Schatz, professor and head of medical information science at the U. of I., the team described the software and its applications in the web server issue of the journal .

When biologists need information about a gene or its function, they turn to curators, who keep and organize vast quantities of information from academic papers and scientific studies. A curator will extract as much information as possible from the papers in his or her collection and provide the biologist with a detailed summary of what's known about the gene – its location, function, sequence, regulation and more – by placing this information into an online database such as FlyBase.

"The question was, could you make an automatic version of that, which is accurate enough to be helpful?" Schatz said.

Schatz and his team developed BeeSpace Navigator, a free online software that draws upon databases of scholarly publications. The semantic indexing to support the automatic curation used the Cloud Computing Testbed, a national computing datacenter hosted at U. of I.

While BeeSpace originally was built around literature about the bee genome, it has since been expanded to the entire Medline database and has been used to study a number of insects as well as mice, pigs and fish.

The efficiency of BeeSpace Navigator is in its specific searches. A broad, general search of all known data would yield a chaotic myriad of results – the millions of hits generated by a Google search, for example. But with BeeSpace, users create "spaces," or special collections of literature to search. It also can take a large collection of articles on a topic and automatically partition it into subsets based on which words occur together, a function called clustering.

"The first thing you have to do if you have something that's simulating a curator is to decide what papers it's going to look at," Schatz said. "Then you have to decide what to extract from the text, and then what you're going to do with what you've extracted, what service you're going to provide. The system is designed to have easy ways of doing that."

The user-friendly interface allows biologists to build a unique space in a few simple steps, utilizing sub-searches and filters. For example, an entomologist interested in the genetic basis for foraging as a social behavior in bees would start with insect literature, then zero in on that are associated in literature with both foraging and social behavior – a specific intersection of topics that typical search engines could not handle.

This type of directed data navigation has several advantages. It is much more directed than a simple search, but able to process much more data than a human curator. It can also be used in fields where there are no human curators, since only the most-studied animals like mice and flies have their own professional curators.

Schatz and his team equipped the navigator to perform several tasks that biologists often perform when trying to interpret gene function. Not only does the program summarize a gene, as a curator would, but it also can perform analysis to extrapolate functions from literature.

For example, a study will show that a gene controls a particular chemical, and another study will show that chemical plays a role in a certain behavior, so the software makes the link that the gene could, in part, control that behavior.

BeeSpace can also perform vocabulary switching, an automatic translation across species or behaviors. For example, if it is known that a specific gene in a honeybee is analogous to another gene in a fruit fly, but the function of that gene has been documented in much more detail in a fruit fly, the navigator can make the connection and show a bee scientist information on the fly gene that may be helpful.

"The main point of the project is automatically finding out what genes do that don't have known function," Schatz said. "If a biologist is trying to figure out what these genes do, they're happy with anything. They want to get as much information as possible."

More information: The paper, "BeeSpace Navigator: Exploratory Analysis of Gene Function Using Semantic Indexing of Biological Literature," is available online at http://nar.oxfordj … 285.abstract

Provided by University of Illinois at Urbana-Champaign search and more info website


Rank not rated yet
Relevant PhysicsForums posts

More news stories

Scientist: Evolution debate will soon be history

(AP) -- Richard Leakey predicts skepticism over evolution will soon be history. Not that the avowed atheist has any doubts himself.

Biology / Evolution

created 15 hours ago | popularity 3.4 / 5 (16) | comments 46

More plant species responding to global warming than previously thought

(Phys.org) -- Far more wild plant species may be responding to global warming than previous large-scale estimates have suggested.

Biology / Ecology

created May 22, 2012 | popularity 4.6 / 5 (14) | comments 18 | with audio podcast

Thousands of shellfish found dead in Peru

Thousands of crustaceans were found dead off the coast of Lima following the mystery mass death of dolphins and pelicans, the Peruvian Navy said Friday.

Biology / Ecology

created May 26, 2012 | popularity 4.8 / 5 (4) | comments 7

For monogamous sparrows, it doesn't pay to stray (but they do it anyway)

It's quite common for a female song sparrow to stray from her breeding partner and mate with the male next door, but a new study shows that sleeping around can be costly.

Biology / Plants & Animals

created May 22, 2012 | popularity 5 / 5 (1) | comments 7 | with audio podcast

Study uncovers secret to speedy burrowing by razor clams

(Phys.org) -- If you look at a razor burrowing clam sitting in a bucket, you’d never guess that it could burrow itself down into the soil, much less do it with any speed. Razor clams look like fat straws, ...

Biology / Plants & Animals

created May 25, 2012 | popularity 1 / 5 (1) | comments 3 | with audio podcast report


Nvidia trumpets Tegra 3 phone design wins for 2012

(Phys.org) -- Nvidia’s competitive war paint has a name, Tegra 3. On the heels of Nvidia announcements about lowering costs of its Tegra 3 processors and Nvidia-enabled tablets running Android Ice Cream ...

Browser wars flare in mobile space

The browser wars are heating up again, but this time the fight is for dominance of the mobile Internet.

Dell tablet leak: 10.1-inch display, two-battery choice

(Phys.org) -- Headline after headline talks about vendors’ tablets in the wings as likely number-one contenders for the iPad. Such claims have justifiably been taken with a grain of salt, considering ...

Keep food safety in mind this memorial day weekend

(HealthDay) -- Picnics, parades and cookouts are as much a part of Memorial Day weekend as tributes to the United States' war veterans.

Social welfare cuts ultimately come with heavy price, researchers say

(Phys.org) -- Slashing government funding for Medicaid, food stamps and other programs that serve the poor – while politically popular with some lawmakers and many conservatives – may do more harm ...

Is a classical electrodynamics law incompatible with special relativity?

(Phys.org) -- The laws of classical electromagnetism that were developed in the 19th century are the same laws that scientists use today. They include Maxwell’s four equations along with the Lorentz la ...