Locating patterns in human proteins
April 26, 2011 By: Colin Poitras
A genetic motif. Credit: Sanguthevar Rajasekaran
A national research team, led by University of Connecticut engineering professor Sanguthevar Rajasekaran, is developing a new generation of exact algorithms that will help biologists locate patterns in human proteins and DNA. The work could eventually lead to new medicines to help fight disease.
The research is supported by a $1.5 million, four-year grant from the National Institutes of Health that will allow the scientists to develop novel algorithms that can be used to analyze genomes for complex, biologically-relevant patterns called motifs.
Genetic analysis and other approaches have identified many mutations, often found in protein coding regions, that are associated with inherited human disease, says Rajasekaran, principal investigator for the grant. If we can identify a drug that interferes with the protein containing the mutation, we can devise effective treatments. Analysis of protein and DNA sequences is an important approach for predicting protein function, and therefore an important part of the pipeline in drug discovery.
Rajasekaran, UTC Chair Professor in the Department of Computer Science and Engineering, is joined by Reda A. Ammar, professor and head of the computer science and engineering department, on the research team. Others involved in the initiative are Sartaj Sahni, a professor at the University of Florida, and Martin Schiller, an associate professor at the University of Nevada-Las Vegas.
Rajasekaran, who received his Ph.D. from Harvard, is considered an international expert in the field of applied algorithms. He holds nine U.S. patents, and in 2010 was inducted as a Fellow of the American Association for the Advancement of Science, an international non-profit organization dedicated to advancing science around the world.
An online motif search.
Supercomputers and efficient algorithms have become crucial tools for biologists trying to sort through the vast amount of data generated by the Human Genome Project, in which researchers set out to identify the approximately 20,000 to 25,000 genes of the human genome and determine the sequences of the three billion chemical base pairs that make up human DNA.Finding patterns in genomes that are repeated over many sequences and possibly over many species is one way of identifying potentially useful information. For instance, if a particular motif is found in a protein that is believed to repress a certain disease trait, and researchers go on to discover a mutation of that motif in individuals with the disease, then drugs can be developed that may be able to repress the disease in those individuals and help them lead healthier lives.
But existing algorithms used in this kind of research tend to be complicated and take up large amounts of computing time and memory, which can be problematic for research teams with limited resources. Using the currently best known algorithms, identifying motifs of length 27 can take more than a month on a regular PC, Rajasekaran says. Identifying motifs of length 31 or more can take more than 5 years. Biologists would benefit greatly from algorithms that can find these long and complex motifs quickly and reliably. The longer and more complex the identified motif, the greater its usefulness and the less likely that comparative matching will lead to false positives.
Our role is to make things faster and more efficient while running in real time, says Ammar. We are building tools that need to be friendly and easy to use for a non-technical person. In the past, biologists relied solely on experiments in the lab; now they can turn to the computer, which is faster and can give those results in just minutes.
As part of their earlier work in this area, members of the research team created a web tool called the Minimotif Miner to search for motifs. The tool is now used by biologists worldwide.
With the new grant, the team will develop a web-based system incorporating three variations of the problem: Planted Motif Search, Edit-distance Motif Search, and Simple Motif Search. Rajasekaran says the new algorithms will help biologists find highly reliable short strains of genomic sequences among the huge number of possible strains available. It is akin to directing scientists to key shelves in a library full of millions of books.
One can look at the entire genomic sequences of healthy individuals and compare them to those with cancer. There could be millions of differences, because those genomes are so huge, Rajasekaran says. Thats why we target mutations in motifs, because they are very fundamental and instrumental in protein-protein interactions.
Sahni, distinguished professor and chair of the computer and information science and engineering department at the University of Florida, is excited about developing sequential and parallel algorithms for motif search as a member of the team. He hopes the research will improve the well-being of society at large.
Schiller, a biologist and bioinformatician who was a co-developer of the Minimotif Miner system when he was an associate professor at the UConn Health Center from 2000 to 2009, likens the algorithm and motif research to scientists trying to understand hieroglyphics.
We have all this information that appears to us as symbols but we dont have the cipher key, the Rosetta Stone, says Schiller. Were trying to put things in some order to extract meaningful information. By doing a pattern search, we are pulling the rules of life out of the genome and finding out what it means.
Provided by
University of Connecticut
-
From lemons to lemonade: Reaction uses carbon dioxide to make carbon-based semiconductor,
32 comments
-
Thioridazine kills cancer stem cells in human while avoiding toxic side-effects of conventional cancer treatments,
3 comments
-
SpaceX private rocket blasts off for space station (Update),
42 comments
-
Climate scientists say they have solved riddle of rising sea,
30 comments
-
Research team claims to have found evidence Lake Cheko is impact crater for Tunguska Event,
18 comments
-
What would stain as translucent on light-coloured fabric?
11 hours ago
-
How do I identify different bacteria on culture plates?
21 hours ago
-
Why Do Dogs do Strange things...
May 25, 2012
-
What does exophillic and endophillic mean in terms of mosquito and their control?
May 24, 2012
-
Semen stains glows under black lights (uv light)?
May 23, 2012
-
Question on Human Chromosome 2
May 23, 2012
- More from Physics Forums - Biology
More news stories
Scientist: Evolution debate will soon be history
(AP) -- Richard Leakey predicts skepticism over evolution will soon be history. Not that the avowed atheist has any doubts himself.
10 hours ago |
3.5 / 5 (11) |
23
Thousands of shellfish found dead in Peru
Thousands of crustaceans were found dead off the coast of Lima following the mystery mass death of dolphins and pelicans, the Peruvian Navy said Friday.
20 hours ago |
4.8 / 5 (4) |
6
More plant species responding to global warming than previously thought
(Phys.org) -- Far more wild plant species may be responding to global warming than previous large-scale estimates have suggested.
May 22, 2012 |
4.6 / 5 (14) |
18
|
Totally rad: Scientists create rewritable digital data storage in DNA
(Phys.org) -- Scientists from Stanford's Department of Bioengineering have devised a method for repeatedly encoding, storing and erasing digital data within the DNA of living cells.
May 21, 2012 |
4.9 / 5 (17) |
11
|
For monogamous sparrows, it doesn't pay to stray (but they do it anyway)
It's quite common for a female song sparrow to stray from her breeding partner and mate with the male next door, but a new study shows that sleeping around can be costly.
May 22, 2012 |
5 / 5 (1) |
7
|
Dell tablet leak: 10.1-inch display, two-battery choice
(Phys.org) -- Headline after headline talks about vendors tablets in the wings as likely number-one contenders for the iPad. Such claims have justifiably been taken with a grain of salt, considering ...
SpotterRF debuts Radar Backpack Kit (w/ Video)
(Phys.org) -- SpotterRF has announced a special radar backpack kit designed to enhance situational awareness for soldiers on the ground. The company says its special radar is designed for warfighters as part ...
SpaceX capsule has 'new car' smell, astronauts say (Update)
SpaceX's Dragon cargo vessel smells like a new car, said astronauts at the International Space Station after opening the hatches Saturday following the spacecraft's landmark mission to the orbiting lab.
Astronomers seize last chance in lifetime for Venus Transit
Astronomers are gearing for one the rarest events in the Solar System: an alignment of Earth, Venus and the Sun that will not be seen for another 105 years.
Australia hails surprise super-telescope decision
Australia has hailed a surprise decision giving it a role in a radio telescope project aimed at revolutionising astronomy, vowing to draw on its decades of experience in space science.
Keep food safety in mind this memorial day weekend
(HealthDay) -- Picnics, parades and cookouts are as much a part of Memorial Day weekend as tributes to the United States' war veterans.
