February 24, 2013

Taking the gamble out of DNA sequencing

Two USC scientists have developed an algorithm that could help make DNA sequencing affordable enough for clinics – and could be useful to researchers of all stripes.

Andrew Smith, a computational biologist at the USC Dornsife College of Letters, Arts and Sciences, developed the algorithm with USC graduate student Timothy Daley to help predict the value of sequencing more DNA. Their work will be published in Nature Methods on February 24.

Extracting information from DNA means deciding how much to sequence: sequence too little and you may not get the answers you are looking for, but sequence too much and you will waste both time and money. That expensive gamble is a big part of what keeps DNA sequencing out of the hands of clinicians. But not for long, according to Smith.

"It seems likely that some clinical applications of DNA sequencing will become routine in the next five to 10 years," Smith said. "For example, diagnostic sequencing to understand the properties of a tumor will be much more effective if the right mathematical methods are in place."

The beauty of Smith and Daley's algorithm, which predicts the size and composition of an unseen population based on a small sample, lies in its broad applicability.

"This is one of those great instances where a specific challenge in our research led us to uncover a powerful algorithm that has surprisingly broad applications," Smith said.

Think of it: how often do scientists need to predict what they haven't seen based on what they have? Public health officials could use the algorithm to estimate the population of HIV-positive individuals; astronomers could use it to determine how many exoplanets exist in our galaxy based on the ones they have already discovered; and biologists could use it to estimate the diversity of antibodies in an individual.

The mathematical underpinnings of the algorithm rely on a model of sampling from ecology known as capture-recapture. In this model, individuals are captured and tagged so that a recapture of the same individual will be known – and the number of times each individual was captured can be used to make inferences about the population as a whole.

In this way scientists can estimate, for example, the number of gorillas remaining in the wild. In DNA sequencing, the individuals are the various different genomic molecules in a sample. However, the mathematical models used for counting gorillas don't work on the scale of DNA sequencing.
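To make the capture-recapture idea concrete, here is a minimal sketch of one classic textbook estimator of this kind, known as Chao1. It is not Smith and Daley's algorithm, which the article does not describe in detail; it only illustrates how the number of times each individual was captured can hint at how many individuals were never seen. The function name and the sample data are invented for illustration.

```python
from collections import Counter


def chao1_estimate(capture_counts):
    """Estimate total population size from per-individual capture counts.

    capture_counts holds one positive integer per observed individual:
    the number of times that individual was captured.
    Illustrative only: this is the classic Chao1 lower-bound estimator,
    not the method described in the article.
    """
    # freq[k] = number of individuals captured exactly k times
    freq = Counter(capture_counts)
    observed = len(capture_counts)
    f1 = freq.get(1, 0)  # individuals seen exactly once
    f2 = freq.get(2, 0)  # individuals seen exactly twice

    if f2 > 0:
        unseen = f1 * f1 / (2 * f2)
    else:
        # Bias-corrected form, used when nothing was seen exactly twice
        unseen = f1 * (f1 - 1) / 2
    return observed + unseen


# Hypothetical field log: each entry is the tag of a captured individual.
captures = ["A", "B", "A", "C", "D", "B", "E", "F", "A", "G"]
counts = list(Counter(captures).values())
estimate = chao1_estimate(counts)
print(f"Observed {len(counts)} individuals, estimated total {estimate}")
```

Many individuals seen only once suggests that many more were never seen at all, which is why the estimate here is well above the observed count. In the sequencing setting, the tags would correspond to distinct DNA molecules rather than animals.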

"The basic model has been known for decades, but the way it has been used makes it highly unstable in most applications. We took a different approach that depends on lots of computing power and seems to work best in large-scale applications like modern DNA sequencing," Daley said.

Scientists faced a similar problem in the early days of the human genome sequencing project. Michael Waterman of USC provided a mathematical solution in 1988 that found widespread use. Recent advances in sequencing technology, however, require thinking differently about the mathematical properties of DNA sequencing data.

"Huge data sets required a novel approach. I'm very pleased it was developed here at USC," said Waterman.

Journal information: Nature Methods

Provided by University of Southern California

Citation: Taking the gamble out of DNA sequencing (2013, February 24) retrieved 12 July 2024 from https://phys.org/news/2013-02-gamble-dna-sequencing.html

