Analyzing complex plant genomes with the newest next-generation DNA sequencing techniques
Genomes are catalogs of hereditary information that determine whether an organism becomes a plant, animal, fungus or microbe, and whether the organism is adapted to its surroundings. Determining the sequence of DNA within genomes is crucial to human medicine, crop genetics, biotechnology, forensic science, threatened species management, and evolutionary studies. The last 5 years have witnessed tremendous advances in DNA sequencing technologies, and it is now possible to sequence millions of fragments of DNA in a single analysis, and at a fraction of their previous cost. These "next-generation" methods are spurring a revolution in plant biology by providing powerful tools to examine previously-unimagined questions, in any plant of interest.
Richard Cronn and colleagues (from the USDA Forest Service, Oregon State University, Brigham Young University, and Linfield College) have published an overview of newly developed, up-and-coming DNA sequencing techniques as one of a series of articles in a Special Issue on Methods and Applications of Next-Generation Sequencing in Botany in the American Journal of Botany. In their article, Cronn and co-authors summarize "targeted enrichment" strategies that can be used to obtain specific DNA sequences from complex plant genomes. Articles in the Special Issue provide a detailed snapshot of how "next-generation" sequencing is transforming plant biology.
"Plant genomes range from simple to exceptionally complex," noted Cronn. "Combining next-generation sequencing with targeted enrichment allows plant scientists to reduce the complexity of plant genomes and focus on specific genes or unique regions that are easy to analyze."
Modern next-generation sequencers are capable of sequencing all nucleotides (the G, A, T, and C components) contained within simple genomes, and from the genomes of well-studied organisms like humans. Direct genome sequencing is less tractable for many plants, whose simple appearance belies their genomic complexity. As complex as we are, humans possess about as many genes (22,000) as diminutive mosses, and less than half the number of genes found in alfalfa or apple. Moreover, our genome of 3.2 billion nucleotides is dwarfed by many plants, such as pines and coastal redwood (which are 10 times larger). Finally, plants have an additional genome within each cell that encodes the genes required to carry out photosynthesis inside the chloroplast.
"At the moment, these sequencers are less than optimal for comparative plant biologists who focus on select regions of the genome from hundreds of samples," adds David Spooner, one of the editors for the special issue.
The enrichment methods described by the authors fill this much-needed gap, and are based on time-tested methodologies, including: (1) PCR-based enrichment, (2) hybridization-based enrichment, (3) restriction enzyme-based enrichment, and (4) enrichment of expressed gene sequences. The advantages and disadvantages of each technique are explored, and suggestions are offered for their optimal use.
In their study, the authors conclude that traditional PCR-based methods offer a cost-effective strategy for accessing small genomic targets in the range of 50,000 nucleotides or less. Larger genomic targets are most efficiently enriched using hybridization- and transcriptome-based methods. The article is accompanied by open-access electronic appendices that provide example protocols for hybridization-enrichment techniques, and detailed cost estimates for each targeted sequencing method.
"The use of these methods is sure to hasten the pace of discovery in all aspects of plant biologycrop production, ecosystem health, and our understanding of plant diversity in the present and the distant evolutionary past," adds Cronn.