Bacteria Are Better Gene Packers Than We Thought

February 12, 2010
Organization of ten novel open reading frames (ORFs) overlapping predicted genes in P. fluorescens Pf0-1. The novel ORFs are colored red and indicated by “nov” (or n7 for nov7), while predicted Pf0-1 genes are colored blue and labeled with the Pfl01 number of the locus tag corresponding to each in the Pf0-1 GenBank entry. Three forward (numbered 1-3) and three reverse (numbered 4-6) reading frames are shown. Parallel diagonal lines indicate that the complete ORF is not shown to scale.

In microbial genomes, genes are typically depicted as linear series of separate regulatory and coding regions. This leads to the assumption that annotations done by computer to predict such arrangements completely describe the coding capacity of bacterial genomes.

However, more complex organisms such as plants and animals pack their genes into their DNA very densely. One common packing trick is to code genes on both strands of the DNA, allowing the genes to overlap along the chromosome. Bacterial genes previously shown to reside on the second, or anti-sense, strand overlap just a little -- a couple dozen DNA bases, for example. Fewer than a handful of genes overlap completely.

To test whether bacteria might be packing genes more tightly than this, scientists at Pacific Northwest National Laboratory and Tufts University compared the proteins made by a bacterial species with what is known about its genome. They chose Pseudomonas fluorescens, Gram-negative rod-shaped bacteria that inhabit soil, plants, and water surfaces.

The researchers analyzed these proteins at the U.S. Department of Energy's EMSL, a national scientific user facility at PNNL, using ultra-high-pressure reversed-phase high-performance liquid chromatography coupled to a mass spectrometer.

Using all the information from a 6-frame translation of the bacterial genome, the team identified as many proteins made by P. fluorescens as their instruments would allow and deduced the genes needed to create those proteins. Then they compared the deduced genes to the genes found by the annotation of the genome.
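To make the idea of a 6-frame translation concrete, here is a minimal Python sketch. It is an illustration only, not the team's pipeline: the function names and the short example sequence are assumptions, and frame numbering follows the figure caption (frames 1-3 forward, 4-6 reverse). Each DNA strand is read in three offsets, giving six candidate protein sequences against which detected peptides could be matched.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def reverse_complement(seq):
    """Return the sequence of the opposite (anti-sense) strand, read 5' to 3'."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate complete codons only; unknown codons (e.g. containing N) become X."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translation(dna):
    """Map frame number (1-3 forward, 4-6 reverse) to its translated sequence."""
    dna = dna.upper()
    rc = reverse_complement(dna)
    frames = {}
    for offset in range(3):
        frames[offset + 1] = translate(dna[offset:])
        frames[offset + 4] = translate(rc[offset:])
    return frames


if __name__ == "__main__":
    # Hypothetical toy sequence, not taken from the Pf0-1 genome.
    for frame, protein in six_frame_translation("ATGGCCTAA").items():
        print(frame, protein)
```

In a real proteomics search, the translated frames would be cut at stop codons and searched against mass spectra; that step is omitted here.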

The team subsequently analyzed coding sequences, reading frames, and comparative alignments and found 16 genes for proteins not previously mapped to the P. fluorescens genome. The researchers found nine previously unknown genes coded on the anti-sense strand of DNA. Unlike other anti-sense genes found in bacteria, however, these genes overlapped other ones on the sense strand completely or nearly so. This suggests that researchers have underestimated how often bacteria pack genes by overlapping them.
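As a rough illustration of what "overlapped completely or nearly so" means in terms of coordinates, the following Python sketch computes how much of a gene on one strand is covered by a gene on the opposite strand. The `Gene` class, the 0.9 threshold, and all coordinates are hypothetical and chosen only for the example; they are not taken from the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gene:
    name: str
    start: int   # 1-based, inclusive
    end: int     # 1-based, inclusive
    strand: str  # "+" (sense) or "-" (anti-sense)


def overlap_fraction(query, other):
    """Fraction of the query gene's length that lies inside the other gene."""
    shared = min(query.end, other.end) - max(query.start, other.start) + 1
    return max(shared, 0) / (query.end - query.start + 1)


def antisense_overlaps(novel, annotated, min_fraction=0.9):
    """List (novel, annotated, fraction) for opposite-strand pairs that overlap
    by at least min_fraction of the novel gene's length."""
    hits = []
    for n in novel:
        for a in annotated:
            fraction = overlap_fraction(n, a)
            if n.strand != a.strand and fraction >= min_fraction:
                hits.append((n.name, a.name, fraction))
    return hits


if __name__ == "__main__":
    # Hypothetical coordinates for illustration only.
    annotated = [Gene("annotated_example", 100, 1000, "+")]
    novel = [
        Gene("fully_embedded", 200, 499, "-"),
        Gene("partly_overlapping", 50, 149, "-"),
    ]
    print(antisense_overlaps(novel, annotated))
```

With these made-up coordinates, only the fully embedded gene passes the threshold; the second gene shares half its length with the annotated gene and is filtered out.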

The results suggest that the cues researchers use to identify genes by sequence in a stretch of DNA have not all been identified. The 16 newly identified genes improve the quality of the Pf0-1 genome annotation, and the detection of anti-sense protein-coding genes points to an underappreciated complexity of genome organization.

The results show that tools currently used to identify the complete set of genes and proteins in organisms, especially bacteria, are insufficient. But work such as this will lead to a more comprehensive understanding of how the genomic blueprint within bacteria translates into functioning proteins that converge into a living organism. Such an understanding could also lead to insights in evolutionary biology.

This work was supported by DOE's Office of Biological and Environmental Research's Genomics Science Program. The research team includes Kim Wook, Mark Silby, Julie Nicoll, and Stuart Levy, Tufts; and Sam Purvine, Kim Hixson, Matt Monroe, Carrie Nicora, and Mary Lipton, PNNL.

More information: Wook K, MW Silby, SO Purvine, JS Nicoll, KK Hixson, ME Monroe, CD Nicora, MS Lipton, and SB Levy. 2009. "Proteomic Detection of Non-Annotated Protein-Coding Genes in Pseudomonas fluorescens Pf0-1," PLoS ONE 4(12):e8455. doi:10.1371/journal.pone.0008455
