Genome assembly in the spotlight

Jul 23, 2013

The largest systematic assessment of the process of genome assembly is published today in GigaScience, the open access journal of BGI and BioMed Central. The second Assemblathon competition saw 21 teams submit 43 entries based on data from three unassembled genomes (a bird, a fish, and a snake) sequenced using three different technologies. BGI participated in the competition with its SOAPdenovo team, and also provided sequencing data for the bird genome. The paper outlines ten key metrics, drawn from over 100 different measures for each assembly, each focusing on a different aspect of an assembly's quality.

The research came to publication via an unusual process. The Assemblathon 2 paper is available on a preprint server, and the named reviewers have blogged and commented on their reviews of it. Since the data was in the public domain and the authors enjoyed the discussion, GigaScience's editors encouraged open discussion of the peer review of this article.

With a new species announced almost daily, genomics is getting faster and cheaper all the time. Piecing together genomes from raw sequencing data to produce high-quality finished genome sequences without the aid of a previously assembled reference is still technically challenging and requires a huge amount of resources, yet it is performed by more and more labs around the world. With new sequencing tools appearing every month, and nearly limitless ways of carrying out this complex process, it is not clear which method of piecing a genome together is best. The Assemblathon is a set of periodic competitions aiming to address this issue and help improve how genomics is carried out.

The logistics of carrying out such a large competition were challenging: large volumes of test and entry data were hosted by supercomputing centers and mirrored in the cloud, and automated scripts calculated and presented the many results. Reviewing the paper was equally challenging and novel. Everyone embraced GigaScience's open and transparent review process, with authors and reviewers tweeting and posting comments online and in blogs while the review was under way. The results of this real-time, open peer review are available to view on the Assemblathon website, with the signed reviewer reports and history also archived and viewable alongside the article. To boost reproducibility, the supporting data and 27 GB of entries are hosted in the GigaScience GigaDB database and in the NCBI SRA database.


More information: GigaScience 2013, 2:10. DOI: 10.1186/2047-217X-2-10
