Team presents draft assembly of the Norway spruce genome sequence

May 22, 2013
This image shows, from left: Ove Nilsson, Pär Ingvarsson and Stefan Jansson, Umeå University. Credit: Johan Gunséus/Synk

Swedish scientists have mapped the genome sequence of Norway spruce (the Christmas tree), a species of huge economic and ecological importance. It is the largest genome mapped to date: complex, and seven times larger than that of humans. The results have been published in the journal Nature.

This major research project was led by Umeå Plant Science Centre (UPSC) in Umeå and the Science for Life Laboratory (SciLifeLab) in Stockholm.

In addition to its scientific interest, this new knowledge has immense importance to the forestry industry in many countries.

"Forest tree breeding is now entering a new era, and Sweden has the potential to be in the forefront of development," says Professor Ove Nilsson from UPSC. "Newer and more effective methods can begin to be used to ensure that the over 200 million tree seedlings planted each year in Sweden are as strong, healthy and well-adapted as possible for both poor and rich soil areas in different parts of the country."

The scientists have identified about 29,000 functional genes, marginally more than humans have, which raises the question: why is the spruce genome still seven times larger than ours? According to the study, one explanation is "genome obesity" caused by extensive repetitive DNA sequences, which have accumulated over several hundred million years of evolution. Other plant and animal species have efficient mechanisms to eliminate such repetitive DNA, but these do not seem to operate so well in conifers.

"It is remarkable that the spruce is doing so well despite this unnecessary genetic load," says Professor Pär Ingvarsson at UPSC. "Of course, some of this DNA has a function but it seems strange that it would be beneficial to have so very much. This appears to be something special for conifers."

The greatest challenge in the project has not been obtaining the actual DNA sequences, but getting the approximately 20 billion "letters" found in the spruce's genome into the correct order.

"Imagine a library with ten thousand books as thick as the Bible, written in a language with only four letters," explains Professor Stefan Jansson at UPSC. "If someone took one hundred identical copies of each of the ten thousand titles, passed them all through a document shredder and mixed all the shreds, and you then were asked to piece together an accurate copy of each title, you can realize that it can be a bit problematic."
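
The shredder analogy can be made concrete with a toy example. The Python sketch below is purely illustrative and is not the software used in the spruce project: it pieces short overlapping fragments back together by repeatedly merging the two with the longest shared end, a simple "greedy overlap" approach. The function names, the example fragments and the minimum overlap of three letters are all assumptions chosen for the illustration.

```python
from __future__ import annotations


def overlap(a: str, b: str, min_len: int) -> int:
    """Length of the longest suffix of `a` that is also a prefix of `b`.

    Returns 0 if no overlap of at least `min_len` letters exists.
    """
    for n in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:n]):
            return n
    return 0


def greedy_assemble(fragments: list[str], min_len: int = 3) -> list[str]:
    """Toy assembler (hypothetical helper): merge the best-overlapping pair until none remain."""
    # Drop exact duplicates and fragments wholly contained in another fragment.
    frags = list(dict.fromkeys(fragments))
    frags = [f for f in frags if not any(f != g and f in g for g in frags)]

    while True:
        best_len, best_i, best_j = 0, -1, -1
        # Compare every ordered pair of shreds and remember the longest overlap.
        for i, a in enumerate(frags):
            for j, b in enumerate(frags):
                if i != j:
                    n = overlap(a, b, min_len)
                    if n > best_len:
                        best_len, best_i, best_j = n, i, j
        if best_len == 0:
            # Nothing left to join: return whatever pieces remain.
            return frags
        # Glue the two shreds together, writing the shared letters only once.
        merged = frags[best_i] + frags[best_j][best_len:]
        frags = [f for k, f in enumerate(frags) if k not in (best_i, best_j)]
        frags.append(merged)


if __name__ == "__main__":
    shreds = ["ACGTTAGC", "ATGGCGTA", "GCGTACGT"]  # made-up fragments
    print(greedy_assemble(shreds))
```

Every pair of fragments is compared on each pass, which is manageable for three shreds but not for billions. Repeated stretches of text also make many shreds look as though they fit together in more than one place, so a naive approach of this kind breaks down on a genome as large and repetitive as that of the spruce.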

"We had to customise computers and rewrite many of the computer programmes used in similar studies in order to handle the large amount of DNA sequences," says Professor Joakim Lundeberg from SciLifeLab. The national data storage system was stretched to the limit, and many other practical problems had to be solved along the way to see the project through.

"But the timing was optimal; when the new DNA sequencing machines were unpacked at our newly established laboratory, DNA arrived from our model spruce tree. By sequencing and analysing the largest genome in the world so far, we have shown that SciLifeLab has both technical and scientific capacity for research at the highest international level," concludes Joakim Lundeberg.

More information: Nystedt, B. et al. The Norway spruce genome sequence and conifer genome evolution. Nature. dx.doi.org/10.1038/nature12211
