Technologies for analyzing gene expression at the genomic scale

Apr 18, 2014
Figure 1: The ZENBU system makes it easier to browse very large libraries of genomic data. Credit: Alistair Forrest, RIKEN Center for Life Science Technologies

The emergence of technologies for analyzing gene expression at the genomic scale has required parallel efforts to develop software that make sense of the data. Such 'browser' tools provide scientists with a visual atlas of the thousands of genes that are switched on and off in a given experiment. However, these tools become increasingly unwieldy as the studies grow larger. A team of researchers led by Alistair Forrest and Jessica Severin from the RIKEN Center for Life Science Technologies have now developed software that can efficiently handle far greater volumes of data.

The FANTOM5 project has the ambitious goal of mapping the circuits that regulate gene activity throughout the based on comparative analysis of numerous datasets from hundreds of cell types. "With existing genome browsers, you basically needed to create a separate 'track', or horizontal visual representation, for each experiment," explains Forrest. "We were looking at thousands of experiments and needed some way of visualizing the data without ending up with a web page that stretched for tens of meters!"

Forrest, Severin and other FANTOM5 colleagues worked to produce ZENBU, a system that pools large numbers of individual datasets into a single track. As incorporating each dataset individually would slow analysis to a crawl, the researchers employed computational methods that simultaneously access information from numerous experiments.

"Only a few elements of data from this pooled track are in memory at any moment in time, which allows us to process and analyze the data in an interactive manner in near-real-time," says Severin. These datasets are all 'linked' to the single genome track in a way that enables users to easily zoom in on and analyze activity at specific genetic loci of interest across every single experiment.

FANTOM5 scientists use techniques ranging from RNA transcript sequencing to determination of the binding patterns of transcription factors to specific chromosomal sequences. ZENBU is preconfigured to immediately interpret these various experimental formats without any additional tinkering. As such, although specifically designed for FANTOM5, ZENBU stands to benefit the broader genome research community by bringing unprecedented speed and ease-of-use to the interpretation process.

Forrest and Severin envision a general ecosystem for collective data sharing and analysis. "Every site will have access to all the data in the ZENBU federation," says Severin. "Each location can focus on the curation of their data, but every other ZENBU site can also access it remotely as if it was loaded locally."

Explore further: New bioinformatics tool to visualize transcriptomes

More information: Severin, J., Lizio, M., Harshbarger, J., Kawaji, H., Daub, C. O., Hayashizaki, Y., The FANTOM Consortium, Bertin, N. & Forrest, A. R. R. "Interactive visualization and analysis of large-scale sequencing datasets using ZENBU." Nature Biotechnology 32, 217–219 (2014). DOI: 10.1038/nbt.2840

add to favorites email to friend print save as pdf

Related Stories

New bioinformatics tool to visualize transcriptomes

Mar 09, 2014

ZENBU, a new, freely available bioinformatics tool developed at the RIKEN Center for Life Science Technology in Japan, enables researchers to quickly and easily integrate, visualize and compare large amounts of genomic information ...

Researchers present comprehensive 'roadmap' of blood cells

Mar 26, 2014

Research published online today in Blood, the Journal of the American Society of Hematology, presents an unprecedented look at five unique blood cells in the human body, pinpointing the location of key genetic regulators in the ...

NIH launches first phase of microbiome cloud project

Sep 26, 2013

The National Institutes of Health (NIH) has launched the first phase of the Microbiome Cloud Project (MCP), a collaboration with Amazon Web Services that aims to improve access to and analysis of data from the Human Microbiome P ...

Recommended for you

Iberian pig genome remains unchanged after five centuries

13 hours ago

A team of Spanish researchers have obtained the first partial genome sequence of an ancient pig. Extracted from a sixteenth century pig found at the site of the Montsoriu Castle in Girona, the data obtained indicates that ...

New concepts based on advances in animal systematics

16 hours ago

The way in which most multicellular organisms have been classified has been the same for more than a century. Only recently have scientists developed the tools and knowledge to question the way we classify ...

New dawn for pasta wheat in Australia

20 hours ago

The University of Adelaide's durum breeding program today at the Hart Field Day will release a new durum wheat variety called DBA-Aurora which promises a step-change in potential durum production in southern Australia.

User comments : 0