Integrated bioinformatics gateways portal and interface

Sep 22, 2011
Figure 1: Accessing and analyzing raw biological data from millions of individual data records is now easier using RIKEN’s bioinformatics portal BioLOD.org and interface, Semantic-JSON. Credit: © 2011 iStockphoto/Petrovich9

An easy-to-use bioinformatics interface has been developed by a research group led by Tetsuro Toyoda called the RIKEN Bioinformatics And Systems Engineering division (BASE), Yokohama. The web-service-based tool, called Semantic-JSON, and the portal, BioLOD, integrate access to information contained within genomics, proteomics, and other ‘omics’-based data repositories. 

“Advances in life sciences increasingly depend upon cross-analysis and integration of diverse information from multiple large databases maintained on remote computer servers,” explains Toyoda. “The challenge is to facilitate data retrieval, integration and collaboration while maintaining database security.”

As a first step, various research organizations worldwide, including RIKEN, recently published 192 public and 190 private mammalian, plant and protein databases. The data are integrated by SciNetS.org, the Scientists’ Networking System. These databases contain more than 8.2 million individual data records. 

Pioneering a new global trend, BASE provides ‘structured linked open data’ and private data via the newly developed BioLOD.org portal connecting with the World Wide Web Consortium (W3C) Linking Open Data initiative. These self-described data are interlinked using standard web technologies allowing automatic reading by computers, thereby making them more useful to researchers. The system facilitates information sharing and collaboration between researchers, but brings new challenges.

“The sheer amount of data contained in our biological data cloud outstripped the capacity of existing interfaces to cope with the complexity of researcher queries, motivating us to develop Semantic-JSON,” explains Toyoda.

Semantic-JSON has two major components. The secured, unified data repository integrates data meaningfully—or ‘semantically’ in computer parlance— from numerous sources. The web-based allows researchers to retrieve linked data seamlessly and securely using established bioinformatics programming languages and processing. Bioinformatics researchers can then use their specialized computational tools to analyze raw biological data (Fig. 1).

Databases already available through Semantic-JSON and BioLOD.org include the RIKEN Integrated Database of Mammals with 79 human and mouse omics databases, the RIKEN Integrated Database of Plants incorporating 30 similar databases for the plant species Arabidopsis thaliana, and the RIKEN Integrated Protein Database containing 18 databases.

Since December 2009, international researchers have successfully used the system to identify 28 million data relationships, generating some 4.5 terabytes of associated files. Around 134,000 programs from non-RIKEN researchers have accessed the server as of March 2011. Biological applications include genome design, DNA sequence processing, and the inference of phenotype biological characteristics from genomic information.

“Our next goal is to develop and improve the system to increase its functionality and the usefulness of its linked open data to the worldwide biological community,” says Toyoda.

Explore further: Researchers develop fast, economical method for high-definition video compositing

More information: Kobayashi, N., et al. Semantic-JSON: a lightweight web service interface for Semantic Web contents integrating multiple life science databases. Nucleic Acids Research published online 1 June, 2011 (doi: 10.1093/nar/gkr353).

add to favorites email to friend print save as pdf

Related Stories

Controlling a sea of information

Mar 17, 2008

Curators at one of the world’s most widely used biological databases, The Arabidopsis Information Resource, or TAIR, have joined forces with the journal Plant Physiology, to solve the “flood of information” dilemma. It is ...

Developing web technologies to share secure information

Mar 02, 2010

Dr. Lalana Kagal and fellow researchers at the Massachusetts Institute of Technology are developing a standard policy language to achieve flexible and dynamic Web security when information is shared between agencies, countries ...

Recommended for you

Review: Google music plan solid, serendipitous

1 hour ago

Google's new music service offers a lot of eye candy to go with the tunes. The song selection of around 18 million tracks is comparable to popular services such as Spotify and Rhapsody, and a myriad of playlists ...

User comments : 0

More news stories

Review: Google music plan solid, serendipitous

Google's new music service offers a lot of eye candy to go with the tunes. The song selection of around 18 million tracks is comparable to popular services such as Spotify and Rhapsody, and a myriad of playlists ...

Solar plane sets distance record on US tour

The first manned aircraft that can fly day and night powered only by solar energy set a new distance record Thursday when it landed after the second leg of a cross-country US tour.

A quantum simulator for magnetic materials

Physicists understand perfectly well why a fridge magnet sticks to certain metallic surfaces. But there are more exotic forms of magnetism whose properties remain unclear, despite decades of intense research. ...