Crowdsourcing platform makes public gene expression data more accessible

Scientists from the National Institute of Allergy and Infectious Diseases (NIAID), part of the National Institutes of Health, have developed a free online platform that uses a crowdsourcing approach to make public gene expression data more accessible to biomedical researchers without computational expertise. They describe the platform, called OMics Compendia Commons (OMiCC), in the June 20 online issue of Nature Biotechnology.

Public databases contain millions of profiles—data that describe the degree to which genes are turned on or off under certain conditions. Potentially, scientists could reuse these data to generate and address new research questions. For example, researchers could re-purpose a dataset comparing blood samples from drug-treated and untreated people to investigate the effects of gender on treatment. However, this wealth of information remains largely untapped for such data reuse, partially because many biologists lack the computer programming expertise needed for data retrieval, processing and analysis. In addition, public database entries typically contain raw study data, which need to be structured for analysis.

OMiCC aims to use crowdsourcing techniques to harness the expertise of the research community to overcome these challenges. Within the platform, users can create groups of and "annotate" them by assigning parameters, such as sample type and disease, using a standardized vocabulary. OMiCC saves these user-created groups and associated annotations, making them available to others for reuse.

Within OMiCC, users can pool these groups of data and analyze information from multiple studies to search for biological relationships, a statistical approach known as meta-analysis. The NIAID scientists anticipate that as the OMiCC user community grows, the platform will develop into a rich resource that can transform the increasing amounts of public data into novel biological insights.

OMiCC is available at The website provides videos and a step-by-step tutorial to help users navigate the .

More information: Nature Biotechnology, DOI: 10.1038/nbt.3603

Journal information: Nature Biotechnology

Citation: Crowdsourcing platform makes public gene expression data more accessible (2016, June 20) retrieved 5 December 2022 from
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Furthering data analysis of next-generation sequencing to facilitate research


Feedback to editors