New algorithm enables data integration at single-cell resolution

April 2, 2018, New York University
Credit: CC0 Public Domain

A team of computational biologists has developed an algorithm that can 'align' multiple sequencing datasets with single-cell resolution. The new method, published today in the journal Nature Biotechnology, has implications for better understanding how different groups of cells change during disease progression, in response to drug treatment, or across evolution.

"This approach for data integration will enable the comparison of single-cell datasets and the ability to dissect the differences between them," explains Rahul Satija, the study's senior author, who is an assistant professor in NYU's Center for Genomics and Systems Biology and a core faculty member at the New York Genome Center. "Moreover, these methods will be valuable for the integration of diverse datasets produced across individuals and laboratories—and even for researchers studying the same tissue across different species."

The field of single-cell sequencing is rapidly expanding, with the potential to precisely study how the basic building blocks of life function and evolve. However, significant computational challenges remain, particularly when analyzing multiple datasets. For example, when the team independently analyzed datasets of the same bone-marrow stem cells, produced by two separate labs, they obtained strikingly different results.

"We needed a new that could identify and align shared groups of cells present in multiple experiments so that we could integrate the datasets together," says Andrew Butler, a graduate student at NYU and lead author of the study.

To accomplish this, the researchers modified analytical techniques specialized at finding shared patterns across images—for example, to align facial visualizations across different lighting conditions for single-cell sequencing data. When they repeated their bone-marrow analysis, the same cell populations consistently appeared.

"We realized that we could use these methods to learn how cells modify their behavior—for example, in response to ," notes Butler.

By analyzing a of stimulated with interferon—a signaling protein created in response to pathogens or tumor —the team could precisely identify which genes were switched on in each of 13 responding cell types. Furthermore, they integrated single-cell datasets of pancreatic tissue from humans and mice, thereby identifying 10 cell types that were shared across species and defining the evolutionary changes occurring in each group.

Looking forward, the researchers are applying their approach to study cellular drug responses in clinical samples, but also aim to make their methods widely accessible.

"All of our software is open-source and freely available online," adds Satija. "We hope these methods will help others in the community discover exciting new biological phenomena."

Explore further: New tool allows analysis of single-cell RNA data in pre-malignant tumours

More information: Integrating single-cell transcriptomic data across different conditions, technologies, and species, Nature Biotechnology (2018). nature.com/articles/doi:10.1038/nbt.4096

Related Stories

Democratizing single-cell analysis

March 15, 2018

Scientists at the Allen Institute and the University of Washington have developed a new low-cost technique for profiling gene expression in hundreds of thousands of cells. Split Pool Ligation-based Transcriptome sequencing ...

Software package processes huge amounts of single-cell data

February 13, 2018

Scientists from the Helmholtz Zentrum München have developed a program that for managing enormous datasets. The software, called Scanpy, is a candidate for analyzing the Human Cell Atlas, and has recently been published ...

Researchers measure gene activity in single cells

March 16, 2018

For biologists, a single cell is a world of its own: It can form a harmonious part of a tissue, or go rogue and take on a diseased state, like cancer. But biologists have long struggled to identify and track the many different ...

Recommended for you

Fungus senses gravity using gene borrowed from bacteria

April 24, 2018

The pin mold fungus Phycomyces blakesleeanus forms a dense forest of vertically growing fruiting bodies, but how does it know which way is "up"? New research publishing 24 April in the open access journal PLOS Biology, from ...

Team discovers a new take on early evolution of photosynthesis

April 24, 2018

A team of scientists from Arizona State University's School of Molecular Sciences has begun re-thinking the evolutionary history of photochemical reaction centers (RCs). Their analysis was recently published online in Photosynthesis ...

0 comments

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.