New algorithm enables data integration at single-cell resolution

April 2, 2018, New York University
Credit: CC0 Public Domain

A team of computational biologists has developed an algorithm that can 'align' multiple sequencing datasets with single-cell resolution. The new method, published today in the journal Nature Biotechnology, has implications for better understanding how different groups of cells change during disease progression, in response to drug treatment, or across evolution.

"This approach for data integration will enable the comparison of single-cell datasets and the ability to dissect the differences between them," explains Rahul Satija, the study's senior author, who is an assistant professor in NYU's Center for Genomics and Systems Biology and a core faculty member at the New York Genome Center. "Moreover, these methods will be valuable for the integration of diverse datasets produced across individuals and laboratories—and even for researchers studying the same tissue across different species."

The field of single-cell sequencing is rapidly expanding, with the potential to precisely study how the basic building blocks of life function and evolve. However, significant computational challenges remain, particularly when analyzing multiple datasets. For example, when the team independently analyzed datasets of the same bone-marrow stem cells, produced by two separate labs, they obtained strikingly different results.

"We needed a new that could identify and align shared groups of cells present in multiple experiments so that we could integrate the datasets together," says Andrew Butler, a graduate student at NYU and lead author of the study.

To accomplish this, the researchers modified analytical techniques specialized at finding shared patterns across images—for example, to align facial visualizations across different lighting conditions for single-cell sequencing data. When they repeated their bone-marrow analysis, the same cell populations consistently appeared.

"We realized that we could use these methods to learn how cells modify their behavior—for example, in response to ," notes Butler.

By analyzing a of stimulated with interferon—a signaling protein created in response to pathogens or tumor —the team could precisely identify which genes were switched on in each of 13 responding cell types. Furthermore, they integrated single-cell datasets of pancreatic tissue from humans and mice, thereby identifying 10 cell types that were shared across species and defining the evolutionary changes occurring in each group.

Looking forward, the researchers are applying their approach to study cellular drug responses in clinical samples, but also aim to make their methods widely accessible.

"All of our software is open-source and freely available online," adds Satija. "We hope these methods will help others in the community discover exciting new biological phenomena."

Explore further: New tool allows analysis of single-cell RNA data in pre-malignant tumours

More information: Integrating single-cell transcriptomic data across different conditions, technologies, and species, Nature Biotechnology (2018). nature.com/articles/doi:10.1038/nbt.4096

Related Stories

Democratizing single-cell analysis

March 15, 2018

Scientists at the Allen Institute and the University of Washington have developed a new low-cost technique for profiling gene expression in hundreds of thousands of cells. Split Pool Ligation-based Transcriptome sequencing ...

Software package processes huge amounts of single-cell data

February 13, 2018

Scientists from the Helmholtz Zentrum München have developed a program that for managing enormous datasets. The software, called Scanpy, is a candidate for analyzing the Human Cell Atlas, and has recently been published ...

Researchers measure gene activity in single cells

March 16, 2018

For biologists, a single cell is a world of its own: It can form a harmonious part of a tissue, or go rogue and take on a diseased state, like cancer. But biologists have long struggled to identify and track the many different ...

Recommended for you

Staying a step ahead of the game

October 18, 2018

Trypanosoma brucei, which causes sleeping sickness, evades the immune system by repeatedly altering the structure of its surface coat. Sequencing of its genome and studies of its 3-D genome architecture have now revealed ...

Elucidating cuttlefish camouflage

October 18, 2018

The unique ability of cuttlefish, squid and octopuses to hide by imitating the colors and texture of their environment has fascinated natural scientists since the time of Aristotle. Uniquely among all animals, these mollusks ...

Biodiversity can also destabilize ecosystems

October 18, 2018

Ecosystems have a variety of benefits to humans, including food, water and other resources, as well as recreational space. It is therefore important that these systems remain functional and stable—especially in view of ...

0 comments

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.