New computational method unravels single-cell data from multiple people

Credit: CC0 Public Domain

A new computational method for assigning the donor in single cell RNA sequencing experiments provides an accurate way to unravel data from a mixture of people. The Souporcell method, created by Wellcome Sanger Institute researchers and their collaborators could help study how genetic variants in different people affect which genes are expressed during infection or response to drugs.

Published this week in Nature Methods, the software could increase efficiency of single-cell experiments, assisting research into transplants, personalized medicine and malaria.

Single-cell RNA sequencing (RNAseq) can reveal exactly which genes are switched on in each individual cell, revealing and what they do. Pooling multiple people's cells into a single cell RNAseq experiment helps to identify how different genomes affect this gene expression. However it is essential to be able to separate the resulting data by individual, which can be very difficult.

The authors tested Souporcell against three other using placental cells, pluripotent stem cell lines and malaria parasites.

Haynes Heaton, the first author from the Wellcome Sanger Institute, said: "Our method, called Souporcell, is able to separate mixtures of individuals' cells in scRNAseq experiments without knowing each individual's full genome sequence beforehand, unlike previous methods. One of the key features of the method is that it estimates the amount of background RNA from dead cells, which is often referred to as the soup. This then allows the removal of that source of noise, and hence the name Souporcell."

Being able to combine the cells into a single experiment increases the accuracy, enabling more information to be found, and also reduces the cost of these experiments.

Dr. Martin Hemberg, a senior author from the Wellcome Sanger Institute, said: "The exact genetic sequence of each person can affect their response to infections, or to drug treatments. The new method enables single cell expression data from multiple people to be analyzed, to show links between genotype and phenotype, in diseases and in the presence of drugs. This will have implications for personalized medicine."

In addition, some samples inherently have a mix of cells with different genomes, including samples from transplant patients who have their original cells and cells from the donor, or populations of parasites, such as malaria, from an infected individual.

Dr. Mara Lawniczak, a senior author from the Wellcome Sanger Institute, said: "This method is helping us understand malaria. People get infected with multiple strains of malaria at once, but we don't know how these strains are competing with each other to reproduce. To even ask the question we have to be able to split out of different strains, and Souporcell is enabling this."

Explore further

Map of malaria behavior set to revolutionize research

More information: Haynes Heaton et al. Souporcell: robust clustering of single-cell RNA-seq data by genotype without reference genotypes, Nature Methods (2020). DOI: 10.1038/s41592-020-0820-1
Journal information: Nature Methods

Provided by Wellcome Sanger Institute
Citation: New computational method unravels single-cell data from multiple people (2020, May 6) retrieved 20 June 2021 from
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Feedback to editors

User comments