April 19, 2021

New algorithm uses online learning for massive cell data sets

The fact that the human body is made up of cells is a basic, well-understood concept. Yet amazingly, scientists are still trying to determine the various types of cells that make up our organs and contribute to our health.

A relatively recent technique called single-cell sequencing is enabling researchers to recognize and categorize cell types by characteristics such as which genes they express. But this type of research generates enormous amounts of data, with data sets ranging from hundreds of thousands to millions of cells.

A new algorithm developed by Joshua Welch, Ph.D., of the Department of Computational Medicine and Bioinformatics, Ph.D. candidate Chao Gao, and their team uses online learning to greatly speed up this analysis. It gives researchers worldwide a way to analyze large data sets using only the amount of memory found on a standard laptop computer. The findings are described in the journal Nature Biotechnology.

"Our technique allows anyone with a computer to perform analyses at the scale of an entire organism," says Welch. "That's really what the field is moving towards."

The team demonstrated their proof of principle using data sets from the National Institutes of Health's BRAIN Initiative, a project aimed at understanding the human brain by mapping every cell. The initiative involves investigative teams throughout the country, including Welch's lab.

Typically, explains Welch, for projects like this one, each single-cell data set that is submitted must be re-analyzed with the previous data sets in the order they arrive. The new approach allows new data sets to be added to existing ones without reprocessing the older data sets. It also enables researchers to break up data sets into so-called mini-batches to reduce the amount of memory needed to process them.
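The mini-batch idea can be illustrated with a far simpler calculation than the one in the paper. The sketch below is not the team's algorithm; it only tracks the average expression of each gene, and the class name, function name, and toy numbers are all assumptions made for illustration. What it shares with the approach described above is the pattern: only one small batch of cells is held in memory at a time, and a second data set is folded into the existing result without revisiting the first.

```python
from typing import Iterator, List, Sequence

Cell = Sequence[float]  # one cell = expression values for each gene


def mini_batches(cells: Sequence[Cell], batch_size: int) -> Iterator[Sequence[Cell]]:
    """Yield consecutive slices of at most batch_size cells."""
    for start in range(0, len(cells), batch_size):
        yield cells[start:start + batch_size]


class RunningGeneMeans:
    """Per-gene mean expression, updated one mini-batch at a time."""

    def __init__(self, n_genes: int) -> None:
        self.n_cells = 0
        self.means: List[float] = [0.0] * n_genes

    def update(self, batch: Sequence[Cell]) -> None:
        # Incremental mean: each new cell nudges the running value,
        # so earlier cells never need to be loaded again.
        for cell in batch:
            self.n_cells += 1
            for g, value in enumerate(cell):
                self.means[g] += (value - self.means[g]) / self.n_cells


# Toy data (hypothetical): rows are cells, columns are three genes.
first_data_set = [[1.0, 0.0, 4.0], [3.0, 2.0, 0.0], [2.0, 4.0, 2.0], [2.0, 2.0, 2.0]]
second_data_set = [[7.0, 2.0, 2.0], [3.0, 2.0, 2.0]]

stats = RunningGeneMeans(n_genes=3)
for batch in mini_batches(first_data_set, batch_size=2):
    stats.update(batch)

# A new data set arrives later: update in place, no reprocessing of the first.
for batch in mini_batches(second_data_set, batch_size=2):
    stats.update(batch)

print(stats.n_cells, [round(m, 6) for m in stats.means])
```

In a real analysis the batches would be read from disk rather than from an in-memory list, and the quantity being updated would be a model of cell types rather than a simple average.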

"This is crucial for the sets increasingly generated with millions of cells," Welch says. "This year, there have been five to six papers with two million cells or more, and the amount of memory you need just to store the raw data is significantly more than anyone has on their computer."

Welch likens the online technique to the continuous data processing done by social media platforms like Facebook and Twitter, which must process continuously generated data from users and serve up relevant posts to people's feeds. "Here, instead of people writing tweets, we have labs around the world performing experiments and releasing their data."

The finding has the potential to greatly improve efficiency for other ambitious projects like the Human Body Map and Human Cell Atlas. Says Welch, "Understanding the normal complement of cells in the body is the first step towards understanding how they go wrong in disease."

More information: Chao Gao et al, Iterative single-cell multi-omic integration using online learning, Nature Biotechnology (2021). DOI: 10.1038/s41587-021-00867-x

Journal information: Nature Biotechnology

Provided by University of Michigan

Citation: New algorithm uses online learning for massive cell data sets (2021, April 19) retrieved 25 June 2024 from https://phys.org/news/2021-04-algorithm-online-massive-cell.html

