January 3, 2024

Researchers release initial dataset for protist genomes project

Protists, single-celled eukaryotic organisms encompassing unicellular algae and protozoans, inhabit aquatic environments. Functioning as primary producers and oxygen generators, they play crucial roles in the carbon cycle and serve as vital sources of human nutrition, bioenergy, and food for aquatic animals. However, they can also pose challenges, causing harmful algal blooms and red tides, acting as both pathogens and beneficial partners in symbiotic relationships.

The NCBI taxonomy system has documented over 60,000 identified protist species. In December 2019, a group of scientists led by the Institute of Hydrobiology (IHB) of the Chinese Academy of Sciences (CAS) launched the Protist 10,000 Genomes Project (P10K). The primary aim of this project is to create a comprehensive genetic resource database for protists.

Recently, Prof. Miao Wei's team at the IHB and Prof. Zhang Zhang's team from the Beijing Institute of Genomics of CAS (China National Center for Bioinformation) released the initial dataset from the P10K project which is now available, and the related paper was published in Nucleic Acids Research.

The inaugural data released from the P10K comprises a comprehensive set of 2,959 protist datasets, featuring 1,601 genomes and 1,358 transcriptomes. Among these datasets, 1,858 were integrated from public databases. The P10K team undertook new sequencing for 1,101 datasets with a primary focus on ciliates. The newly sequenced data contributed to a substantial 37% expansion in the overall size of the protist dataset.

To overcome the analytical challenges posed by large-scale single-cell omics data, the P10K team developed a standardized analysis pipeline tailored for single-cell sequencing data of protists. This pipeline encompasses the assembly, decontamination, species identification, gene annotation, and evaluation processes.

Quality assessments revealed that genomes annotated through this pipeline exhibit a similar proportion of medium and high-quality data compared to those available in public databases.

The researchers believe the P10K database will promote research on eukaryotic origins, diversity, and microbial interactions, and the applications of protist genetic resources in ecological conservation, pollutant degradation, nutrition, health, and disease prevention. In addition, the database will support the identification of planktonic organisms based on environmental DNA (eDNA), facilitating aquatic ecological health assessments.

More information: Xinxin Gao et al, The P10K database: a data portal for the protist 10 000 genomes project, Nucleic Acids Research (2023). DOI: 10.1093/nar/gkad992

Journal information: Nucleic Acids Research

Provided by Chinese Academy of Sciences

Citation: Researchers release initial dataset for protist genomes project (2024, January 3) retrieved 30 June 2024 from https://phys.org/news/2024-01-dataset-protist-genomes.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

The end of genes: Routine test reveals unique divergence in genetic code

80 shares

Feedback to editors

Researchers release initial dataset for protist genomes project

The Milky Way's eROSITA bubbles are large and distant

Saturday Citations: Armadillos are everywhere; Neanderthals still surprising anthropologists; kids are egalitarian

NASA astronauts will stay at the space station longer for more troubleshooting of Boeing capsule

The beginnings of fashion: Paleolithic eyed needles and the evolution of dress

Analysis of NASA InSight data suggests Mars hit by meteoroids more often than thought

New computational microscopy technique provides more direct route to crisp images

A harmless asteroid will whiz past Earth Saturday. Here's how to spot it

Tiny bright objects discovered at dawn of universe baffle scientists

New method for generating monochromatic light in storage rings

Soft, stretchy electrode simulates touch sensations using electrical signals

Relevant PhysicsForums posts

Who chooses official designations for individual dolphins, such as FB15, F153, F286?

Color Recognition: What we see vs animals with a larger color range

Innovative ideas and technologies to help folks with disabilities

Is meat broth really nutritious?

COVID Virus Lives Longer with Higher CO2 In the Air

Periodical Cicada Life Cycle

The end of genes: Routine test reveals unique divergence in genetic code

Researchers find large diversity of protists in the Parabasalia phylum in both mice and humans

Underneath it all: Dishing the dirt on protists

Scientists develop open archive of plant images and related phenotypic traits

The unraveling of a protist genome could unlock the mystery of marine viruses

Exploring the existence of life at 50°C

Researcher discovers 1 in 5 bacteria can break down plastic

Supercomputing in the age of AI to accelerate protein structure prediction

Under pressure: How comb jellies have adapted to life at the bottom of the ocean

The worm has turned: DIY lab platform evaluates new molecules in minutes

Research team develops surfaces designed to discourage spread of resistant bacteria

Researchers develop deep-learning model that outperforms Google AI system to predict peptide structures

Medical Xpress

Tech Xplore

Science X

Researchers release initial dataset for protist genomes project

The Milky Way's eROSITA bubbles are large and distant

Saturday Citations: Armadillos are everywhere; Neanderthals still surprising anthropologists; kids are egalitarian

NASA astronauts will stay at the space station longer for more troubleshooting of Boeing capsule

The beginnings of fashion: Paleolithic eyed needles and the evolution of dress

Analysis of NASA InSight data suggests Mars hit by meteoroids more often than thought

New computational microscopy technique provides more direct route to crisp images

A harmless asteroid will whiz past Earth Saturday. Here's how to spot it

Tiny bright objects discovered at dawn of universe baffle scientists

New method for generating monochromatic light in storage rings

Soft, stretchy electrode simulates touch sensations using electrical signals

Relevant PhysicsForums posts

Related Stories

The end of genes: Routine test reveals unique divergence in genetic code

Researchers find large diversity of protists in the Parabasalia phylum in both mice and humans

Underneath it all: Dishing the dirt on protists

Scientists develop open archive of plant images and related phenotypic traits

The unraveling of a protist genome could unlock the mystery of marine viruses

Exploring the existence of life at 50°C

Recommended for you

Researcher discovers 1 in 5 bacteria can break down plastic

Supercomputing in the age of AI to accelerate protein structure prediction

Under pressure: How comb jellies have adapted to life at the bottom of the ocean

The worm has turned: DIY lab platform evaluates new molecules in minutes

Research team develops surfaces designed to discourage spread of resistant bacteria

Researchers develop deep-learning model that outperforms Google AI system to predict peptide structures

Newsletter sign up

Donate and enjoy an ad-free experience