This article has been reviewed according to Science X's editorial process and policies. Editors have highlighted the following attributes while ensuring the content's credibility:

fact-checked

trusted source

proofread

Improving accuracy of molecular quantification in high throughput sequencing

New research improves accuracy of molecular quantification in high throughput sequencing
A schematic showing homotrimer UMI majority vote error correction. We constructed UMIs with homotrimeric nucleotide blocks (composed of combinations of AAA, CCC, GGG, TTT). By evaluating trimer nucleotide similarity, errors of deletion, insertion or substitution are identified and corrected through a "majority vote" system, selecting the most frequent nucleotide. Credit: Nature Methods (2024). DOI: 10.1038/s41592-024-02168-y

A team at NDORMS has developed a new approach to significantly improve the accuracy of RNA sequencing. They have pinpointed the primary source of inaccurate quantification in both short and long-read RNA sequencing, and have introduced the concept of "majority vote" error correction leading to a substantial improvement in RNA molecular counting.

Accurate sequencing of genetic material is crucial in modern biology, particularly for comprehending and addressing diseases linked to genetic anomalies. However, current methodologies encounter substantial constraints.

In a landmark study, an international consortium of researchers, led by Adam Cribbs, Associate Prof. in Computational Biology, and Jianfeng Sun, Postdoctoral Research Associate at the Botnar Institute, University of Oxford, have developed an innovative method to correct errors in PCR amplification—a widely used technique used in high-throughput sequencing.

By pinpointing PCR artifacts as the primary source of inaccurate quantification, the researchers, address a long-standing challenge in generating accurate absolute counts of RNA molecules, which is crucial for various applications in genomics research. The study is published in the journal Nature Methods.

The researchers focused on Unique Molecular Identifiers (UMIs), which are random oligonucleotide sequences used to remove biases introduced during PCR amplification. While UMIs have been widely adopted in sequencing methods, the study reveals that PCR errors can undermine the accuracy of molecular quantification, particularly across different sequencing platforms.

Jianfeng explained, "PCR amplification, essential for most RNA sequencing techniques, can introduce errors, compromising data integrity. We tackled this by synthesizing UMI barcodes using homotrimer nucleotide blocks, enhancing error correction and enabling near-absolute RNA molecule quantification, markedly improving molecular counting accuracy."

Homotrimers are nucleotide sequences consisting of three identical bases, for example AAA, CCC, GGG. By evaluating homotrimers nucleotide similarity, errors are detected and corrected through a "majority vote" method (see figure above).

The study demonstrates that homotrimer UMIs significantly outperform traditional monomer UMIs in reducing false positive fold enrichment during the analysis of differentially expressed genes and transcripts (DEGs and DETs). This enhancement is vital for the accurate identification and quantification of DEGs or DETs, particularly in bulk sequencing approaches.

Additionally, in single-cell sequencing, where extensive PCR amplification is often required, homotrimer UMIs have proven effective in mitigating the effects of PCR artifacts, thereby substantially improving the reliability of sequencing data.

"By constructing UMIs from homogenous blocks of nucleosides, we aimed to improve error correction in both short- and long-read sequencing, showcasing our commitment to enhancing sequencing technology applications," says Associate Professor Adam Cribbs, senior author of the paper and Group Leader in .

This research has profound implications. By rectifying PCR errors in UMIs, it greatly boosts molecular quantification accuracy in various sequencing applications. It's a vital tool for researchers in bulk RNA, single-cell RNA, and DNA sequencing, enabling accurate gene expression and molecular profile analyses.

Enhanced UMI not only reduces the incidence of false positives but also offers multiple diagnostic applications, especially in scenarios necessitating longitudinal analysis of samples.

More information: Jianfeng Sun et al, Correcting PCR amplification errors in unique molecular identifiers to generate accurate numbers of sequencing molecules, Nature Methods (2024). DOI: 10.1038/s41592-024-02168-y

Homotrimer barcodes enable accurate counting of RNA molecules during high-throughput RNA sequencing, Nature Methods (2024). DOI: 10.1038/s41592-024-02169-x , www.nature.com/articles/s41592-024-02169-x

Citation: Improving accuracy of molecular quantification in high throughput sequencing (2024, February 5) retrieved 1 May 2024 from https://phys.org/news/2024-02-accuracy-molecular-quantification-high-throughput.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

New gold-standard quality control tool for RNA sequencing in single cells

13 shares

Feedback to editors