Barcoding long DNA quantifies CRISPR effects

The sequencing setup for the study: an Oxford Nanopore sequencer and a laptop computer. The screen in the background shows the DNA strand being fed through the sequencer. Credit: 2020 KAUST Mo Li.

Current sequencing techniques lack the sensitivity to detect rare gene mutations in a pool of cells, a capability that is particularly important in, for example, early cancer detection. Now, scientists at KAUST have developed an approach, called targeted individual DNA molecule sequencing (IDMseq), that can accurately detect a single mutation in a pool of 10,000 cells.

Importantly, the team successfully used IDMseq to determine the number and frequency of mutations caused by the gene editing tool CRISPR/Cas9 in human embryonic stem cells. Clinical trials are underway to test CRISPR's safety in treating some genetic diseases. "Our study revealed potential risks associated with CRISPR/Cas9 editing and provides tools to better study genome editing outcomes," says KAUST bioscientist Mo Li, who led the study.

IDMseq is a sequencing technique that involves attaching a unique barcode to every DNA molecule in a sample of cells and then making a large number of copies of each molecule using a polymerase chain reaction (PCR). Copied molecules carry the same barcode as the original ones.

A bioinformatics toolkit, called variant analysis with unique molecular identifier for long-read technology (VAULT), then decodes the barcodes and places reads that share a barcode into their own "bins", with every bin representing one of the original DNA molecules. VAULT uses a combination of algorithms to detect mutations in the bins. The process works especially well with third-generation long-read sequencing technologies and helps scientists detect and determine the frequency of all types of mutations, from changes in single DNA letters to large deletions and insertions in the original DNA.
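The binning idea can be illustrated with a toy sketch. This is not VAULT's actual implementation: the function names are hypothetical, barcodes are assumed to be read without error, and reads within a bin are assumed to be already aligned and of equal length, none of which holds for real long-read data.

```python
from collections import Counter, defaultdict

def bin_reads_by_barcode(reads):
    """Group (barcode, sequence) pairs so each bin holds copies of one original molecule."""
    bins = defaultdict(list)
    for barcode, sequence in reads:
        bins[barcode].append(sequence)
    return dict(bins)

def consensus(sequences):
    """Majority vote at each position (assumes equal-length, pre-aligned reads)."""
    return "".join(
        Counter(column).most_common(1)[0][0] for column in zip(*sequences)
    )

def variant_frequency(bins, reference):
    """Fraction of original molecules whose consensus differs from the reference."""
    consensuses = [consensus(seqs) for seqs in bins.values()]
    mutant = sum(1 for c in consensuses if c != reference)
    return mutant / len(consensuses)

# Hypothetical example: three original molecules, three PCR copies each.
reads = [
    ("AAAT", "ACGT"), ("AAAT", "ACGT"), ("AAAT", "ACTT"),  # one copy has a read error
    ("CCGA", "ATGT"), ("CCGA", "ATGT"), ("CCGA", "ATGT"),  # a true variant
    ("GGTC", "ACGT"), ("GGTC", "ACGT"), ("GGTC", "ACGT"),
]
bins = bin_reads_by_barcode(reads)
frequency = variant_frequency(bins, reference="ACGT")
```

Because every copy in a bin descends from the same molecule, an error that appears in only one copy is outvoted, while a true variant appears in all copies. Counting bins rather than reads gives the frequency among original molecules, here one in three.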

The approach successfully detected a deliberately introduced gene mutation in cells that had been mixed with wild-type cells at ratios of 1:100, 1:1,000 and 1:10,000. It also correctly reported the mutation's frequency at each ratio.
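As a back-of-the-envelope illustration of what such ratios demand, the sketch below estimates how many original molecules must be sampled to see a rare variant at least once. It assumes each sampled molecule is an independent draw and that the mutant fraction among molecules equals the mixing ratio; these are simplifying assumptions for illustration, not figures from the study, and the function names are hypothetical.

```python
import math

def detection_probability(frequency, molecules):
    """Chance that at least one of `molecules` independent draws is a mutant."""
    return 1.0 - (1.0 - frequency) ** molecules

def molecules_needed(frequency, target_probability=0.95):
    """Smallest number of draws giving at least `target_probability` of one mutant."""
    return math.ceil(
        math.log(1.0 - target_probability) / math.log(1.0 - frequency)
    )

# Mixing ratios mentioned in the article, treated as mutant fractions.
for ratio in (1 / 100, 1 / 1_000, 1 / 10_000):
    n = molecules_needed(ratio)
    print(f"fraction {ratio:g}: about {n} molecules for a 95% chance of one mutant")
```

Under these assumptions the required number of molecules grows roughly in inverse proportion to the mutant fraction, which is why rare variants call for deep sampling as well as low per-molecule error.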

The researchers also used IDMseq to look for mutations caused by CRISPR/Cas9 genome editing. "Several recent studies have reported that Cas9 introduces unexpected, large DNA deletions around the edited genes, leading to safety concerns. These deletions are difficult to detect and quantitate using current DNA sequencing strategies. But our approach, in combination with various sequencing platforms, can analyze these large DNA mutations with high accuracy and sensitivity," says Ph.D. student Chongwei Bi.

The tests found that large deletions accounted for 2.8 to 5.4 percent of Cas9 editing outcomes. The researchers also discovered a three-fold rise in single-base DNA variants in the edited region. "This shows that there is a lot that we need to learn about CRISPR/Cas9 before it can be safely used in the clinic," says Yanyi Huang of Peking University, who is an international collaborator co-funded by KAUST.

IDMseq can currently sequence only one DNA strand, but work to enable double-strand sequencing could further improve performance, say the researchers.

More information: Chongwei Bi et al, Long-read individual-molecule sequencing reveals CRISPR-induced genetic heterogeneity in human ESCs, Genome Biology (2020). DOI: 10.1186/s13059-020-02143-8

Journal information: Genome Biology

Citation: Barcoding long DNA quantifies CRISPR effects (2020, August 26) retrieved 25 September 2023 from