When one become two: Separating DNA for more accurate nanopore analysis

When one become two: Separating DNA for more accurate nanopore analysis
Credit: Earlham Institute

A new software tool developed by Earlham Institute researchers will help bioinformaticians improve the quality and accuracy of their biological data, and avoid mis-assemblies. The fast, lightweight, user-friendly tool visualizes genome assemblies and gene alignments from the latest next generation sequencing technologies.

Called Alvis, the new visualization tool examines mappings between DNA sequence data and reference databases. This allows bioinformaticians to more easily analyze their data generated from common genomics tasks and formats by producing efficient, ready-made vector images.

First author and post-doctoral scientist at the Earlham Institute Dr. Samuel Martin in the Leggett Group, said: "Typically, alignment tools output plain text files containing lists of alignment data. This is great for computer parsing and for being incorporated into a pipeline, but it can be difficult to interpret by humans.

"Visualization of alignment data can help us to understand the problem at hand. As a new technology, several new alignment formats have been implemented by new tools that are specific to nanopore sequencing technology.

"We found that existing visualization tools were not able to interpret these formats; Alvis can be used with all common alignment formats, and is easily extensible for future ones."

A key feature of the new command line tool is its unique ability to automatically highlight chimeric sequences—weak links in the DNA chain. This is where two sequences—from different parts of a genome or different species—are linked together by mistake to make one, affecting the data's accuracy.

Chimera sequences can be problematic for bioinformaticians when identifying specific DNA. The chimera formation can physically happen to the DNA molecules during either sequencing library preparation, during the sequencing process on some platforms, and by assembly tools when trying to piece together a genome.

During the development of the tool, the team compared genome assemblies with and without using Alvis chimera detection. The vector image produced shows an example output, where the intuitive tool tracks all reads it recognizes as chimeras.

"Although chimeric sequences don't make up a large proportion of samples, they can have a significant effect, so we have to be careful that we have identified them during analysis," said Dr. Martin.

"In the Alvis diagram example of chimera data, each rectangle across the page represents a read, and the colored blocks inside them represent alignments. Most chimeras are easy to see because their alignments are different colors, meaning they map to different genomes. Others are more subtle because both alignments are to the same genome, but different regions."

The Alvis tool can pinpoint visualization of only chimeric sequences for further inspection, and output numerical data describing the chimeras. This demonstrates that by applying the tool and then bioinformatically splitting the chimeras, the quality of the assemblies is significantly improved.

Accessed over 600 times since being made available at the beginning of March this year, Dr. Martin, adds: "We hope that Alvis continues to be useful to other researchers working with, for example, nanopore sequencing; improving their understanding of their data by visualizing alignments."

"Alignments are so fundamental to bioinformatics that it could be of use to anyone working with long read sequencing data, as well as alignments generated by sequencing data from short-read platforms. The diagrams that Alvis generates can be easily exported to directly use in publications, demonstrated in our study already."

The paper "Alvis: a for contig and read ALignment VISualization and chimera detection" is published in BMC Bioinformatics.

Explore further

New genome alignment tool empowers large-scale studies of vertebrate evolution

More information: Samuel Martin et al, Alvis: a tool for contig and read ALignment VISualisation and chimera detection, BMC Bioinformatics (2021). DOI: 10.1186/s12859-021-04056-0
Provided by Earlham Institute
Citation: When one become two: Separating DNA for more accurate nanopore analysis (2021, May 18) retrieved 22 June 2021 from https://phys.org/news/2021-05-dna-accurate-nanopore-analysis.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Feedback to editors

User comments