Researchers studying the viruses that affect agricultural production or human health now have a new tool for investigating where viruses have spread, on a local, national, or even global scale.

VirusDetect is a free, open-source bioinformatics pipeline that can efficiently analyze small RNA (sRNA) datasets to identify both known and . Boyce Thompson Institute (BTI) Associate Professor Zhangjun Fei and colleagues present this pipeline in a recent paper in Virology.

"We choose this small RNA because it is a highly efficient technology for virus identification and discovery," said Fei. "VirusDetect is the first bioinformatics tool that is specifically designed to analyze this kind of data for virus detection."

There are many ways to detect a virus. Traditional methods use microscopes, antibodies or molecular techniques that detect specific sequences of viral genetic material—all techniques that are not highly efficient, especially in detecting novel viruses. With newer sequencing technology, researchers can sequence viral DNA or RNA along with the host material, but this approach requires expensive, deep sequencing, and early, low-level infections can be easily missed.

VirusDetect takes advantage of an antiviral defense system shared by both plants and animals called RNA interference (RNAi). When a plant or animal cell is invaded by a virus, the cell churns out numerous small RNAs that are just 21-24 nucleotides long. By sequencing these small RNAs and feeding the dataset into the VirusDetect pipeline, scientists can predict the presence of RNA viruses, DNA viruses and viroids.

Fei's lab used this tool to create a Pan-African sweet potato virome, which describes all the viruses affecting sweet potatoes in multiple locations throughout Sub-Saharan Africa. With more than 1,000 field samples, the researchers couldn't process this much data manually and needed a high-throughput pipeline to identify the known and unknown .

"You can use this strategy to investigate virus distribution, diversity and evolution at a large scale—a continent or even global scale. You can collect samples from all over the world," said Fei.

More information: Yi Zheng et al, VirusDetect: An automated pipeline for efficient virus discovery using deep sequencing of small RNAs, Virology (2017). DOI: 10.1016/j.virol.2016.10.017

Journal information: Virology