Tending the future of data analysis with MVApp

Tending the future of data analysis with MVApp
The team typically uses small flowering plants called Arabidobsis thaliana as a model plant for their research. Credit: KAUST

The vast datasets generated by modern plant-science technologies require clever data-mining methods to extract useful information. Now, KAUST researchers have developed MVApp—an open-source, online statistics platform for conducting multivariate analyses of these intricate data.

The recent development of high-throughput phenotyping techniques has rapidly produced huge datasets on the characteristics of plants. These multivariate data hold crucial details about : how a plant responds to and how a plant's growth patterns and potential yields change—all of which are valuable in developing sustainable agriculture and ensuring food security.

"Our experiments typically include measurements of thousands of plants every day for multiple traits, from leaf size through to salt-stress tolerance or resistance to plant pathogens," says Magdalena Julkowska, a research scientist working in Mark Tester's lab at KAUST. "These data are extremely powerful, but overwhelming to sieve through, she adds, "As a team, we know the struggles that come with large data analyses, and we figured if we struggle, then others must too."

The team built MVApp using R-language—a popular tool for statistical analyses—and incorporated the most pertinent R packages for analyzing phenotyping data. MVApp can be used with datasets of different sizes, from exploratory analyses of large-scale natural diversity to smaller-scale projects comparing mutant phenotypes to wild-type plants.

The team also incorporated a technique called quantile regression into MVApp—this specialist tool is used in other fields but has not yet reached its full potential in plant science.

"When we screen populations of hundreds of diverse plant accessions, originating from different parts of the globe, the plants that yield well might have different traits contributing to yield than the plants that produce low yields," says Julkowska. "Let's say you'd like to explain the yield of a specific plant type by its biomass and —quantile regression can help quantify how much each trait is contributing to your main trait of interest."

MVApp produces comprehensive, easy-to-follow outputs, and generates attractive, publication-ready figures that have clear links back to the raw data used to create them. The MVApp team are passionate about improving data transparency and streamlining data curation, ensuring that every scientist can produce valuable, reproducible results.

"We hope that MVApp will help the entire scientific community—not just plant scientists—to become familiar with various and to know how to implement them, particularly with big data," says Julkowska. "We welcome feedback and hope users will help us improve MVApp."

Explore further

Directed evolution comes to plants

More information: Magdalena M. Julkowska et al, MVApp—Multivariate Analysis Application for Streamlined Data Analysis and Curation, Plant Physiology (2019). DOI: 10.1104/pp.19.00235
Citation: Tending the future of data analysis with MVApp (2019, July 16) retrieved 17 October 2021 from https://phys.org/news/2019-07-tending-future-analysis-mvapp.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Feedback to editors