This article has been reviewed according to Science X's editorial process and policies. Editors have highlighted the following attributes while ensuring the content's credibility:

fact-checked

peer-reviewed publication

trusted source

proofread

A comprehensive database for the study of protein aggregation

Comprehensive database for the study of protein aggregation
Credit: Autonomous University of Barcelona

Researchers at the IBB-UAB have developed the most comprehensive database available to date to help understand the basis of protein aggregation, a phenomenon associated with aging and several pathologies. The new resource, A3D-MOBD, brings together the proteomes of 12 of the most studied model organisms, which cover distant biological clades, and contains more than half a million predictions of protein regions with a propensity to form aggregates.

The A3D-MOBD was developed by the Protein Folding and Computational Diseases Group at the Institut de Biotecnologia i de Biomedicina of the Universitat Autònoma de Barcelona (IBB-UAB), which is directed by Biochemistry and Molecular Biology Professor Salvador Ventura.

In collaboration with scientists from the University of Warsaw, the study was recently published in the journal Nucleic Acids Research. It provides pre-calculated aggregation propensity analyses and tools for the study of this phenomenon on a proteomic scale as well as evolutionary comparison between different species.

The new resource builds on the method that the same research group designed in 2015, Aggrescan 3D, but significantly expands the obtainable data. In total, it contains more than 500,000 structural predictions for more than 160,000 proteins from 12 highly characterized model organisms widely used in biology, biotechnology and biomedicine research.

It includes the herbaceous plant Arabidopsis thaliana, the nematode worm Caenorhabditis elegans, zebrafish Danio rerio, the enteric bacterium Escherichia coli, the minimal genome bacteria Mycoplasma genitalium, mouse Mus musculus, the fusion and fission yeasts Saccharomyces cerevisiae and Schizosaccharomyces pombe, human Homo sapiens, rat Rattus norvegicus, the fruit fly Drosophila melanogaster and the COVID-19 causative virus SARS-CoV-2.

The adaptive architecture of A3D-MOBD allows for future additions of other organisms relevant to the medical, biological, agricultural and industrial sectors.

In addition, the tool provides results on solubility and stability and includes additional information to contextualize the aggregation process. To develop it, researchers used several computational sources including the AI-based protein structure modeling algorithm AlphaFold and TOPCONS for the prediction of protein interaction with , as well as linking to organism specific gold-reference databases such as the Human Protein Atlas and Wormbase.

Protein aggregation is associated with aging and is the basis of different pathologies, such as Parkinson's disease, Alzheimer's disease and amyotrophic lateral sclerosis (ALS). It is also one of the most important barriers in the industrial production of therapeutic molecules, increasing their final price. With the publication of this database, researchers hope to obtain new clues to understand why some diseases caused by protein aggregation develop in some species, while other organisms are not susceptible to them.

The resource researchers represents the most comprehensive tool available to date for the prediction of aggregation-prone regions. "We anticipate that it will offer solutions to a much wider audience of researchers, not only because of the large collection of structures, but also because of its integration with databases from different biological fields," says Salvador Ventura. "We are confident that it will set a new standard in protein research and we expect it to become a basic resource in this field," concludes the UAB researcher.

More information: Aleksandra E Badaczewska-Dawid et al, A3D Model Organism Database (A3D-MODB): a database for proteome aggregation predictions in model organisms, Nucleic Acids Research (2023). DOI: 10.1093/nar/gkad942

Journal information: Nucleic Acids Research

Citation: A comprehensive database for the study of protein aggregation (2023, November 3) retrieved 27 April 2024 from https://phys.org/news/2023-11-comprehensive-database-protein-aggregation.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Computational method increases design efficiency of protein-based drugs

1 shares

Feedback to editors