AI-based approach matches protein interaction partners

Comparing the AFM default MSA Transformer pairing strategy with DiffPALM for a protein structure. Credit: Lupo et al 2024, DOI: 10.1073/pnas.2311887121

Proteins are the building blocks of life, involved in virtually every biological process. Understanding how proteins interact with each other is crucial for deciphering the complexities of cellular functions, and has significant implications for drug development and the treatment of diseases.

However, predicting which proteins bind together has been a challenging aspect of computational biology, primarily due to the vast diversity and complexity of protein structures. A new study from the group of Anne-Florence Bitbol at EPFL may now change that.

The team of scientists, including Umberto Lupo, Damiano Sgarbossa and Bitbol, has developed DiffPALM (Differentiable Pairing using Alignment-based Language Models), an AI-based approach that can significantly advance the prediction of interacting protein sequences. The study is published in PNAS.

DiffPALM leverages the power of protein language models, an advanced machine learning concept borrowed from natural language processing, to analyze and predict protein interactions among the members of two protein families with unprecedented accuracy.

By using these machine learning techniques to predict interacting protein pairs, it improves significantly on other methods, which often require large, diverse datasets and struggle with the complexity of eukaryotic protein complexes.

Another advantage of DiffPALM is its versatility: it can work even with smaller sequence datasets and thus address rare proteins that have few homologs (proteins from different species that share common evolutionary ancestry). It relies on protein language models trained on multiple sequence alignments (MSAs), such as the MSA Transformer and AlphaFold's EvoFormer module, which allow it to understand and predict the interactions between proteins with a high degree of accuracy.

DiffPALM also shows promise for predicting the structure of protein complexes, the intricate assemblies formed when multiple proteins bind together, which are essential for many of the cell's processes.

In the study, the team compared DiffPALM with traditional coevolution-based pairing methods, which study how proteins evolve together over time when they interact closely: changes in one protein can lead to changes in its interacting partner. This coevolution is an extremely important aspect of molecular biology, and it is well captured by protein language models trained on MSAs.
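To make the idea of a coevolution signal concrete, here is a minimal sketch in Python. It is not DiffPALM and not the method used in the study. It only illustrates the classical intuition that correctly paired partners show statistical dependence between alignment columns, measured here with mutual information. The function names and the tiny two-residue sequences are invented for illustration.

```python
import math
from collections import Counter


def mutual_information(col_a, col_b):
    """Mutual information (in bits) between two equally long residue columns."""
    n = len(col_a)
    count_a = Counter(col_a)
    count_b = Counter(col_b)
    count_ab = Counter(zip(col_a, col_b))
    mi = 0.0
    for (a, b), c in count_ab.items():
        p_ab = c / n
        # p_ab / (p_a * p_b), written with raw counts
        mi += p_ab * math.log2(c * n / (count_a[a] * count_b[b]))
    return mi


def pairing_score(seqs_a, seqs_b):
    """Sum of mutual information over all inter-protein column pairs.

    seqs_a[k] is assumed to be paired with seqs_b[k]; within each family
    the sequences are assumed to be aligned and of equal length.
    """
    cols_a = list(zip(*seqs_a))
    cols_b = list(zip(*seqs_b))
    return sum(mutual_information(ca, cb) for ca in cols_a for cb in cols_b)


# Hypothetical toy families: position 2 of family A (K or E) covaries
# with position 1 of its true partner in family B (D or R).
fam_a = ["AK", "AK", "AE", "AE"]
partners_correct = ["DG", "DG", "RG", "RG"]
partners_scrambled = ["DG", "RG", "DG", "RG"]

print(pairing_score(fam_a, partners_correct))
print(pairing_score(fam_a, partners_scrambled))
```

In this toy case the correct pairing scores 1 bit and the scrambled pairing scores 0 bits. Note that mutual information alone cannot tell which residue goes with which, only that the columns depend on each other. That is one reason practical pairing methods need more context than this sketch provides.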

DiffPALM is shown to outperform traditional methods on challenging benchmarks, demonstrating its robustness and efficiency.

The application of DiffPALM is obvious in the field of basic protein biology, but extends beyond it, as it has the potential to become a powerful tool in medical research and drug development. For instance, accurately predicting protein interactions can help understand disease mechanisms and develop targeted therapies.

The researchers have made DiffPALM freely available, hoping that the scientific community adopts it widely to further advancements in computational biology and enable researchers to explore the complexities of protein interactions.

By combining advanced machine learning techniques and efficient handling of complex biological data, DiffPALM marks a significant leap forward in computational biology.

It not only enhances our understanding of protein interactions but also opens up new avenues in medical research, potentially leading to breakthroughs in disease treatment and drug development.

More information: Lupo, Umberto et al, Pairing interacting protein sequences using masked language modeling, Proceedings of the National Academy of Sciences (2024). DOI: 10.1073/pnas.2311887121

Citation: AI-based approach matches protein interaction partners (2024, June 24) retrieved 28 June 2024 from https://phys.org/news/2024-06-ai-based-approach-protein-interaction.html