Machine learning techniques may reveal cause-effect relationships in protein dynamics data

April 17, 2018, American Institute of Physics
Machine learning-based analysis of the signaling pathways found inside amino acids found in human proteins. Credit: Navli Duro/University of South Florida

Machine learning algorithms excel at finding complex patterns within big data, so researchers often use them to make predictions. Researchers are pushing this emerging technology beyond finding correlations to help uncover hidden cause-effect relationships and drive scientific discoveries.

At the University of South Florida, researchers are integrating machine learning techniques into their work studying proteins. As they report in The Journal of Chemical Physics, one of their main challenges has been a lack of methods to identify cause-effect relationships in data obtained from .

"Proteins can be thought of as nanoscopic machines that perform a set of tasks. But when and where proteins carry out their specific tasks is controlled by cells through various stimuli, such as small molecules," said Sameer Varma, an associate professor of biophysics at USF. "These stimuli interact with proteins to switch them 'on' and 'off,' and can even modify their speeds and strengths."

In most proteins, the biological stimuli interact with a site on the that's relatively far away from the part that carries out its corresponding task, requiring a signaling pathway. "This remote-control manner of switching in proteins is known as 'allosteric signaling.' Many proteins of pharmaceutical significance have now been identified where the dynamics or the 'jiggling and wiggling' of their constituent atoms are known to be vital to allosteric signaling," Varma said. "The details, however, remain sketchy."

Varma and colleagues believe machine learning approaches can make a difference. "Developing and using machine learning techniques will enable us to find cause-effect relationships in protein dynamics data and begin to finally address some of the very fundamental questions in protein allostery," he said. "One of our key findings was that the signal initiated at the stimulation site of the protein appeared to weaken as it moved away from the stimulation site. It came as a surprise, because no distance dependence was observed for the coupling of thermal motions between protein sites."

The group's work demonstrates how machine learning approaches can be used to identify cause-effect relationships within data. Beyond this, "these techniques are allowing us to plug critical gaps in protein allostery," Varma said. "Ultimately, when our methods are applied to the many proteins of pharmaceutical interest, we expect the mechanistic details to reveal much-needed new intervention strategies for restoring protein activities in diseased states. The general biophysical insights we gain should also help to inspire novel biomimetic solutions for many nanoengineering problems, such as nanosensor design for targeted drug delivery."

The researchers envision exciting new work that will grow from their recent findings. "So far, we've focused on equilibrium data, but the signaling process has a critical nonequilibrium component that we haven't explored yet," Varma said. The group also plans to explore the role of the surrounding waters in signaling in greater detail, as well as apply their machine learning techniques to a wide set of protein families to determine the extent to which their new biophysical findings are generalizable.

Explore further: Ras protein's role in spreading cancer

More information: "Machine learning approaches to evaluate correlation patterns in allosteric signaling: a case study of PDZ2 domain," Journal of Chemical Physics (2018). DOI: 10.1063/1.5022469

Related Stories

Ras protein's role in spreading cancer

February 18, 2018

Protein systems, such as Ras, make up the complex signaling pathways that control whether a cell divides or, in some cases, becomes cancerous and metastasizes into other regions of the body. For example, 98 percent of pancreatic ...

Quantum machine shows promise for biological research

February 27, 2018

To date, much has been stated about the promise of quantum computing for myriad of applications but there have been few examples of a quantum advantage for real-world problems of practical interest. This might change with ...

Recommended for you

Coffee-based colloids for direct solar absorption

March 22, 2019

Solar energy is one of the most promising resources to help reduce fossil fuel consumption and mitigate greenhouse gas emissions to power a sustainable future. Devices presently in use to convert solar energy into thermal ...

Physicists reveal why matter dominates universe

March 21, 2019

Physicists in the College of Arts and Sciences at Syracuse University have confirmed that matter and antimatter decay differently for elementary particles containing charmed quarks.

ATLAS experiment observes light scattering off light

March 20, 2019

Light-by-light scattering is a very rare phenomenon in which two photons interact, producing another pair of photons. This process was among the earliest predictions of quantum electrodynamics (QED), the quantum theory of ...

How heavy elements come about in the universe

March 19, 2019

Heavy elements are produced during stellar explosion or on the surfaces of neutron stars through the capture of hydrogen nuclei (protons). This occurs at extremely high temperatures, but at relatively low energies. An international ...


Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.