Molecular dynamics, machine learning create 'hyper-predictive' computer models

May 15, 2017, North Carolina State University
Molecular dynamics (MD) simulations of ERK2 inhibitors to extract MD descriptors for next-generation cheminformatics analysis and machine learning. Credit: North Carolina State University

Researchers from North Carolina State University have demonstrated that molecular dynamics simulations and machine learning techniques could be integrated to create more accurate computer prediction models. These "hyper-predictive" models could be used to quickly predict which new chemical compounds could be promising drug candidates.

Drug development is a costly and time-consuming process. To narrow down the number of chemical compounds that could be potential candidates, scientists utilize computer models that can predict how a particular chemical compound might interact with a biological target of interest - for example, a key protein that might be involved with a disease process. Traditionally, this is done via quantitative structure-activity relationship (QSAR) modeling and molecular docking, which rely on 2- and 3-D information about those chemicals.

Denis Fourches, assistant professor of computational chemistry, wanted to improve upon the accuracy of these QSAR models. "When you're screening a set of 30 million compounds, you don't necessarily need a very high reliability with your model - you're just getting a ballpark idea about the top 5 or 10 percent of that virtual library. But if you're attempting to narrow a field of 200 analogues down to 10, which is more commonly the case in drug development, your modeling technique must be extremely accurate. Current techniques are definitely not reliable enough."

Fourches and Jeremy Ash, a graduate student in bioinformatics, decided to incorporate the results of molecular dynamics calculations - all-atom simulations of how a particular compound moves in the binding pocket of a protein - into prediction models based on machine learning.

"Most models only use the two-dimensional structures of molecules," Fourches says. "But in reality, chemicals are complex three-dimensional objects that move, vibrate and have dynamic intermolecular interactions with the protein once docked in its binding site. You cannot see that if you just look at the 2-D or 3-D structure of a given molecule."

In a proof-of-concept study, Fourches and Ash looked at the ERK2 kinase - an enzyme associated with several types of cancer - and a group of 87 known ERK2 inhibitors, ranging from very active to inactive. They ran independent molecular dynamics (MD) simulations for each of those 87 compounds and computed critical information about the flexibility of each compound once in the ERK2 pocket. Then they analyzed the MD descriptors using cheminformatics techniques and machine learning. The MD descriptors were able to accurately distinguish active ERK2 inhibitors from weakly active and inactive compounds, which was not the case when the models used only 2-D and 3-D structural information.
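The general idea of turning a simulation into model inputs can be illustrated with a small Python sketch. It is not the authors' pipeline: the trajectories are synthetic random numbers, the two per-frame properties are hypothetical, the difference in fluctuation between the two classes is an arbitrary assumption made so the toy has something to learn, and a simple nearest-centroid rule stands in for the machine learning methods used in the study. It only shows how a per-frame time series might be collapsed into time-averaged descriptors and then passed to a classifier.

```python
import random
import statistics


def md_descriptors(trajectory):
    """Collapse a trajectory into time-averaged descriptors.

    trajectory: list of frames, each frame a list of per-frame property values.
    Returns [mean_0, std_0, mean_1, std_1, ...], one mean/std pair per property.
    """
    features = []
    for series in zip(*trajectory):  # one time series per property
        features.append(statistics.fmean(series))
        features.append(statistics.pstdev(series))
    return features


def synthetic_trajectory(rng, spread, n_frames=200):
    """Hypothetical stand-in for an MD run: two made-up properties per frame.

    The first property fluctuates with the given spread; the second is the
    same for every compound. Real descriptors would come from a simulation.
    """
    return [[rng.gauss(1.0, spread), rng.gauss(3.0, 0.2)] for _ in range(n_frames)]


def centroid(rows):
    """Column-wise mean of a list of feature vectors."""
    return [statistics.fmean(col) for col in zip(*rows)]


def predict(centroids, x):
    """Return the label whose centroid is closest to feature vector x."""
    def sqdist(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))
    return min(centroids, key=lambda label: sqdist(centroids[label], x))


def run_demo(seed=0, n_train=20, n_test=10):
    """Train and evaluate the toy classifier; returns test accuracy."""
    rng = random.Random(seed)
    # Arbitrary toy assumption: the two classes differ in fluctuation size.
    spreads = {"class_a": 0.1, "class_b": 0.5}

    centroids = {}
    for label, spread in spreads.items():
        rows = [md_descriptors(synthetic_trajectory(rng, spread)) for _ in range(n_train)]
        centroids[label] = centroid(rows)

    correct = 0
    total = 0
    for label, spread in spreads.items():
        for _ in range(n_test):
            x = md_descriptors(synthetic_trajectory(rng, spread))
            correct += predict(centroids, x) == label
            total += 1
    return correct / total


if __name__ == "__main__":
    print("toy test accuracy:", run_demo())
```

The point of the sketch is that a static 2-D or 3-D structure gives one value per property, whereas a trajectory also yields how much that property fluctuates over time, which becomes an extra input for the model.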

"We already had data about these 87 molecules and their activity at ERK2," Fourches says. "So we tested to see if our model was able to reliably find the most active compounds. Indeed, it accurately distinguished between strong and weak ERK2 inhibitors, and because MD descriptors encoded the interactions those compounds create in the pocket of ERK2, it also gave us more insight into why the strong inhibitors worked well.

"Before computing advances allowed us to simulate this kind of data, it would have taken us six months to simulate one single molecule in the pocket of ERK2. Thanks to GPU acceleration, now it only takes three hours. That is a game changer. I'm hopeful that incorporating data extracted from molecular dynamics into QSAR models will enable a new generation of hyper-predictive models that will help bring novel, effective drugs onto the market even faster. It's artificial intelligence working for us to discover the drugs of tomorrow."

More information: Jeremy Ash et al, Characterizing the Chemical Space of ERK2 Kinase Inhibitors Using Descriptors Computed from Molecular Dynamics Trajectories, Journal of Chemical Information and Modeling (2017). DOI: 10.1021/acs.jcim.7b00048
