A way to use artificial intelligence to predict chemical reactions

December 8, 2017 by Bob Yirka, Phys.org report

(Phys.org)—A team of researchers with IBM has applied artificial intelligence to predict organic chemical reactions. In their paper uploaded to the preprint server arXiv, the group outlines their approach, which they describe as an improvement over other models.

Predicting what will happen when chemicals are mixed or treated in certain ways is difficult because of all the variables involved. But scientists would like to have a tool that does it anyway, because it would dramatically speed up development of useful new materials, especially drugs. In this new effort, the team at IBM has taken an entirely new approach to creating such a tool.

The new approach involves treating chemical reactions as a translation problem by rephrasing elements in such predictions as letters and words rather than atoms and molecules. That changes the problem from one of predicting how chemicals will react to translating words from one form to another—a problem that has been mostly solved by AI systems.

In using such an , the group was able to feed components into a trained on a dataset of 395,496 reactions. The neural network then used what it had learned about prior reactions to make predictions about what would occur under new conditions. In practice, the system responded to such requests by offering a top-five list of possible outcomes. Testing showed that the top prediction turned out be correct 80 percent of the time, though the team has thus far only trained it on molecules with 150 atoms or less. They plan to keep working on the system and have a current goal of improving its accuracy to 90 percent. They also have plans for modifying it so that parameters such as heat, pH levels and solvents can be included. They even envision one-day hosting contests between their system and human chemists to demonstrate how well it works.

The group notes that the development of such a system is not meant to serve as a replacement for chemists, but instead to serve as a tool for them, to develop products faster or more cheaply. They plan to put the system on a cloud server so that anyone who wishes to use it may do so.

The team presented their work at this week's Neural Information Processing Systems conference.

Explore further: Using neural networks to predict outcomes of organic chemistry

More information: "Found in Translation": Predicting Outcomes of Complex Organic Chemistry Reactions using Neural Sequence-to-Sequence Models, arXiv:1711.04810 [cs.LG] arxiv.org/abs/1711.04810

There is an intuitive analogy of an organic chemist's understanding of a compound and a language speaker's understanding of a word. Consequently, it is possible to introduce the basic concepts and analyze potential impacts of linguistic analysis to the world of organic chemistry. In this work, we cast the reaction prediction task as a translation problem by introducing a template-free sequence-to-sequence model, trained end-to-end and fully data-driven. We propose a novel way of tokenization, which is arbitrarily extensible with reaction information. With this approach, we demonstrate results superior to the state-of-the-art solution by a significant margin on the top-1 accuracy. Specifically, our approach achieves an accuracy of 80.1% without relying on auxiliary knowledge such as reaction templates. Also, 66.4% accuracy is reached on a larger and noisier dataset.

Related Stories

Calcium compound breaks 'like repels like' rule

December 4, 2017

(Phys.org)—A combined team of chemists from the University of Bath in the U.K. and Université Toulouse III–Paul Sabatier, UMR in France has found an instance of a calcium compound breaking the 'like repels like' chemistry ...

Modeling surface chemistry and predicting new materials

November 3, 2017

The ruddy flakes of a rusted nail are a sure sign that an undesirable chemical reaction has occurred at the surface. Understanding how molecules and atoms behave with each other, especially at surfaces, is central to managing ...

Recommended for you

Detecting metabolites at close range

June 22, 2018

A novel concept for a biosensor of the metabolite lactate combines an electron transporting polymer with lactate oxidase, which is the enzyme that specifically catalyzes the oxidation of lactate. Lactate is associated with ...

Research team uncovers lost images from the 19th century

June 22, 2018

Art curators will be able to recover images on daguerreotypes, the earliest form of photography that used silver plates, after a team of scientists led by Western University learned how to use light to see through degradation ...

CryoEM study captures opioid signaling in the act

June 22, 2018

Opioid drugs like morphine and fentanyl are a mainstay of modern pain medicine. But they also cause constipation, are highly addictive, and can lead to fatal respiratory failure if taken at too high a dose. Scientists have ...

Researchers achieve unprecedented control of polymer grids

June 21, 2018

Synthetic polymers are ubiquitous—nylon, polyester, Teflon and epoxy, to name just a few—and these polymers are all long, linear structures that tangle into imprecise structures. Chemists have long dreamed of making polymers ...

Template to create superatoms could make for better batteries

June 21, 2018

Virginia Commonwealth University researchers have discovered a novel strategy for creating superatoms—combinations of atoms that can mimic the properties of more than one group of elements of the periodic table. These superatoms ...


Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.