January 3, 2024

In a world run by catalysts, why is optimizing them still so tough?

by Kaitlyn Landram, Carnegie Mellon University Mechanical Engineering

We depend on catalysts to turn our milk into yogurt, to produce Post-It notes from paper pulp, and to unlock renewable energy sources like biofuels. Finding optimal catalyst materials for specific reactions requires laborious experiments and computationally intensive quantum chemistry calculations.

Oftentimes, scientists turn to graph neural networks (GNNs) to capture and predict the structural intricacy of atomic systems, an efficient system only after the meticulous conversion of 3D atomic structures into precise spatial coordinates on the graph is complete.

CatBERTa, an energy prediction Transformer model, was developed by researchers in Carnegie Mellon University's College of Engineering as an approach to tackle molecular property prediction using machine learning.

"This is the first approach using a large language model (LLM) for this task, so we are opening up a new avenue for modeling," said Janghoon Ock, Ph.D. candidate in Amir Barati Farimani's lab.

A key differentiator is the model's ability to directly employ text (natural language) without any preprocessing to predict the properties of the adsorbate-catalyst system. This method is notably beneficial as it remains easily interpretable by humans, allowing researchers to integrate observable features into their data seamlessly.

Additionally, applying the transformer model in their research offers substantial insights. The self-attention scores, particularly, are crucial in enhancing their comprehension of interpretability within this framework.

"I can't say that it will be an alternative to state-of-the-art GNNs, but maybe we can use this as a complementary approach," said Ock. "As they say, 'The more the merrier.'"

The model delivers predictive accuracy comparable to that achieved by earlier versions of GNNs. Notably, CatBERTa was more successful when trained on limited-size data sets. Additionally, CatBERTa has surpassed the error cancellation abilities of existing GNNs.

The team focused on adsorption energy but said that the approach can be extended to other properties, such as the HOMO-LUMO gap and stabilities related to adsorbate-catalyst systems, given an apt dataset.

By integrating the capabilities of extensive language models with the demands of catalyst discovery, the team aims to streamline the process of effective catalyst screening. Ock is working to improve the accuracy of the model.

The findings are published in the journal ACS Catalysis.

More information: Janghoon Ock et al, Catalyst Energy Prediction with CatBERTa: Unveiling Feature Exploration Strategies through Large Language Models, ACS Catalysis (2023). DOI: 10.1021/acscatal.3c04956

Journal information: ACS Catalysis

Provided by Carnegie Mellon University Mechanical Engineering

Citation: In a world run by catalysts, why is optimizing them still so tough? (2024, January 3) retrieved 27 April 2024 from https://phys.org/news/2024-01-world-catalysts-optimizing-tough.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

A machine learning model for identifying new compounds to fight against global warming

47 shares

Feedback to editors

In a world run by catalysts, why is optimizing them still so tough?

Optical barcodes expand range of high-resolution sensor

Ridesourcing platforms thrive on socio-economic inequality, say researchers

Did Vesuvius bury the home of the first Roman emperor?

Florida dolphin found with highly pathogenic avian flu: Report

A new way to study and help prevent landslides

New algorithm cuts through 'noisy' data to better predict tipping points

Researchers reconstruct landscapes that greeted the first humans in Australia around 65,000 years ago

High-precision blood glucose level prediction achieved by few-molecule reservoir computing

Enhancing memory technology: Multiferroic nanodots for low-power magnetic storage

Researchers advance detection of gravitational waves to study collisions of neutron stars and black holes

Relevant PhysicsForums posts

Ideas for a project in computational chemistry?

Very confused about Naunyn definition of acid and base

Can you eat the Periodic Table?

New Insight into the Chemistry of Solvents

Separation of KCl from potassium chromium(III) PDTA

Zirconium Versus Zirconium Carbide For Use With Galinstan

A machine learning model for identifying new compounds to fight against global warming

Graph neural networks: A new frontier in predicting hospital infections

More metal-organic frameworks, fewer problems: A self-supervised transformer model for property prediction

Study explores the scaling of deep learning models for chemistry research

Fine-structure sensitive deep learning framework for prediction of catalytic properties with high precision

Artificial intelligence for drug discovery offers up unexpected results

Scientists discover safer alternative for an explosive reaction used for more than 100 years

Thiol-ene click reaction offers a novel approach to fabricate elastic ferroelectrics

More efficient molecular motor widens potential applications

A shortcut for drug discovery: Novel method predicts on a large scale how small molecules interact with proteins

Freeze casting—a guide to creating hierarchically structured materials

Synthesis of two new carbides provides perspective on how complex carbon structures could exist on other planets

Medical Xpress

Tech Xplore

Science X

In a world run by catalysts, why is optimizing them still so tough?

Optical barcodes expand range of high-resolution sensor

Ridesourcing platforms thrive on socio-economic inequality, say researchers

Did Vesuvius bury the home of the first Roman emperor?

Florida dolphin found with highly pathogenic avian flu: Report

A new way to study and help prevent landslides

New algorithm cuts through 'noisy' data to better predict tipping points

Researchers reconstruct landscapes that greeted the first humans in Australia around 65,000 years ago

High-precision blood glucose level prediction achieved by few-molecule reservoir computing

Enhancing memory technology: Multiferroic nanodots for low-power magnetic storage

Researchers advance detection of gravitational waves to study collisions of neutron stars and black holes

Relevant PhysicsForums posts

Related Stories

A machine learning model for identifying new compounds to fight against global warming

Graph neural networks: A new frontier in predicting hospital infections

More metal-organic frameworks, fewer problems: A self-supervised transformer model for property prediction

Study explores the scaling of deep learning models for chemistry research

Fine-structure sensitive deep learning framework for prediction of catalytic properties with high precision

Artificial intelligence for drug discovery offers up unexpected results

Recommended for you

Scientists discover safer alternative for an explosive reaction used for more than 100 years

Thiol-ene click reaction offers a novel approach to fabricate elastic ferroelectrics

More efficient molecular motor widens potential applications

A shortcut for drug discovery: Novel method predicts on a large scale how small molecules interact with proteins

Freeze casting—a guide to creating hierarchically structured materials

Synthesis of two new carbides provides perspective on how complex carbon structures could exist on other planets

Newsletter sign up

Donate and enjoy an ad-free experience