September 12, 2019 report

Chemists show how bias can crop up in machine learning algorithm results

by Bob Yirka , Phys.org

A team of materials scientists at Haverford College has shown how human bias in data can affect the results of machine-learning algorithms used to predict new reagents for making desired products. In their paper published in the journal Nature, the group describes testing a machine-learning algorithm on different types of datasets and reports what they found.

One of the better-known applications of machine-learning algorithms is facial recognition. But such algorithms can have problems. One occurs when a facial recognition algorithm intended to pick an individual out of many faces has been trained using people of just one race. In this new effort, the researchers wondered if bias, unintentional or otherwise, might be cropping up in the results of machine-learning algorithms used in chemistry applications designed to look for new products.

Such algorithms use data describing the ingredients of reactions that result in the creation of a new product. But the data the system is trained on could have a major impact on the results. The researchers note that currently, such data is obtained from published research efforts, which means it is typically generated by humans. They note that the data from such efforts could have been generated by the researchers themselves, or by other researchers working on separate efforts. Data could even come from a single person simply relating from memory, or from a professor's suggestion, or from a graduate student with a bright idea. The point is that the data could be biased by the background of its source.

To find out whether such biases affect the results of machine-learning algorithms used for chemistry applications, the researchers looked at a specific set of materials called amine-templated vanadium borates. When these are synthesized successfully, crystals form, which makes it easy to determine whether a reaction was successful.

The experiment consisted of training a machine-learning algorithm on data describing the synthesis of vanadium borates, and then having the system come up with syntheses of its own. Some of the data collected by the researchers was human-generated, and some of it was generated randomly. They report that the algorithm did better at finding ways to synthesize the vanadium borates when it was trained on the random data than when it was trained on the human-generated data. They claim this shows a clear bias in the data that was created by humans.
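The comparison described above can be illustrated with a toy sketch. Nothing in it comes from the paper: the one-dimensional "reaction condition", the success window of 60 to 80, the nearest-neighbor predictor and both training sets are hypothetical choices made only to show how a training set clustered around familiar conditions can miss a region that a more evenly spread set covers. An evenly spaced sample stands in for the randomly generated experiments so that the result is deterministic.

```python
# Toy illustration only: every number here is a made-up assumption,
# not data or a method from the Nature paper.

def succeeds(x):
    """Hypothetical ground truth: crystals form only for conditions 60..80."""
    return 60 <= x <= 80

def predict(train, x):
    """1-nearest-neighbor: copy the outcome of the closest tried condition."""
    nearest = min(train, key=lambda t: abs(t - x))
    return succeeds(nearest)

def accuracy(train):
    """Fraction of all conditions 0..100 whose outcome is predicted correctly."""
    tests = range(101)
    hits = sum(predict(train, x) == succeeds(x) for x in tests)
    return hits / len(tests)

# "Human-like" choices: ten experiments clustered in a familiar range.
human_like = list(range(10, 60, 5))    # 10, 15, ..., 55
# Stand-in for random sampling: ten experiments spread over the whole range.
spread_out = list(range(5, 100, 10))   # 5, 15, ..., 95

human_acc = accuracy(human_like)
spread_acc = accuracy(spread_out)

print(f"clustered training set: {human_acc:.3f}")
print(f"spread-out training set: {spread_acc:.3f}")
```

Both training sets contain ten experiments, so any difference in accuracy comes from where the experiments sit rather than how many there are. In this toy setup the clustered set never samples the successful window, so the predictor trained on it cannot find any successful condition.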

More information: Xiwen Jia et al. Anthropogenic biases in chemical reaction data hinder exploratory inorganic synthesis, Nature (2019). DOI: 10.1038/s41586-019-1540-5

Journal information: Nature

Citation: Chemists show how bias can crop up in machine learning algorithm results (2019, September 12) retrieved 17 July 2024 from https://phys.org/news/2019-09-chemists-bias-crop-machine-algorithm.html

