June 5, 2023

The digital dark matter clouding AI in genome analysis

by Luis Sandoval, Cold Spring Harbor Laboratory

Artificial intelligence has entered our daily lives. First, it was ChatGPT. Now, it's AI-generated pizza and beer commercials. While we can't trust AI to be perfect, it turns out that sometimes we can't trust ourselves with AI either.

Cold Spring Harbor Laboratory (CSHL) Assistant Professor Peter Koo has found that scientists using popular computational tools to interpret AI predictions are picking up too much "noise," or extra information, when analyzing DNA. And he's found a way to fix this. Now, with just a couple new lines of code, scientists can get more reliable explanations out of powerful AIs known as deep neural networks. That means they can continue chasing down genuine DNA features. Those features might just signal the next breakthrough in health and medicine. But scientists won't see the signals if they're drowned out by too much noise.

So, what causes the meddlesome noise? It's a mysterious and invisible source like digital "dark matter." Physicists and astronomers believe most of the universe is filled with dark matter, a material that exerts gravitational effects but that no one has yet seen. Similarly, Koo and his team discovered the data that AI is being trained on lacks critical information, leading to significant blind spots. Even worse, those blind spots get factored in when interpreting AI predictions of DNA function. The study is published in the journal Genome Biology.

Koo says, "The deep neural network is incorporating this random behavior because it learns a function everywhere. But DNA is only in a small subspace of that. And it introduces a lot of noise. And so we show that this problem actually does introduce a lot of noise across a wide variety of prominent AI models."

The digital dark matter is a result of scientists borrowing computational techniques from computer vision AI. DNA data, unlike images, is confined to a combination of four nucleotide letters: A, C, G, T. But image data in the form of pixels can be long and continuous. In other words, we're feeding AI an input it doesn't know how to handle properly.

By applying Koo's computational correction, scientists can interpret AI's DNA analyses more accurately.

Koo says, "We end up seeing sites that become much more crisp and clean, and there is less spurious noise in other regions. One-off nucleotides that are deemed to be very important all of a sudden disappear."

Koo believes noise disturbance affects more than AI-powered DNA analyzers. He thinks it's a widespread affliction among computational processes involving similar types of data. Remember, dark matter is everywhere. Thankfully, Koo's new tool can help bring scientists out of the darkness and into the light.

More information: Antonio Majdandzic et al, Correcting gradient-based interpretations of deep neural networks for genomics, Genome Biology (2023). DOI: 10.1186/s13059-023-02956-3

Journal information: Genome Biology

Provided by Cold Spring Harbor Laboratory

Citation: The digital dark matter clouding AI in genome analysis (2023, June 5) retrieved 18 July 2024 from https://phys.org/news/2023-06-digital-dark-clouding-ai-genome.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

An experimental method for examining ultra-light dark matter using millimeter-wave sensing

76 shares

Feedback to editors

The digital dark matter clouding AI in genome analysis

Observations detect a nearby hypervelocity stellar/substellar object

Study shows small animals use 'stolen' genes from bacteria to protect against infection

Minerals play newly discovered role in Earth's phosphorus cycle

Pompeii skeleton discovery shows another natural disaster may have made Vesuvius eruption even more deadly

New Jersey salt marsh sediments offer evidence of hurricanes back to the 1500s

Study identifies RNA molecule that regulates cellular aging

CERN physicist explains how team uses subatomic splashes to restart experiments after annual upgrades

New research sheds light on river dynamics and cutoff regimes

Microbial structures in Antarctic lake could reveal more about how life evolved

Sea ice's cooling power is waning faster than its area of extent, new study finds

Relevant PhysicsForums posts

Understanding COVID Quarantine Guidance

New and Interesting Publications Relevant to the Origin of Life

The Cass Report (UK)

Medical tape cut off blood flow to fetus?

Is meat broth really nutritious?

Havana Syndrome

An experimental method for examining ultra-light dark matter using millimeter-wave sensing

AI training: A backward cat pic is still a cat pic

Dark matter can make dark atoms, say theoretical astrophysicists

Observation, simulation, and AI join forces to reveal a clear universe

Researchers use deep learning to 'denoise' nanopore data

Cutting through the noise: AI enables high-fidelity quantum computing

Study shows small animals use 'stolen' genes from bacteria to protect against infection

Smart soil can water and feed itself

Study identifies RNA molecule that regulates cellular aging

Microbes found to destroy certain 'forever chemicals' by cleaving stubborn fluorine-to-carbon bonds

Scientists identify brain circuits tied to the behavior of schooling fish

Study shows ancient viruses fuel modern-day cancers

Medical Xpress

Tech Xplore

Science X

The digital dark matter clouding AI in genome analysis

Observations detect a nearby hypervelocity stellar/substellar object

Study shows small animals use 'stolen' genes from bacteria to protect against infection

Minerals play newly discovered role in Earth's phosphorus cycle

Pompeii skeleton discovery shows another natural disaster may have made Vesuvius eruption even more deadly

New Jersey salt marsh sediments offer evidence of hurricanes back to the 1500s

Study identifies RNA molecule that regulates cellular aging

CERN physicist explains how team uses subatomic splashes to restart experiments after annual upgrades

New research sheds light on river dynamics and cutoff regimes

Microbial structures in Antarctic lake could reveal more about how life evolved

Sea ice's cooling power is waning faster than its area of extent, new study finds

Relevant PhysicsForums posts

Related Stories

An experimental method for examining ultra-light dark matter using millimeter-wave sensing

AI training: A backward cat pic is still a cat pic

Dark matter can make dark atoms, say theoretical astrophysicists

Observation, simulation, and AI join forces to reveal a clear universe

Researchers use deep learning to 'denoise' nanopore data

Cutting through the noise: AI enables high-fidelity quantum computing

Recommended for you

Study shows small animals use 'stolen' genes from bacteria to protect against infection

Smart soil can water and feed itself

Study identifies RNA molecule that regulates cellular aging

Microbes found to destroy certain 'forever chemicals' by cleaving stubborn fluorine-to-carbon bonds

Scientists identify brain circuits tied to the behavior of schooling fish

Study shows ancient viruses fuel modern-day cancers

Newsletter sign up

Donate and enjoy an ad-free experience