June 21, 2024

New computational tool helps interpret AI models in genomics

SQUID pries open AI black box: an illustration outlining the SQUID computational pipeline. Credit: Koo and Kinney labs/Cold Spring Harbor Laboratory

Artificial intelligence continues to squirm its way into many aspects of our lives. But what about biology, the study of life itself? AI can sift through hundreds of thousands of genome data points to identify potential new therapeutic targets. While these genomic insights may appear helpful, scientists aren't sure how today's AI models come to their conclusions in the first place. Now, a new system named SQUID arrives on the scene, armed to pry open AI's black box of murky internal logic.

SQUID, short for Surrogate Quantitative Interpretability for Deepnets, is a computational tool created by Cold Spring Harbor Laboratory (CSHL) scientists. It's designed to help interpret how AI models analyze the genome. Compared with other analysis tools, SQUID is more consistent, reduces background noise, and can lead to more accurate predictions about the effects of genetic mutations.

How does it work so much better? The key, CSHL Assistant Professor Peter Koo says, lies in SQUID's specialized training.

"The tools that people use to try to understand these models have been largely coming from other fields like computer vision or natural language processing. While they can be useful, they're not optimal for genomics. What we did with SQUID was leverage decades of quantitative genetics knowledge to help us understand what these deep neural networks are learning," explains Koo.

SQUID works by first generating a library of over 100,000 variant DNA sequences. It then analyzes the library of mutations and their effects using a program called MAVE-NN (Multiplex Assays of Variant Effects Neural Network). This tool allows scientists to perform thousands of virtual experiments simultaneously. In effect, they can "fish out" the algorithms behind a given AI's most accurate predictions. Their computational "catch" could set the stage for experiments that are more grounded in reality.
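The idea in the paragraph above can be illustrated with a small, hedged sketch in Python. It is not SQUID's code and does not use MAVE-NN. It assumes a made-up "black box" (`toy_black_box`, which rewards a hypothetical GATA motif), a made-up wild-type sequence, a library of only 5,000 variants to keep it fast, and ordinary least squares in place of MAVE-NN's surrogate models. What it does show is the general pattern: mutate a sequence many times, ask the model to score every variant, then fit a simple, readable model to those scores.

```python
import numpy as np

ALPHABET = "ACGT"
WILD_TYPE = "CCTGATAACG"        # hypothetical sequence, for illustration only
MOTIF, MOTIF_START = "GATA", 3  # hypothetical motif the toy model responds to


def mutate_library(wild_type, n_variants, mut_rate, rng):
    """Return integer-encoded variants of wild_type with random substitutions."""
    wt = np.array([ALPHABET.index(c) for c in wild_type])
    lib = np.tile(wt, (n_variants, 1))
    mask = rng.random(lib.shape) < mut_rate
    # Shift masked bases by 1..3 (mod 4) so a mutated base always differs.
    shifts = rng.integers(1, 4, size=lib.shape)
    lib[mask] = (lib[mask] + shifts[mask]) % 4
    return lib


def toy_black_box(lib):
    """Stand-in for a deep network: nonlinear score based on motif matches."""
    motif = np.array([ALPHABET.index(c) for c in MOTIF])
    window = lib[:, MOTIF_START:MOTIF_START + len(MOTIF)]
    matches = (window == motif).sum(axis=1)
    return (matches / len(MOTIF)) ** 2


def fit_additive_surrogate(lib, scores):
    """Least-squares additive model: one parameter per (position, base)."""
    X = np.eye(4)[lib].reshape(lib.shape[0], -1)  # one-hot, shape (n, 4 * L)
    theta, *_ = np.linalg.lstsq(X, scores, rcond=None)
    return theta.reshape(lib.shape[1], 4)


rng = np.random.default_rng(0)
library = mutate_library(WILD_TYPE, n_variants=5000, mut_rate=0.2, rng=rng)
scores = toy_black_box(library)           # the "virtual experiments"
theta = fit_additive_surrogate(library, scores)

# Express each parameter relative to the wild-type base at that position,
# then summarize every position by its largest mutation effect.
wt_idx = np.array([ALPHABET.index(c) for c in WILD_TYPE])
effects = theta - theta[np.arange(len(wt_idx)), wt_idx][:, None]
importance = np.abs(effects).max(axis=1)
print(np.round(importance, 2))
```

In this toy setup the four motif positions should stand out clearly from the rest, which is the kind of readout a surrogate model is meant to provide. A real analysis would replace `toy_black_box` with a trained genomic network and, per the study, fit the surrogate with MAVE-NN rather than plain least squares.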

"In silico [virtual] experiments are no replacement for actual laboratory experiments. Nevertheless, they can be very informative. They can help scientists form hypotheses for how a particular region of the genome works or how a mutation might have a clinically relevant effect," explains CSHL Associate Professor Justin Kinney, a co-author of the study.

There are tons of AI models in the sea. More enter the waters each day. Koo, Kinney, and colleagues hope that SQUID will help scientists grab hold of those that best meet their specialized needs.

Though mapped, the human genome remains an incredibly challenging terrain. SQUID could help biologists navigate the field more effectively, bringing them closer to their findings' true medical implications.

The research is published in the journal Nature Machine Intelligence.

More information: Interpreting cis-regulatory mechanisms from genomic deep neural networks using surrogate models, Nature Machine Intelligence, DOI: 10.1038/s42256-024-00851-5


Provided by Cold Spring Harbor Laboratory

Citation: New computational tool helps interpret AI models in genomics (2024, June 21) retrieved 6 August 2024 from https://phys.org/news/2024-06-tool-ai-genomics.html

