July 20, 2021 report

Using machine-learning to find mutations in similar genome sequences of cancer samples

by Bob Yirka , Phys.org

A team of researchers working at the Francis Crick Institute has developed a way to find mutations in similar genome regions of cancer samples. In their paper published in the journal Nature Biotechnology, the group describes using a machine-learning algorithm to spot cancerous mutations in non-unique parts of the genome.

As part of human evolutionary history, sections of the genome have undergone rearrangement, and in some cases, duplication. Such duplications have been found to be problematic when attempting to find mutations. Current scanning methods toss out short sequences that are identified as ambiguous, which means that segments of the genome that are very similar to one another are not included in such reports—and that means that any mutations will be missed. In this new effort, the researchers have developed a means for finding mutations in non-unique parts of the genome.

The approach involved first developing a list of genome regions known to be similar to other regions and then using them to teach a machine-learning algorithm how to recognize them. Researchers then used the algorithm to spot mutations in different tissues—2,658 samples from the Pan-Cancer Analysis of Whole Genome dataset. The researchers uncovered mutations in 1,744 coding sequences along with thousands of other mutations in non-coding sequences. They also found that their algorithm had a false discovery rate of approximately 7% and a validation rate of more than 80%.

The researchers noted that those mutations that involved coding sequences have an impact on protein sequences, some of which have been linked to cancer types. They also found instances of mutations that led to protein changes, that have also been linked to specific kinds of cancers. As one example, they found a recurrent mutation in the KMT2C and PIK3CA genes. They also found mutations that have been linked to breast cancer. And they found mutations that are involved in regulatory regions, including some in the immunoglobulin family.

The researchers suggest their technique can be used by other teams as a means to overcome issues with overlooking mutations in near-duplicate genetic regions.

More information: Maxime Tarabichi et al, A pan-cancer landscape of somatic mutations in non-unique regions of the human genome, Nature Biotechnology (2021). DOI: 10.1038/s41587-021-00971-y

Journal information: Nature Biotechnology

Citation: Using machine-learning to find mutations in similar genome sequences of cancer samples (2021, July 20) retrieved 22 June 2024 from https://phys.org/news/2021-07-machine-learning-mutations-similar-genome-sequences.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Study provides first genome-wide evidence for functional importance of unusual DNA structures

341 shares

Feedback to editors

Using machine-learning to find mutations in similar genome sequences of cancer samples

Saturday Citations: Bulking tips for black holes; microbes influence drinking; new dinosaur just dropped

China, France launch satellite to better understand the universe

Key mechanism in nuclear reaction dynamics promises advances in nuclear physics

Study challenges popular idea that Easter islanders committed 'ecocide'

New AI-driven tool improves root image segmentation

Many more bacteria produce greenhouse gases than previously thought, study finds

Stacking three layers of graphene with a twist speeds up electrochemical reactions

A black hole of inexplicable mass: JWST observations reveal a mature quasar at cosmic dawn

Beyond CRISPR: seekRNA delivers a new pathway for accurate gene editing

Transforming drug discovery with AI: New program transforms 3D information into data that typical models can use

Relevant PhysicsForums posts

COVID Virus Lives Longer with Higher CO2 In the Air

Is meat broth really nutritious?

Periodical Cicada Life Cycle

A DNA Animation

Innovative ideas and technologies to help folks with disabilities

How do fetuses breathe in the womb?

Study provides first genome-wide evidence for functional importance of unusual DNA structures

New method created for identifying genes behind brain tumors

New model by CHOP researchers identifies noncoding mutations across five pediatric cancers

Artificial intelligence helps to pinpoint roots of gastric cancer

Scientists discover 'jumping' genes that can protect against blood cancers

Even DNA that doesn't encode genes can drive cancer

Beyond CRISPR: seekRNA delivers a new pathway for accurate gene editing

New AI-driven tool improves root image segmentation

Transforming drug discovery with AI: New program transforms 3D information into data that typical models can use

Membrane protein analogs could accelerate drug discovery

Refining turbulent flow to scale up iPS cell-based platelet manufacturing

Intricate processes in photosynthesis decoded using advanced electron microscopy technique

Medical Xpress

Tech Xplore

Science X

Using machine-learning to find mutations in similar genome sequences of cancer samples

Saturday Citations: Bulking tips for black holes; microbes influence drinking; new dinosaur just dropped

China, France launch satellite to better understand the universe

Key mechanism in nuclear reaction dynamics promises advances in nuclear physics

Study challenges popular idea that Easter islanders committed 'ecocide'

New AI-driven tool improves root image segmentation

Many more bacteria produce greenhouse gases than previously thought, study finds

Stacking three layers of graphene with a twist speeds up electrochemical reactions

A black hole of inexplicable mass: JWST observations reveal a mature quasar at cosmic dawn

Beyond CRISPR: seekRNA delivers a new pathway for accurate gene editing

Transforming drug discovery with AI: New program transforms 3D information into data that typical models can use

Relevant PhysicsForums posts

Related Stories

Study provides first genome-wide evidence for functional importance of unusual DNA structures

New method created for identifying genes behind brain tumors

New model by CHOP researchers identifies noncoding mutations across five pediatric cancers

Artificial intelligence helps to pinpoint roots of gastric cancer

Scientists discover 'jumping' genes that can protect against blood cancers

Even DNA that doesn't encode genes can drive cancer

Recommended for you

Beyond CRISPR: seekRNA delivers a new pathway for accurate gene editing

New AI-driven tool improves root image segmentation

Transforming drug discovery with AI: New program transforms 3D information into data that typical models can use

Membrane protein analogs could accelerate drug discovery

Refining turbulent flow to scale up iPS cell-based platelet manufacturing

Intricate processes in photosynthesis decoded using advanced electron microscopy technique

Newsletter sign up

Donate and enjoy an ad-free experience