October 11, 2023

Detection and extraction of similar features in disease-related gene groups

Detection and extraction of similar features in the disease-related gene groups — After converting Gene expression profile, DNA methylation profile, SNP profile into line of the subject participants’ square through linear kernelization to each, extract the subject pattern through tensor decompositions, and by selecting genes, DNA methylation sites, SNPs which synchronize with the patterns, we can select through the data-driven method without external information. Since many of the genes and DNA methylation sites are being targeted by transcription factors, and those transcription factors’ parts binding to DNA were statistically and significantly overlapped with SNP, integrated analysis of multiomics data can be said to be successful. In addition, considering the fact that transcription factors are related to disease, multiomics data plus tensor decompositions is a method of analysis which is expected to make future disease predictable (pre-symptomatic state). Credit: Y-h. Taguchi, Shohei Komaki, Yoichi Sutoh, Hideki Ohmomo, Yayoi Otsuka-Yamasaki, Atsushi Shimizu

Multiomics analysis that integrates different layers of profiles altogether is challenging, since the number of variables in profile substantially differ from each other. For instance, gene expression profile and genomic DNA methylation profile are often analyzed together; however, there are only tens of thousands of genes, whereas the number of DNA methylation sites are as many as tens of millions.

The numbers differ by orders of magnitude and the number of pairs between gene and DNA methylation sites are enormous. As such, it requires huge computational resources to conduct integrated analysis without controlling target numbers by focusing on DNA methylation sites in specific regions, such as promoter regions, based on prior knowledge. However, limiting the genomic regions to be analyzed, effects of DNA methylation on other regions (e.g., enhancer) and functions remain unexplored.

The study and the accomplishment

Published in the journal PLOS ONE, a recent study applied the method developed in the previous study to treat multiomics data (gene expression profile, DNA methylation profile, Single Nucleotide Polymorphism (SNP) profile) which Iwate Tohoku Medical Megabank Organization (IMM) comprehensively collected from 100 local resident participants, and confirmed whether the relationships with disease-related gene can be identified or not.

This is the data-driven approach called the variable extraction method which employed kernel tensor decomposition-based unsupervised study (hereafter called tensor decomposition), and this method is applicable to the datasets with all subjects belonging to a healthy group.

In addition, this method is implementable with kernel sized (square of the subject participants, in particular) or so memories per 1 profile, and thus, even for the enormous profiles such as genome and epigenome comprising tens of millions of SNP or DNA methylation sites, data-driven analysis can identify unique patterns across study subjects and identify the variables (gene expression profile, DNA methylation profile, SNP profile) that exhibit similarity to those patterns.

In this study, tensor decomposition was applied to multiomics data of each autosome retrieved from three cell types, CD4 positive T cells, monocytes, and neutrophils. As a result, the two patterns of subjects profiles were identified, and these two patterns of subjects observed in 22 autosomes show very strong mutual correlations between the other autosomes. As the genes identified in each autosome are completely independent from one another, it suggests that the observed patterns shared across chromosomes are not coincidence.

The observed orthogonal patterns also cannot be explained by batch effects, and it is improbable that the same batch effects are present in the three omics profiles obtained by the different methodologies. These two patterns of subjects are obtained as the second and the third singular value vectors by tensor decompositions, respectively. The second singular value vector was detected from all three cell types while the third singular value vector was detected from two cell types except monocytes.

Then, the genes and DNA methylation regions with homological profiles with these patterns were selected to find out that these genes and regions are targeted by many transcription factors. Furthermore, the enrichment analysis revealed that these transcription factors relate to various diseases.

In addition, the study found that identified SNP are statistically and significantly overlapped with binding sites of these transcription factors. Therefore, the authors believe that the application of tensor decompositions is effective for the integrated analysis of multiomics datasets.

More information: Y-h. Taguchi et al, Integrated analysis of human DNA methylation, gene expression, and genomic variation in iMETHYL database using kernel tensor decomposition-based unsupervised feature extraction, PLOS ONE (2023). DOI: 10.1371/journal.pone.0289029

Journal information: PLoS ONE

Provided by Chuo University

Citation: Detection and extraction of similar features in disease-related gene groups (2023, October 11) retrieved 17 July 2024 from https://phys.org/news/2023-10-similar-features-disease-related-gene-groups.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

How chemical modifications on DNA keep genes silent

32 shares

Feedback to editors

Detection and extraction of similar features in disease-related gene groups

The study and the accomplishment

New 3D anatomical atlas of the African clawed frog increases understanding of development and metamorphosis processes

Intensive farming could raise risk of new pandemics, researchers warn

Scientists develop new AI method to create material 'fingerprints'

Study shows frogs can quickly increase their tolerance to pesticides

Nature-based solutions to disaster risk from climate change are cost-effective, study confirms

Astronomers discover what may be 21 neutron stars orbiting sun-like stars

Scientists use machine learning to predict diversity of tree species in forests

Physicists pool skills to better describe the unstable sigma meson particle

Telescope tag-team discovers 10 strange and exotic pulsars

NASA transmits hip-hop song to deep space for first time

Relevant PhysicsForums posts

Understanding COVID Quarantine Guidance

New and Interesting Publications Relevant to the Origin of Life

The Cass Report (UK)

Medical tape cut off blood flow to fetus?

Is meat broth really nutritious?

Havana Syndrome

How chemical modifications on DNA keep genes silent

13 special genes linked to epigenetic changes in cancer samples

Scientists reveal genomic distribution and evolutionary patterns of 6mA modifications in plants

Researchers discover potential target for gastric cancers associated with Epstein-Barr virus

Study finds linkage between DNA methylome and RNA transcriptome in Fe deficiency in tomato roots

DNA methylation changes in neurons of bipolar disorder patients

A new addition to the CRISPR toolbox: Teaching the gene scissors to detect RNA

Ancient microbes offer clues to how complex life evolved

Plant research could pave the way for growing crops with seawater

Unique characteristics of previously unexplored protein discovered

Researchers map fibrillization process to reveal mechanisms of amyloid polymorphism

Discovery of a hybrid lineage offers clues to how trees adapt to climate change

Medical Xpress

Tech Xplore

Science X

Detection and extraction of similar features in disease-related gene groups

The study and the accomplishment

New 3D anatomical atlas of the African clawed frog increases understanding of development and metamorphosis processes

Intensive farming could raise risk of new pandemics, researchers warn

Scientists develop new AI method to create material 'fingerprints'

Study shows frogs can quickly increase their tolerance to pesticides

Nature-based solutions to disaster risk from climate change are cost-effective, study confirms

Astronomers discover what may be 21 neutron stars orbiting sun-like stars

Scientists use machine learning to predict diversity of tree species in forests

Physicists pool skills to better describe the unstable sigma meson particle

Telescope tag-team discovers 10 strange and exotic pulsars

NASA transmits hip-hop song to deep space for first time

Relevant PhysicsForums posts

Related Stories

How chemical modifications on DNA keep genes silent

13 special genes linked to epigenetic changes in cancer samples

Scientists reveal genomic distribution and evolutionary patterns of 6mA modifications in plants

Researchers discover potential target for gastric cancers associated with Epstein-Barr virus

Study finds linkage between DNA methylome and RNA transcriptome in Fe deficiency in tomato roots

DNA methylation changes in neurons of bipolar disorder patients

Recommended for you

A new addition to the CRISPR toolbox: Teaching the gene scissors to detect RNA

Ancient microbes offer clues to how complex life evolved

Plant research could pave the way for growing crops with seawater

Unique characteristics of previously unexplored protein discovered

Researchers map fibrillization process to reveal mechanisms of amyloid polymorphism

Discovery of a hybrid lineage offers clues to how trees adapt to climate change

Newsletter sign up

Donate and enjoy an ad-free experience