This article has been reviewed according to Science X's editorial process and policies. Editors have highlighted the following attributes while ensuring the content's credibility:

fact-checked

peer-reviewed publication

trusted source

proofread

Researchers develop novel data representation for transcription factor-binding sequences

gene
Credit: Unsplash/CC0 Public Domain

The diverse characteristics of the human body's various cells are reflected in their gene expression patterns. The regulation of such gene expression is based on transcription factors that bind to specific sequences in the genome.

The elucidation of unique transcription factor-binding sequences for each cell type is pivotal in unraveling the regulatory mechanisms governing within these cell types. Nonetheless, a comprehensive understanding of transcription factor-binding sequences, encompassing commonalities and variations across transcription factor types and cell types, remains elusive.

Using data concerning the binding sites of numerous human transcription factors, researchers at the University of Tsukuba have developed "MOCCS profiles," a novel data representation for transcription factor-binding sequences and analyzed binding sequences across various transcription factors and cell types. The work is published in the journal BMC Genomics.

Their findings reveal that approximately half of the examined transcription factors possess distinct binding sequences for specific cell types. Moreover, utilizing the MOCCS profiles, the researchers developed an index to predict the effect of single nucleotide polymorphisms (SNPs) on DNA binding of transcription factors and showed that the effect of disease-related SNPs on transcription factor binding can be appropriately evaluated from the perspective of transcription factors and .

The MOCCS profiles hold vast potential for various applications such as combining with epigenomic data to decipher cell type-specific gene expression regulatory mechanisms and evaluating the impact of somatic mutations in cancer cells on the binding of .

More information: Saeko Tahara et al, Transcription factor-binding k-mer analysis clarifies the cell type dependency of binding specificities and cis-regulatory SNPs in humans, BMC Genomics (2023). DOI: 10.1186/s12864-023-09692-9

Journal information: BMC Genomics

Citation: Researchers develop novel data representation for transcription factor-binding sequences (2023, October 20) retrieved 30 April 2024 from https://phys.org/news/2023-10-representation-transcription-factor-binding-sequences.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Study clarifies how 'junk DNA' influences gene expression

12 shares

Feedback to editors