November 9, 2015

Complex grammar of the genomic language

A new study from Sweden's Karolinska Institutet shows that the 'grammar' of the human genetic code is more complex than that of even the most intricately constructed spoken languages in the world. The findings, published in the journal Nature, explain why the human genome is so difficult to decipher—and contribute to the further understanding of how genetic differences affect the risk of developing diseases on an individual level.

"The genome contains all the information needed to build and maintain an organism, but it also holds the details of an individual's risk of developing common diseases such as diabetes, heart disease and cancer", says the study's lead author Arttu Jolma, a doctoral student at the Department of Biosciences and Nutrition. "If we can improve our ability to read and understand the human genome, we will also be able to make better use of the rapidly accumulating genomic information on a large number of diseases for medical benefits."

The sequencing of the human genome in 2000 revealed the order of the 3 billion letters (A, C, G and T) that make up the genome. However, knowing just the order of the letters is not sufficient for translating genomic discoveries into medical benefits; one also needs to understand what the sequences of letters mean. In other words, it is necessary to identify the 'words' and the 'grammar' of the language of the genome.

The cells in our body have almost identical genomes, but differ from each other because different genes are active (expressed) in different types of cells. Each gene has a regulatory region that contains the instructions controlling when and where the gene is expressed. This gene regulatory code is read by proteins called transcription factors that bind to specific 'DNA words' and either increase or decrease the expression of the associated gene.

Under the supervision of Professor Jussi Taipale, researchers at Karolinska Institutet have previously identified most of the DNA words recognised by individual transcription factors. Much like in a natural human language, however, DNA words can be joined to form compound words that are read by multiple transcription factors, and the mechanism by which such compound words are read has not previously been examined. In their recent study in Nature, the Taipale team therefore examines the binding preferences of pairs of transcription factors and systematically maps the compound DNA words they bind to.

Their analysis reveals that the grammar of the genetic code is much more complex than that of even the most complex human languages. Rather than simply being joined by deleting the space between them, the individual words that make up a compound DNA word are themselves altered, leading to a large number of completely new words.

"Our study identified many such words, increasing the understanding of how genes are regulated both in normal development and cancer", says Arttu Jolma. "The results pave the way for cracking the genetic code that controls the expression of genes."

More information: Arttu Jolma et al. DNA-dependent formation of transcription factor pairs alters their binding specificity, Nature (2015). DOI: 10.1038/nature15518

Journal information: Nature

Provided by Karolinska Institutet

Citation: Complex grammar of the genomic language (2015, November 9) retrieved 5 August 2024 from https://phys.org/news/2015-11-complex-grammar-genomic-language.html

