Computational approach reveals myriad of protein pairs that combine to regulate gene activity

Nov 20, 2013
Molecular biology: Picking out productive partnerships
A search for transcription factors that act collaboratively to bind DNA revealed a previously unknown pairing between molecules of FOXA1 (blue) that appears to regulate genes associated with prostate cancer. Credit: A*STAR Genome Institute of Singapore

Transcription factor (TF) proteins switch particular genes on and off by homing in on specific sequences of nucleotides scattered throughout the genome. In some cases, this regulation is a team effort, where pairs of TFs—or 'dimers'—come together to modulate one target gene.

A team led by Shyam Prabhakar of the A*STAR Genome Institute of Singapore and Jerzy Tiuryn at the University of Warsaw, Poland, has now devised a computational strategy for mapping TF collaboration at an unprecedented scale. Most known TF dimers have been discovered through experiments that focused exclusively on individual TFs or TF pairs. As an alternative, Prabhakar and co-workers identified such binding events on a large scale based on a few basic assumptions.

"We predicted that these dimers would be reasonably compact, with binding sites spaced less than 50 bases apart," he explains, "and that most dimers would have a favorite spacing."

Access was another important consideration; chromosomal DNA is wound around protein cores to form a material known as chromatin, which can be dense enough to thwart TF binding. Prabhakar and co-workers used chromatin maps to flag regions that were 'open' for protein binding. As chromatin structure can vary considerably between cell types, this approach also gave the team an edge in characterizing potential tissue-specific activity of TF dimers.
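
The assumptions described above (compact dimers with binding sites less than 50 bases apart, a favored spacing, and binding restricted to open chromatin) can be illustrated with a toy sketch. This is not the researchers' algorithm, which the article does not detail. The function names, the input format (lists of motif hit positions and open-chromatin intervals) and the example coordinates are all hypothetical, and the sketch simply counts offsets between nearby hits rather than testing them statistically.

```python
from collections import Counter

MAX_SPACING = 50  # quoted assumption: binding sites less than 50 bases apart


def in_open_region(pos, open_regions):
    """Return True if pos lies inside any (start, end) open-chromatin interval."""
    return any(start <= pos < end for start, end in open_regions)


def spacing_histogram(hits_a, hits_b, open_regions, max_spacing=MAX_SPACING):
    """Count signed offsets (b - a) between motif A and motif B hits.

    Only hits that fall inside open chromatin are considered, and only
    pairs closer than max_spacing bases are counted.
    """
    counts = Counter()
    open_b = [b for b in hits_b if in_open_region(b, open_regions)]
    for a in hits_a:
        if not in_open_region(a, open_regions):
            continue
        for b in open_b:
            offset = b - a
            if 0 < abs(offset) < max_spacing:
                counts[offset] += 1
    return counts


def favorite_spacing(counts):
    """Return (offset, count) for the most frequent offset, or None if empty."""
    if not counts:
        return None
    return counts.most_common(1)[0]


# Hypothetical hit positions for two motifs and one open-chromatin interval.
hits_a = [100, 300, 520, 900]
hits_b = [112, 312, 532, 1500]
open_regions = [(50, 600)]

print(favorite_spacing(spacing_histogram(hits_a, hits_b, open_regions)))
```

In this made-up example, the three motif pairs inside the open interval all sit 12 bases apart, so a spacing of 12 stands out as the favorite. A real analysis would also need to check whether such a peak occurs more often than expected by chance.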

The researchers used their algorithm to search for combinations of 964 different TF binding sites within regions of open chromatin in 41 different human cell types. Their data predicted 603 potential dimers, including 19 of the 25 pairs that have been identified experimentally to date and many more that were previously unknown. Altogether, these predicted dimers bound almost half a million distinct sites in the genome, vastly exceeding prior predictions.

"I don't think anyone appreciated just how many TF dimers there were in human cells—they were usually treated as an exotic, special case," says Prabhakar. The data also supported his prediction that TFs form fairly rigid complexes with DNA binding sites positioned at a relatively fixed distance from each other, rather than the more flexible mode of dimer interaction supported by some other scientists.

Importantly, Prabhakar and team identified many TF dimers with apparent cell-specific activity, including certain combinations that seem particularly active in tumor cells. For example, various dimers containing the TF FOXA1 were prominently over-represented in prostate cancer cells (see image). Prabhakar and Tiuryn intend to further explore the clinical significance of these and other findings in future experiments with collaborator Ralf Jauch.

More information: Jankowski, A., Szczurek, E., Jauch, R., Tiuryn, J. & Prabhakar, S. Comprehensive prediction in 78 human cell lines reveals rigidity and compactness of transcription factor dimers. Genome Research 23, 1307–1318 (2013). dx.doi.org/10.1101/gr.154922.113
