Machine learning provides new paradigm in understanding microbial gene regulation


E. coli are hardy bacteria, able to live in diverse conditions from the surface of a lettuce leaf to an acidic stomach. To survive and thrive in so many environments, the bacteria must use a network of transcriptional regulators to change their gene expression levels in response to their surroundings. Even in E. coli, one of the best characterized bacteria, it is still a significant challenge for scientists to understand how they coordinate the expression of their thousands of genes.

In a paper published in the Dec. 4 issue of Nature Communications, bioengineers at the University of California San Diego report a new method to interpret gene expression datasets. By applying a machine learning method designed to separate mixed signals into their original sources, the researchers were able to split a large, high-quality collection of gene expression data into around 100 signals that represent the targeted effects of transcriptional regulators.
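To make the idea of separating mixed signals into their original sources concrete, here is a minimal sketch in Python. The article does not name the algorithm; one standard technique of this kind is independent component analysis (ICA), and the toy below is a two-signal version of it that whitens the data and then searches for the rotation that makes the outputs as non-Gaussian as possible. It is not the authors' pipeline. The synthetic data, the mixing weights, and every function name (`whiten`, `unmix`, `sparse_source`, and so on) are assumptions made for illustration.

```python
import math
import random

def center(x):
    """Subtract the mean so the signal averages to zero."""
    m = sum(x) / len(x)
    return [v - m for v in x]

def corr(a, b):
    """Pearson correlation of two equal-length signals."""
    a, b = center(a), center(b)
    num = sum(u * v for u, v in zip(a, b))
    den = math.sqrt(sum(u * u for u in a) * sum(v * v for v in b))
    return num / den

def excess_kurtosis(y):
    """Excess kurtosis of a zero-mean signal (0 for a Gaussian)."""
    n = len(y)
    m2 = sum(v * v for v in y) / n
    m4 = sum(v ** 4 for v in y) / n
    return m4 / (m2 * m2) - 3.0

def whiten(x1, x2):
    """Decorrelate two centered signals and scale each to unit variance."""
    n = len(x1)
    a = sum(v * v for v in x1) / n              # var(x1)
    b = sum(u * v for u, v in zip(x1, x2)) / n  # cov(x1, x2)
    d = sum(v * v for v in x2) / n              # var(x2)
    # Closed-form eigen-decomposition of the symmetric 2x2 covariance matrix.
    theta = 0.5 * math.atan2(2 * b, a - d)
    c, s = math.cos(theta), math.sin(theta)
    l1 = a * c * c + 2 * b * c * s + d * s * s  # eigenvalue along (c, s)
    l2 = a * s * s - 2 * b * c * s + d * c * c  # eigenvalue along (-s, c)
    z1 = [(c * u + s * v) / math.sqrt(l1) for u, v in zip(x1, x2)]
    z2 = [(-s * u + c * v) / math.sqrt(l2) for u, v in zip(x1, x2)]
    return z1, z2

def unmix(x1, x2, steps=180):
    """Toy two-signal ICA: whiten, then grid-search the rotation angle
    that maximizes the total absolute excess kurtosis of the outputs."""
    z1, z2 = whiten(center(x1), center(x2))
    best = None
    for k in range(steps):
        phi = (math.pi / 2) * k / steps  # a quarter turn covers all cases
        c, s = math.cos(phi), math.sin(phi)
        y1 = [c * u + s * v for u, v in zip(z1, z2)]
        y2 = [-s * u + c * v for u, v in zip(z1, z2)]
        score = abs(excess_kurtosis(y1)) + abs(excess_kurtosis(y2))
        if best is None or score > best[0]:
            best = (score, y1, y2)
    return best[1], best[2]

def sparse_source(n, p=0.1):
    """Hypothetical regulator signal: most genes unaffected (zero weight)."""
    return [random.gauss(0, 1) if random.random() < p else 0.0 for _ in range(n)]

# Synthetic example (all values are made up for illustration).
random.seed(0)
n_genes = 2000
s1 = sparse_source(n_genes)   # hidden signal of hypothetical regulator 1
s2 = sparse_source(n_genes)   # hidden signal of hypothetical regulator 2
x1 = [0.7 * a + 0.3 * b for a, b in zip(s1, s2)]  # first measured mixture
x2 = [0.4 * a + 0.6 * b for a, b in zip(s1, s2)]  # second measured mixture

y1, y2 = unmix(x1, x2)
for name, y in (("y1", y1), ("y2", y2)):
    # Each recovered signal should track one hidden source, up to sign and scale.
    print(name, round(abs(corr(y, s1)), 3), round(abs(corr(y, s2)), 3))
```

Each recovered signal is only defined up to sign, scale, and ordering, which is why the comparison uses absolute correlations. Real datasets with thousands of genes and many conditions would call for an established ICA implementation rather than a grid search over one angle.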

The work was led by Bernhard Palsson, Galletti Professor of Bioengineering at UC San Diego, and Anand V. Sastry, a bioengineering Ph.D. student in the Palsson lab.

When analyzing gene expression datasets, scientists traditionally have had to sift through hundreds of differentially expressed genes, trying to find a cohesive pattern or story that connects them. The problem is that many of these genes may be responding to the same underlying signal, making it difficult to discern the root cause of the organism's response, Sastry explained.

The UC San Diego team pioneered a new framework that automatically extracts the signals for specific transcriptional regulators that cause the measured changes in gene expression. The method also does not require prior knowledge of the transcriptional regulatory network. This makes it easier to apply to less-understood organisms, Sastry said.

The team's analysis characterized two previously unknown transcription factors and refined the known targets of many other transcriptional regulators. Additional studies are in progress to validate multiple predictions made by the study. The analysis also identified direct links between mutations in E. coli strains and their gene expression states, introducing a new strategy to compare strains across a species.

"Since the transcriptional regulatory network is how bacteria sense their environment, we now have a way to see what they 'see.' We can easily tell if the cell is starved for a nutrient, like iron, or is stressed in any way," Sastry said. "This could be invaluable when studying complex environments, like in vivo infections."

More information: Anand V. Sastry et al. The Escherichia coli transcriptome mostly consists of independently regulated modules, Nature Communications (2019). DOI: 10.1038/s41467-019-13483-w

Journal information: Nature Communications

Citation: Machine learning provides new paradigm in understanding microbial gene regulation (2019, December 5) retrieved 26 April 2024 from https://phys.org/news/2019-12-machine-paradigm-microbial-gene.html
