Feedback from thousands of designs could transform protein engineering

July 13, 2017
A model of a computationally designed mini protein from a UW Medicine Institute for Protein Design large-scale study. Credit: UW Medicine Institute for Protein Design

The stage is set for a new era of data-driven protein molecular engineering as advances in DNA synthesis technology merge with improvements in computational design of new proteins.

This week's Science reports the largest-scale testing of folding stability for computationally designed proteins, made possible by a new high-throughput approach.

The scientists are from the UW Medicine Institute for Protein Design at the University of Washington in Seattle and the University of Toronto in Ontario.

The lead author of the paper is Gabriel Rocklin, a postdoctoral fellow in biochemistry at the University of Washington School of Medicine. The senior authors are Cheryl Arrowsmith, of the Princess Margaret Cancer Center, the Structural Genomics Consortium and the Department of Medical Biophysics at the University of Toronto, and David Baker, UW professor of biochemistry and a Howard Hughes Medical Institute investigator.

Proteins are biological workhorses. Researchers want to build new molecules, not found naturally, that can perform tasks in preventing or treating disease, in industrial applications, in energy production, and in environmental cleanups.

"However, computationally designed proteins often fail to form the folded structures that they were designed to have when they are actually tested in the lab," Rocklin said.

In the latest study, the researchers tested more than 15,000 newly designed mini-proteins that do not exist in nature to see whether they form folded structures. Even major studies in the past few years have generally examined only 50 to 100 designs.

An animation of a computationally designed mini protein from a UW Medicine Institute for Protein Design large-scale study to assess molecular folding and structural stability. Credit: UW Medicine Institute for Protein Design

"We learned a huge amount at this new scale, but the taste has given us an even larger appetite," said Rocklin. "We're eager to test hundreds of thousands of designs in the next few years."

The most recent testing led to the design of 2,788 stable protein structures, which could have many bioengineering and synthetic biology applications. Their small size may be advantageous for treating diseases when the drug needs to reach the inside of a cell.

Proteins are made of amino acid chains with specific sequences, and natural protein sequences are encoded in cellular DNA. These chains fold into three-dimensional conformations. The sequence of amino acids in the chain guides where it will bend and twist, and how parts will interact to hold the structure together.

For decades, researchers have studied these interactions by examining the structures of naturally occurring proteins. However, natural protein structures are typically large and complex, with thousands of interactions that collectively hold the protein in its folded shape. Measuring the contribution of each interaction becomes very difficult.

The scientists addressed this problem by computationally designing their own, much simpler proteins. These simpler proteins made it easier to analyze the different types of interactions that hold all proteins in their folded structures.

"Still, even simple proteins are so complicated that it was important to study thousands of them to learn why they fold," Rocklin said. "This had been impossible until recently, due to the cost of DNA. Each designed protein requires its own customized piece of DNA so that it can be made inside a cell. This has limited previous studies to testing only tens of designs."

To encode their designs of short proteins in this project, the researchers used what is called DNA oligo library synthesis technology. It was originally developed for other laboratory protocols, such as large gene assembly. One of the companies that provided their DNA is CustomArray in Bothell, Wash. They also used DNA libraries made by Agilent in Santa Clara, Calif., and Twist Bioscience in San Francisco.

This image is from a comprehensive mutational analysis of stability in designed and natural proteins. The average change in stability due to mutating each position in 13 designed proteins is depicted on the design model structures. Yellow indicates positions where mutations are most destabilizing; positions where there is little effect are blue. Credit: UW Medicine Institute for Protein Design

By repeating the cycle of computation and experimental testing over several iterations, the researchers learned from their design failures and progressively improved their modeling. Their design success rate rose from 6 percent to 47 percent. They also produced stable proteins in shapes where all of their first designs failed.

Their large set of stable and unstable mini-proteins enabled them to quantitatively analyze which protein features correlated with folding. They also compared the stability of their designed proteins to similarly sized, naturally occurring proteins.

The most stable natural protein the researchers identified was a much-studied protein from the bacterium Bacillus stearothermophilus. This organism basks in high temperatures, like those in hot springs and ocean thermal vents. Most proteins lose their folded structures under such high temperature conditions. Organisms that thrive there have evolved highly stable proteins that stay folded even when hot.

"A total of 774 designed proteins had higher stability scores than this most protease-resistant monomeric protein," the researchers noted. Proteases are enzymes that break down proteins, and were essential tools the researchers used to measure stability for their thousands of proteins.

The researchers predict that, as DNA synthesis technology continues to improve, high-throughput protein design will become possible for larger, more complex protein structures.

"We are moving away from the old style of design, which was a mix of computer modeling, human intuition, and small bits of evidence about what worked before," Rocklin said. "Protein designers were like master craftsmen who used their experience to hand-sculpt each piece in their workshop. Sometimes things worked, but when they failed it was hard to say why. Our new approach lets us collect an enormous amount of data on what makes proteins stable. This data can now drive the design process."

More information: "Global analysis of protein folding using massively parallel design, synthesis, and testing" Science (2017). science.sciencemag.org/cgi/doi … 1126/science.aan0693
