Project Summary
There are no reliable methods to stabilize proteins with high accuracy. Currently available methods have
standard errors between observed and predicted effects of mutations on protein stability that range from 1 to 3
kcal/mol. Given the importance of protein stability for biomedical applications such as the shelf-life and
immunogenicity of protein-based pharmaceuticals, development of reliable methods to stabilize proteins with
high accuracy is critical. To address this deficit in current knowledge, we have developed EmCAST (Empirical
C-Alpha Stability Tool) and have shown that it can double the stability of a small three helix bundle, UBA(1),
with four mutations. For a set of eight single, double, triple, and quadruple mutant variants that contain
combinations of these four mutations, the average error between predicted and observed stability was 0.13
kcal/mol, a vast improvement over existing methods to predict stabilizing mutations. EmCAST relies on two
important innovations: 1) use of an empirical potential derived from a database of the alpha carbon (C¿)
dihedral angle preferences for all possible four-residue sequences extracted from the 2018 release of the
Protein Data Bank and 2) selection of surface-exposed sites for introduction of stabilizing mutations. In the
proposed work, we will demonstrate that EmCAST can be an effective tool to stabilize a broad range of protein
folds and that it can be used to tune the position of protein conformational switches and hence control protein
function. We will also release, maintain, and upgrade a web service so that the protein biochemistry community
can readily access and use this valuable tool.
We will accomplish these goals in the context of the following Aims:
• In Aim 1, Rational Stabilization of Pure ¿ and ¿ Domains, we will show that EmCAST can stabilize a
set of four additional helical domains with high accuracy and that it can also be applied to stabilization
of ¿-sheet domains. Predicted stabilizations for these proteins range from 2.5 to 6 kcal/mol.
• In Aim 2, Stabilization of Mixed ¿/¿ Domains and Large Folds, we apply EmCAST to stabilize a set
of four more complex folds that include both ¿-helix and ¿-sheet structure with sequence lengths up to
270 amino acids. EmCAST predicts stabilizations of 3 to 5 kcal/mol for the selected proteins.
• In Aim 3, Regulating Loop Dynamics and Tuning the Position of Conformational Switches, we
will show that EmCAST can be applied to differential stabilization of alternate conformers of proteins,
allowing for tuning of protein function.