Saturday, October 18, 2025 10/18/2025

Fast and slow prediction of stable and transient protein-protein interactions

Award Number: R01LM014674
ORGANIZATION: NATIONAL LIBRARY OF MEDICINE
OPDIV: NIH
AWARD CLASS: DISCRETIONARY
AWARD ACTIVITY TYPE: SCIENTIFIC/HEALTH RESEARCH (INCLUDES SURVEYS)
PERIOD OF PERFORMANCE START DATE: 09/01/2025
PERIOD OF PERFORMANCE END DATE: 08/31/2029

Group Awards By:

View Award Description

Fast and slow prediction of stable and transient protein-protein interactions - Protein-protein interactions (PPIs) underpin processes ranging from transduction of signaling networks to maintenance of cellular structure. Identifying new human PPIs may uncover interactions targetable by PPI- modulating drugs, while identifying human-pathogen PPIs could shed light on processes driving infectious diseases. Many studies have experimentally mapped PPIs at scale, yet their cost and complexity, combined with the size of PPI space, have limited the degree to which PPIs can be fully mapped. In humans, only 20% of PPIs are estimated to be known while coverage of other organisms, including pathogens, is far lower. In silico methods can be faster and cheaper but have been beset by low accuracy and lack of generalizability, unable to predict PPIs involving proteins different in sequence or structure from ones they were trained on. Recently however this has begun to change with the development of AlphaFold, which has shown a robust capacity for generalization, including in formal blind competitions. We hypothesize that structure-informed PPI prediction can be made accurate, general, and fast by using new machine learning models and data modalities that AlphaFold does not use. We aim to realize our hypothesis in this proposal. Our team has been at the forefront of molecular machine learning and high-throughput characterization of PPIs, having developed key precursors to AlphaFold as well as OpenFold—the first trainable public implementation of AlphaFold—and some of the most complete experimental PPI maps. We will combine our expertise to tackle PPI prediction. First, we will develop complementary methods to predict transient PPIs involving peptide-binding domains and peptidic ligands. Transient PPIs are highly challenging for AlphaFold and require specialized treatment, in part because they regularly involve post-translational modifications that AlphaFold does not model. We will pursue both supervised approaches that learn directly from domain-peptide binding data and unsupervised approaches that do not rely on binding data but instead detect patterns of co-evolution across whole proteomes to infer domain- peptide binding. We will complement method development with a curation effort to collect transient PPI data from the vast, untapped reservoir of primary literature sources. Second, we will develop a new version of AlphaFold designed to discriminate between true and false PPIs and trained on a wide array of data types, including structural and binding data. This version will have a fast mode that we expect to be as accurate as the current AlphaFold but sufficiently fast to screen PPIs at proteome scale, and a slow mode that will be more accurate than AlphaFold at predicting the structures of protein complexes. We will assess our models using rigorous statistical methods that test their capacity to generalize to novel sequences and structures, and experimentally validate structurally novel predictions of both wild-type and mutated interacting proteins. In the future, we expect our models to help identify novel protein complexes and human and human-pathogen PPIs, and to elucidate the logic of signaling networks and their dysregulation by disease-causing mutations.


Issue Date FY	Funding FY	Legal Entity Name	Legal Entity Address	Legal Entity City	Legal Entity State	Legal Entity Zip Code	Legal Entity COUNTY	Legal Entity COUNTRY	Assistance Listing	Award Code	Budget Year	Action Date	Action Type	Action Amount

Issue Date FY: 2025 ( Subtotal = $1,440,905 )
2025	2025	THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK	630 W 168TH ST FL 4	NEW YORK	NY	10032	NEW YORK	USA	Medical Library Assistance	000	1	9/1/2025	NEW	$1,440,905
														Subtotal = $1,440,905

Grand Total All Awards = $1,440,905

Top

All Categories

About

Search

Reports

Data Submission

Award Information

Fast and slow prediction of stable and transient protein-protein interactions

Award Number: R01LM014674

ORGANIZATION: NATIONAL LIBRARY OF MEDICINE

OPDIV: NIH

AWARD CLASS: DISCRETIONARY

AWARD ACTIVITY TYPE: SCIENTIFIC/HEALTH RESEARCH (INCLUDES SURVEYS)

PERIOD OF PERFORMANCE START DATE: 09/01/2025

PERIOD OF PERFORMANCE END DATE: 08/31/2029

Federal Websites

Department of Health & Human Services

HHS Operating Divisions

HHS Staff Divisions

Download A Document Viewer