Thursday, January 1, 2026 1/1/2026

Improving the Accuracy of Implicit Solvents with a Physics-Guided Neural Network

Award Number: R16GM146633
ORGANIZATION: NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES
OPDIV: NIH
AWARD CLASS: DISCRETIONARY
AWARD ACTIVITY TYPE: SCIENTIFIC/HEALTH RESEARCH (INCLUDES SURVEYS)
PERIOD OF PERFORMANCE START DATE: 08/01/2022
PERIOD OF PERFORMANCE END DATE: 05/31/2026

Group Awards By:

View Award Description

Improving the Accuracy of Implicit Solvents with a Physics-Guided Neural Network - Project Summary/Abstract Drug discovery is one of the most challenging tasks in biological sciences; it takes about 10-15 years and $2 billion on average to discover a new drug. The main goal in drug discovery is identifying drug-like com- pounds (ligands) capable of modulating speciﬁc biological targets (proteins). One key feature of protein-ligand interactions is the binding free energy change, G, that occurs between the protein and the ligand upon the ligand's attachment. This physiochemical feature heavily dictates how strongly a protein and ligand interact and is particularly useful to understand for drug design. While wet-lab experiments accurately estimate G, they are signiﬁcantly slow, costly, and laborious. On the other hand, computational simulations enable signiﬁcantly faster estimation of G and shed light on the binding mechanism of various structures that could have been complicated to be examined otherwise. The implicit solvent framework, which treats solvent as a continuum with the dielectric and non-polar properties of water, offer much more efﬁcient estimation of G compared to other computational methodologies, such as alchemical free energy methods. Despite noticeable progress in implicit solvent modeling, serious concerns about its accuracy remain that stem from the underlying physi- cal approximations. This research will employ modern machine learning techniques to bridge the accuracy gap between a physics-based implicit solvent model and experimental references in terms of G calculations. In particular, experimental data will be integrated into a generalized Born (GB) implicit solvent model so that with adherence to the physical model, new structural features could improve the accuracy. In addition to the model accuracy, it is essential to retain interpretability (that accounts for the model simplicity) and transferability (that assures consistent performance on different datasets). To this end, a novel multi-objective loss function will be introduced that takes “accuracy”, “interpretability”, and “transferability” into consideration. Standard protein-ligand databases, benchmarks, and datasets will be used for designing the proposed hybrid model, including host-guest systems, SAMPL challenge benchmarks, PDBbind, and BindingDB. While some of these sources contain clean data, many require further post-processing to prepare for running the GB model. Careful data preparation will be performed by following standard protocols and via popular web services. The modular characteristics of the proposed physics-data model will allow for testing various ﬂavors of implicit solvent (physics-based model) and modiﬁcations to the proposed Graph Convolutional Network (data-driven model). This ﬂexibility of the hybrid model facilitates new interdisciplinary research between the classical physics-based and the modern data-driven ends. The ﬁnal source code and parameterized datasets will be available freely to the public. They could be incorporated into the high-throughput virtual screening of candidate drugs in the early stages of drug discovery. The outcome of this research will beneﬁt the biomolecular modeling community by providing an approach to build novel, accurate, and efﬁcient computational models for studying protein-ligand interactions.


Issue Date FY	Funding FY	Legal Entity Name	Legal Entity Address	Legal Entity City	Legal Entity State	Legal Entity Zip Code	Legal Entity COUNTY	Legal Entity COUNTRY	Assistance Listing	Award Code	Budget Year	Action Date	Action Type	Action Amount

Issue Date FY: 2025 ( Subtotal = $182,500 )
2025	2025	CAL STATE LA UNIVERSITY AUXILIARY SERVICES INC	5151 STATE UNIVERSITY DR	LOS ANGELES	CA	90032	LOS ANGELES	USA	Biomedical Research and Research Training	000	4	8/4/2025	NON-COMPETING CONTINUATION	$182,500
														Subtotal = $182,500

Issue Date FY: 2024 ( Subtotal = $207,132 )
2024	2024	CAL STATE LA UNIVERSITY AUXILIARY SERVICES INC	5151 STATE UNIVERSITY DR	LOS ANGELES	CA	90032	LOS ANGELES	USA	Biomedical Research and Research Training	001	3	6/3/2024	SUPPLEMENT FOR EXPANSION	$24,632
2024	2024	CAL STATE LA UNIVERSITY AUXILIARY SERVICES INC	5151 STATE UNIVERSITY DR	LOS ANGELES	CA	90032	LOS ANGELES	USA	Biomedical Research and Research Training	000	3	5/17/2024	NON-COMPETING CONTINUATION	$182,500
														Subtotal = $207,132

Issue Date FY: 2023 ( Subtotal = $182,500 )
2023	2023	CAL STATE LA UNIVERSITY AUXILIARY SERVICES INC	5151 STATE UNIVERSITY DR	LOS ANGELES	CA	90032	LOS ANGELES	USA	Biomedical Research and Research Training	000	2	5/19/2023	NON-COMPETING CONTINUATION	$182,500
														Subtotal = $182,500

Issue Date FY: 2022 ( Subtotal = $182,500 )
2022	2022	CAL STATE L.A. UNIVERSITY AUXILIARY SERVICES, INC.	5151 STATE UNIVERSITY DR GE 314	LOS ANGELES	CA	90032	LOS ANGELES	USA	Biomedical Research and Research Training	000	1	7/21/2022	NEW	$182,500
														Subtotal = $182,500

Grand Total All Awards = $754,632

Top

All Categories

About

Search

Reports

Data Submission

Award Information

Improving the Accuracy of Implicit Solvents with a Physics-Guided Neural Network

Award Number: R16GM146633

ORGANIZATION: NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES

OPDIV: NIH

AWARD CLASS: DISCRETIONARY

AWARD ACTIVITY TYPE: SCIENTIFIC/HEALTH RESEARCH (INCLUDES SURVEYS)

PERIOD OF PERFORMANCE START DATE: 08/01/2022

PERIOD OF PERFORMANCE END DATE: 05/31/2026

Federal Websites

Department of Health & Human Services

HHS Operating Divisions

HHS Staff Divisions

Download A Document Viewer