Develop Multi-modal Foundation Models for Sepsis Early Detection

Abstract: The PI's lab focuses on developing effective and scalable machine learning (ML), including deep learning, methodologies to address pressing challenges in healthcare. We have developed foundation models, self-supervised learning methods, multi-level optimization methods, interpretable ML methods, and large-scale distributed ML systems to analyze multi-modal, high-dimensional, and dynamic clinical data, including medical images, electronic health records (EHRs), and clinical notes, for medical decision support in diagnosis and treatment. In the next five years, we will develop accurate, efficient, and interpretable multi-modal foundation models for the early detection of sepsis.

Sepsis is a life-threatening condition in which the body's response to infection causes widespread inflammation, multiple organ failure, and, ultimately, death. Early detection and intervention are critical to reducing the risk of death and minimizing the extent of organ damage. A foundation model (FM) is a large-scale ML model, such as GPT-4, that is pre-trained on a vast dataset and can be fine-tuned for a wide range of specific tasks and applications. With their capacity to identify nuanced clinical patterns and signals in large-scale patient datasets, FMs hold immense potential for the early detection of sepsis. Nevertheless, developing FMs for this task presents significant challenges, including the scarcity of large-scale EHR data for pre-training, heterogeneity across data modalities, the prevalence of missing values and anomalies in patient records, a substantial risk of overfitting during fine-tuning, and a lack of interpretability, among other factors.

We aim to develop transformative ML methods to address these challenges. First, we will curate large-scale, high-quality EHR data for pre-training the FMs by developing self-supervised learning (SSL) methods, bi-level optimization-based methods, and multi-modal diffusion-based generative models for imputing missing values, detecting outliers, and synthesizing large-scale pre-training data. Second, we will learn effective representations of EHRs by developing multi-modal Transformer models that handle heterogeneous data modalities, capture long-range dependencies among clinical variables, and incorporate medical knowledge. Third, we will pre-train the multi-modal Transformer model on the curated large-scale EHR data by developing novel self-supervised pre-training methods, including a multi-modal masked data prediction method, a hierarchical SSL method, and an automated SSL approach. Fourth, we will fine-tune the pre-trained FM for sepsis early detection by developing new fine-tuning methods based on meta learning, multi-level optimization, and neural architecture search. Fifth, we will develop interpretable FMs to improve the trustworthiness of detection outcomes.

The proposed studies will be conducted on about 29 million patient records, representing the largest effort to date to study multi-modal FMs for sepsis. Our proposed research will democratize the early detection of sepsis by making pre-trained FMs accessible to a broader range of clinicians: smaller medical institutions that lack extensive computational infrastructure can leverage pre-trained FMs to jumpstart the development of in-house, specialized detection models. Moreover, the developed technologies extend beyond sepsis and can be applied to a broad range of clinical applications.
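
To make the data-curation thrust concrete, below is a minimal sketch of diffusion-based imputation for a vector of lab values, in the spirit of conditional denoising diffusion imputers: observed entries are kept clean while missing entries are diffused with Gaussian noise, and a network is trained to predict the injected noise. The NoisePredictor architecture, the noise schedule, all dimensions, and the masking setup are illustrative assumptions, not the proposal's actual design.

    import torch
    import torch.nn as nn

    T = 100                                         # number of diffusion steps
    betas = torch.linspace(1e-4, 0.02, T)           # linear noise schedule
    alphas_bar = torch.cumprod(1.0 - betas, dim=0)  # cumulative signal retention

    class NoisePredictor(nn.Module):
        def __init__(self, dim):
            super().__init__()
            # Inputs: noisy labs, observed labs, observation mask, timestep.
            self.net = nn.Sequential(nn.Linear(3 * dim + 1, 128), nn.ReLU(),
                                     nn.Linear(128, dim))

        def forward(self, x_t, x_obs, mask, t):
            t_feat = t.float().unsqueeze(-1) / T    # crude timestep encoding
            return self.net(torch.cat([x_t, x_obs, mask, t_feat], dim=-1))

    def imputation_loss(model, x, mask):
        # x: (B, D) fully observed labs (training data); mask: 1=observed, 0=missing.
        t = torch.randint(0, T, (x.shape[0],))
        noise = torch.randn_like(x)
        a = alphas_bar[t].unsqueeze(-1)
        x_noisy = a.sqrt() * x + (1 - a).sqrt() * noise   # forward diffusion
        x_t = mask * x + (1 - mask) * x_noisy             # keep observed entries clean
        pred = model(x_t, mask * x, mask, t)
        # Only the "missing" entries contribute to the denoising objective.
        return (((pred - noise) * (1 - mask)) ** 2).mean()

    model = NoisePredictor(dim=20)
    x = torch.randn(8, 20)                          # 8 patients, 20 lab variables
    mask = (torch.rand(8, 20) > 0.3).float()        # ~30% simulated missingness
    imputation_loss(model, x, mask).backward()

At inference, one would iteratively denoise the missing entries while repeatedly re-clamping the observed ones, yielding imputed values consistent with the observed context.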
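
The following is a minimal sketch of the multi-modal masked data prediction idea, assuming two simplified EHR modalities: a sequence of continuous lab/vital values and a sequence of clinical-note token IDs, fused by concatenation and encoded with a Transformer. The MaskedEHRPretrainer class, its dimensions, the 15% masking rate, and the omission of positional/time encodings are illustrative simplifications, not the proposal's specification.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class MaskedEHRPretrainer(nn.Module):
        def __init__(self, vocab_size=5000, d_model=128, n_heads=4, n_layers=2):
            super().__init__()
            self.lab_proj = nn.Linear(1, d_model)              # embed scalar labs/vitals
            self.note_emb = nn.Embedding(vocab_size, d_model)  # embed note tokens
            self.mask_token = nn.Parameter(torch.zeros(1, 1, d_model))
            layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
            self.encoder = nn.TransformerEncoder(layer, n_layers)
            self.lab_head = nn.Linear(d_model, 1)              # reconstruct masked labs
            self.note_head = nn.Linear(d_model, vocab_size)    # predict masked tokens

        def forward(self, labs, notes, mask_rate=0.15):
            # labs: (B, T_lab) continuous values; notes: (B, T_note) token IDs.
            x = torch.cat([self.lab_proj(labs.unsqueeze(-1)),
                           self.note_emb(notes)], dim=1)       # early-fusion concat
            mask = torch.rand(x.shape[:2], device=x.device) < mask_rate
            x = torch.where(mask.unsqueeze(-1), self.mask_token.expand_as(x), x)
            h = self.encoder(x)
            n = labs.shape[1]
            lab_loss = F.mse_loss(
                self.lab_head(h[:, :n]).squeeze(-1)[mask[:, :n]], labs[mask[:, :n]])
            note_loss = F.cross_entropy(
                self.note_head(h[:, n:])[mask[:, n:]], notes[mask[:, n:]])
            return lab_loss + note_loss

    model = MaskedEHRPretrainer()
    labs = torch.randn(8, 24)                    # 8 patients, 24 hourly vitals
    notes = torch.randint(0, 5000, (8, 64))      # 8 patients, 64 note tokens
    model(labs, notes).backward()                # joint masked-prediction loss

Pairing a regression head for masked continuous values with a classification head for masked note tokens is one way a single self-supervised objective can span heterogeneous modalities.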
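
For the fine-tuning thrust, below is a minimal sketch of meta-learning-style fine-tuning: a MAML-like inner/outer loop in which the outer (query-set) loss is backpropagated through the inner adaptation step, a simple instance of bi-level optimization. The maml_finetune_step function, the stand-in encoder, and all hyperparameters are hypothetical; the proposal's actual multi-level optimization formulation may differ substantially.

    import torch
    import torch.nn as nn

    def maml_finetune_step(encoder, head, support, query, inner_lr=1e-2):
        # One meta-update: adapt the head on a support set (inner problem),
        # then evaluate the adapted head on a query set (outer problem).
        loss_fn = nn.BCEWithLogitsLoss()
        x_s, y_s = support
        inner_loss = loss_fn(head(encoder(x_s)).squeeze(-1), y_s)
        grads = torch.autograd.grad(inner_loss, list(head.parameters()),
                                    create_graph=True)    # keep graph for bi-level grads
        w, b = [p - inner_lr * g for p, g in zip(head.parameters(), grads)]
        x_q, y_q = query
        logits_q = encoder(x_q) @ w.t() + b               # apply the adapted linear head
        return loss_fn(logits_q.squeeze(-1), y_q)

    encoder = nn.Sequential(nn.Linear(32, 64), nn.ReLU()) # stand-in for the pre-trained FM
    head = nn.Linear(64, 1)                               # sepsis-onset classifier head
    opt = torch.optim.Adam(list(encoder.parameters()) + list(head.parameters()), lr=1e-3)
    support = (torch.randn(16, 32), torch.randint(0, 2, (16,)).float())
    query = (torch.randn(16, 32), torch.randint(0, 2, (16,)).float())
    opt.zero_grad()
    maml_finetune_step(encoder, head, support, query).backward()
    opt.step()

Because the outer gradient flows through the inner update, the model is optimized to adapt well from small labeled cohorts, one way to mitigate the overfitting risk noted above.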
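
Finally, as one generic route to interpretability, below is a sketch of gradient-based input attribution for a fine-tuned detector. This is a standard saliency technique rather than the proposal's specific interpretable-FM design, and the saliency helper is hypothetical.

    import torch
    import torch.nn as nn

    def saliency(model, x):
        # Per-feature importance: gradient magnitude of the risk score w.r.t. inputs.
        x = x.clone().detach().requires_grad_(True)
        model(x).sum().backward()          # sum over the batch yields a scalar score
        return x.grad.abs()

    model = nn.Linear(32, 1)                          # stand-in for a fine-tuned detector
    print(saliency(model, torch.randn(4, 32)).shape)  # torch.Size([4, 32]) importance map

Attributions of this kind can highlight which vitals, labs, or note tokens drove a given sepsis-risk score, supporting the trustworthiness goal of the fifth thrust.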