Friday, December 12, 2025

Causal graphical methods for high-dimensional heterogeneous biomedical data

Award Number: F31LM013966
ORGANIZATION: NATIONAL LIBRARY OF MEDICINE
OPDIV: NIH
AWARD CLASS: DISCRETIONARY
AWARD ACTIVITY TYPE: FELLOWSHIP/SCHOLARSHIP/STUDENT LOANS
PERIOD OF PERFORMANCE START DATE: 03/21/2022
PERIOD OF PERFORMANCE END DATE: 03/20/2025

Causal graphical methods for high-dimensional heterogeneous biomedical data - In the past decade, there has been an explosion of data collected from biological and biomedical systems, in both type and volume. Mining these high-dimensional, heterogeneous, and often dynamic datasets to make biologically or medically important inferences or to develop predictive models requires new, sophisticated data analytics methods. New machine learning methods have begun filling this gap, but most of these methods generate “black box” models that lack clear interpretability. Additionally, these methods are associative and are thus incapable of teasing out the complex cause-effect relationships among features in the dataset. Directed causal graphical models (DCGMs) are a powerful tool for filling this gap. DCGMs, learned from observational datasets, can represent causal relationships between variables. This allows DCGMs to generate hypotheses of mechanisms and construct parsimonious, causally informed predictive models. However, biomedical datasets often have features that make it difficult to construct causal graphical models over the full dataset. Examples include data type heterogeneity, high dimensionality, multicollinearity, cyclicity, and nonstationarity. To address these problems, I propose to develop methods for learning causal graphs in datasets containing (1) a heterogeneous mixture of continuous, categorical, and censored variables, (2) high dimensionality and multicollinearity, and (3) cyclicity and nonstationarity. In Aim 1, I will develop a new causal discovery algorithm that accommodates continuous, categorical, and censored (e.g., survival) variables. In Aim 2, I will test and compare various methods for matrix decomposition and dimensionality reduction on their ability to learn a meaningful low-dimensional latent feature space to be used in graph learning methods. In Aim 3, I will develop a new method for causal discovery in dynamic, possibly cyclic, gene regulatory networks at single-cell resolution. In all cases, testing and validation will be performed on synthetic and real-life publicly available datasets. These methodological improvements constitute important steps forward in the field of causal discovery, and they can be utilized together or independently to provide a flexible and powerful platform for analysis of a wide range of biomedical datasets. Once made available, they will enable researchers to make inferences about causal mechanisms, generate hypotheses, and build robust, parsimonious predictive models.


Issue Date FY	Funding FY	Legal Entity Name	Legal Entity Address	Legal Entity City	Legal Entity State	Legal Entity Zip Code	Legal Entity County	Legal Entity Country	Assistance Listing	Award Code	Budget Year	Action Date	Action Type	Action Amount

Issue Date FY: 2024 ( Subtotal = $48,974 )
2024	2024	UNIVERSITY OF PITTSBURGH - OF THE COMMONWEALTH SYSTEM OF HIGHER EDUCATION	4200 FIFTH AVENUE	PITTSBURGH	PA	15260	ALLEGHENY	USA	Medical Library Assistance	000	3	2/16/2024	NON-COMPETING CONTINUATION	$47,694
2024	2024	UNIVERSITY OF PITTSBURGH - OF THE COMMONWEALTH SYSTEM OF HIGHER EDUCATION	4200 FIFTH AVENUE	PITTSBURGH	PA	15260	ALLEGHENY	USA	Medical Library Assistance	001	3	5/20/2024	NON-COMPETING CONTINUATION	$1,280

Issue Date FY: 2023 ( Subtotal = $47,694 )
2023	2023	UNIVERSITY OF PITTSBURGH - OF THE COMMONWEALTH SYSTEM OF HIGHER EDUCATION	4200 FIFTH AVENUE	PITTSBURGH	PA	15260	ALLEGHENY	USA	Medical Library Assistance	000	2	3/2/2023	NON-COMPETING CONTINUATION	$47,694

Issue Date FY: 2022 ( Subtotal = $46,752 )
2022	2022	UNIVERSITY OF PITTSBURGH, THE	4200 5TH AVE	PITTSBURGH	PA	15260	ALLEGHENY	USA	Medical Library Assistance	003	1	4/20/2022	NEW	$0
2022	2022	UNIVERSITY OF PITTSBURGH, THE	4200 5TH AVE	PITTSBURGH	PA	15260	ALLEGHENY	USA	Medical Library Assistance	004	1	6/17/2022	NEW	-$1
2022	2022	UNIVERSITY OF PITTSBURGH, THE	4200 5TH AVE	PITTSBURGH	PA	15260	ALLEGHENY	USA	Medical Library Assistance	002	1	4/20/2022	NEW	$0
2022	2022	UNIVERSITY OF PITTSBURGH, THE	4200 5TH AVE	PITTSBURGH	PA	15260	ALLEGHENY	USA	Medical Library Assistance	001	1	4/20/2022	NEW	$717
2022	2022	UNIVERSITY OF PITTSBURGH, THE	4200 5TH AVE	PITTSBURGH	PA	15260	ALLEGHENY	USA	Medical Library Assistance	000	1	2/11/2022	NEW	$46,036

Grand Total All Awards = $143,420
