Friday, March 20, 2026

Computable social factor phenotyping using EHR and HIE data

Award Number: R01HS028636
ORGANIZATION: AGENCY FOR HEALTH CARE RESEARCH AND QUALITY
OPDIV: AHRQ
AWARD CLASS: DISCRETIONARY
AWARD ACTIVITY TYPE: SCIENTIFIC/HEALTH RESEARCH (INCLUDES SURVEYS)
PERIOD OF PERFORMANCE START DATE: 09/30/2021
PERIOD OF PERFORMANCE END DATE: 08/31/2026

Computable social factor phenotyping using EHR and HIE data - Most health systems attempt to measure patients' social risk factors, but such data collection is typically fraught with operational and conceptual difficulties. Multi-domain screening questionnaires face reliability, validity, and workflow challenges. Area-level data are not valid proxies for individual characteristics. Diagnosis codes are underutilized. The day-to-day use of natural language processing (NLP) to extract social factors from text is beyond the capacity of most organizations. Thus, health care organizations need more implementable and valid approaches to measuring social factors. With implementable and valid approaches, health systems will more effectively address the negative cost, quality, and health outcomes associated with patients' social risk factors. The objective of this proposal is to assess the validity of patient-level computable social factor phenotypes for use in predicting patients' risk of increased healthcare costs and utilization. Computable phenotypes are composites of characteristics defined through single data elements or a collection of data elements, observations, or events. Because these phenotypes derive from existing healthcare operations and electronic data systems, they are well-positioned for widespread implementation. Our central hypothesis is that phenotypes computed from existing structured demographic, clinical, and business operations data will support equally or more valid inferences about patient social risks than other measurement approaches.
Building upon strong preliminary data and direction from experts in the field, we will determine the validity and usefulness of six novel social factor phenotypes computed from already collected information within EHRs and health information exchanges (HIE) through the following aims: Aim 1, Assess the concurrent validity of patient-level computable social factor phenotypes, will compare the concurrent validity of computed phenotypes, multi-domain questionnaires, and NLP against gold standard measures of social factors in two health systems. Aim 2, Assess the predictive validity of patient-level computable social factor phenotypes, will assess the validity of computable phenotypes, multi-domain questionnaires, NLP, and combined approaches in predicting costs and utilization. Aim 3, Assess the reliability (bias) of patient-level computable social factor phenotypes across patient gender, race, ethnicity, and age, will assess the reproducibility of measurement approaches across underserved populations. We will employ a multi-method research approach to identify and mitigate potential bias. This project will lead to more valid and implementable approaches to patient social factor measurement. The proposed research is significant because it directly addresses the challenges organizations face in addressing patients' social risks and will provide key inputs to support organizations' efforts to achieve a learning health system. This proposal is innovative in advancing the psychometrics of social factors and identifying novel uses of EHR and HIE data. By working with multiple and diverse populations, we address the priority populations of socioeconomically disadvantaged populations, racial minority populations, and the elderly.


Issue Date FY	Funding FY	Legal Entity Name	Legal Entity Address	Legal Entity City	Legal Entity State	Legal Entity Zip Code	Legal Entity COUNTY	Legal Entity COUNTRY	Assistance Listing	Award Code	Budget Year	Action Date	Action Type	Action Amount

Issue Date FY: 2024 ( Subtotal = $399,455 )
2024	2024	TRUSTEES OF INDIANA UNIVERSITY	980 INDIANA AVE RM 2232	INDIANAPOLIS	IN	46202	MARION	USA	Research on Healthcare Costs, Quality and Outcomes	000	4	8/13/2024	NON-COMPETING CONTINUATION	$399,455

Issue Date FY: 2023 ( Subtotal = $397,147 )
2023	2023	TRUSTEES OF INDIANA UNIVERSITY	980 INDIANA AVE RM 2232	INDIANAPOLIS	IN	46202	MARION	USA	Research on Healthcare Costs, Quality and Outcomes	000	3	9/1/2023	NON-COMPETING CONTINUATION	$397,147

Issue Date FY: 2022 ( Subtotal = $397,558 )
2022	2022	TRUSTEES OF INDIANA UNIVERSITY	980 INDIANA AVE RM 2232	INDIANAPOLIS	IN	46202	MARION	USA	Research on Healthcare Costs, Quality and Outcomes	000	2	8/24/2022	NON-COMPETING CONTINUATION	$397,558

Issue Date FY: 2021 ( Subtotal = $399,999 )
2021	2021	TRUSTEES OF INDIANA UNIVERSITY	980 INDIANA AVE RM 2232	INDIANAPOLIS	IN	46202	MARION	USA	Research on Healthcare Costs, Quality and Outcomes	000	1	9/13/2021	NEW	$399,999

Grand Total All Awards = $1,594,159
