Wednesday, March 25, 2026 3/25/2026

Modeling Substance Abuse via a Behavioral Foundation Model Trained on Large-Scale Survey Data

Award Number: R03DA065200
ORGANIZATION: NATIONAL INSTITUTE ON DRUG ABUSE
OPDIV: NIH
AWARD CLASS: DISCRETIONARY
AWARD ACTIVITY TYPE: SCIENTIFIC/HEALTH RESEARCH (INCLUDES SURVEYS)
PERIOD OF PERFORMANCE START DATE: 04/01/2026
PERIOD OF PERFORMANCE END DATE: 03/31/2028

Group Awards By:

View Award Description

Modeling Substance Abuse via a Behavioral Foundation Model Trained on Large-Scale Survey Data - Project Summary/Abstract Substance use disorders (SUD) pose a major public health crisis that exacts heavy tolls on communities and healthcare systems, yet current survey data remain underutilized due to limitations in conventional analytic methods. This project proposes to develop a novel behavioral foundation model that transforms qualitative epidemiological survey responses into robust, quantitative latent representations of substance use behaviors. By harmonizing data from NESARC-III, NSDUH, and UK Biobank, we will “textualize” both structured and free- text responses into unified narratives that capture the nuanced details of individual experiences. Our approach leverages advanced natural language processing to convert diverse survey data into coherent, machine- interpretable inputs, and fine-tunes state-of-the-art, open-source large language models (LLMs) with integrated demographic tokens to enhance subgroup-specific predictions. We will rigorously validate the model’s performance against established machine learning techniques using metrics such as area under the ROC curve, calibration, and cross-dataset generalizability. Downstream applications include precise risk stratification for SUD outcomes, latent clustering to identify distinct risk and resilience profiles, and data-driven survey instrument optimization. Open-access dissemination of our tools will empower precision public health initiatives, enhance early identification of high-risk groups, and support targeted interventions to reduce the societal burden of substance use disorders.


Issue Date FY	Funding FY	Legal Entity Name	Legal Entity Address	Legal Entity City	Legal Entity State	Legal Entity Zip Code	Legal Entity COUNTY	Legal Entity COUNTRY	Assistance Listing	Award Code	Budget Year	Action Date	Action Type	Action Amount

Issue Date FY: 2026 ( Subtotal = $336,250 )
2026	2026	YALE UNIV	150 MUNSON ST	NEW HAVEN	CT	06511	SOUTH CENTRAL CT	USA	Drug Use and Addiction Research Programs	000	1	3/20/2026	NEW	$336,250
														Subtotal = $336,250

Grand Total All Awards = $336,250

Top

All Categories

About

Search

Reports

Data Submission

Award Information

Modeling Substance Abuse via a Behavioral Foundation Model Trained on Large-Scale Survey Data

Award Number: R03DA065200

ORGANIZATION: NATIONAL INSTITUTE ON DRUG ABUSE

OPDIV: NIH

AWARD CLASS: DISCRETIONARY

AWARD ACTIVITY TYPE: SCIENTIFIC/HEALTH RESEARCH (INCLUDES SURVEYS)

PERIOD OF PERFORMANCE START DATE: 04/01/2026

PERIOD OF PERFORMANCE END DATE: 03/31/2028

Federal Websites

Department of Health & Human Services

HHS Operating Divisions

HHS Staff Divisions

Download A Document Viewer