Saturday, November 15, 2025

Predicting the effects of genetic variants on chromatin accessibility with a deep learning approach

Award Number: F31HG013262
ORGANIZATION: NATIONAL HUMAN GENOME RESEARCH INSTITUTE
OPDIV: NIH
AWARD CLASS: DISCRETIONARY
AWARD ACTIVITY TYPE: FELLOWSHIP/SCHOLARSHIP/STUDENT LOANS
PERIOD OF PERFORMANCE START DATE: 04/16/2024
PERIOD OF PERFORMANCE END DATE: 04/15/2027

PROJECT SUMMARY/ABSTRACT
This project will assess a deep learning approach for predicting the effects of genetic variants on chromatin accessibility (CA). Human genetics research currently faces a knowledge gap in understanding the function and causality of genetic variants, since over 90% of genetic variants have been found within the non-coding region of the genome. Genome-wide association studies (GWAS) have identified associations for these variants but have yet to establish their molecular function. Molecular quantitative trait locus (QTL) analysis has been used to determine variant function and to identify variants associated with molecular traits, such as chromatin accessibility QTLs (caQTLs). However, linkage disequilibrium makes the causal variants identified by molecular QTL analysis ambiguous, and these analyses lack the power to detect associations with rare genetic variants. Furthermore, allele-specific information, including allele-specific chromatin accessibility (ASCA), has been shown to increase the power to detect caQTLs, potentially improving machine learning model predictions.
Machine learning methods offer an alternative approach to determining variant function. They have been used to determine the molecular function of genetic variants and have successfully predicted gene expression, CA, and transcription factor binding from DNA sequence. However, these models are trained solely on reference genome sequences and do not consider human genetic variation. This proposal investigates the hypothesis that a machine learning model that uses genetic variation and allele-specific information will accurately predict the effects of both common and rare genetic variants on chromatin accessibility.
The project has two specific aims to investigate this hypothesis. Aim 1 will develop a variant-aware neural network to predict the effect of genetic variants on CA. Aim 2 will predict the function of rare genetic variants. In summary, this proposal strives to improve predictions of the molecular function of genetic variants in the non-coding region of the genome by assessing the utility of genetic variation and ASCA with a deep learning approach. The proposed study will make it possible to predict the function of rare genetic variants, which are likely to be highly important to disease traits but whose function cannot currently be uncovered with QTL-based methods.
This training plan will give the applicant the opportunity to (1) develop expertise in machine learning in genomics, (2) gain skills in CRISPR-based genome editing, (3) improve scientific writing and communication skills, and (4) develop mentoring and teaching aptitude. These professional training goals will provide the applicant with the essential training and scientific experience required to obtain a postdoctoral fellowship and thereafter become an impactful independent research investigator at an R1 university.


Issue Date FY	Funding FY	Legal Entity Name	Legal Entity Address	Legal Entity City	Legal Entity State	Legal Entity Zip Code	Legal Entity County	Legal Entity Country	Assistance Listing	Award Code	Budget Year	Action Date	Action Type	Action Amount

Issue Date FY: 2025 ( Subtotal = $44,965 )
2025	2025	SALK INSTITUTE FOR BIOLOGICAL STUDIES, SAN DIEGO, CALIFORNIA, THE	10010 N TORREY PINES RD	LA JOLLA	CA	92037	SAN DIEGO	USA	Human Genome Research	000	2	8/29/2025	NON-COMPETING CONTINUATION	$44,965
														Subtotal = $44,965

Issue Date FY: 2024 ( Subtotal = $44,401 )
2024	2024	SALK INSTITUTE FOR BIOLOGICAL STUDIES, SAN DIEGO, CALIFORNIA, THE	10010 N TORREY PINES RD	LA JOLLA	CA	92037	SAN DIEGO	USA	Human Genome Research	000	1	4/10/2024	NEW	$43,121
2024	2024	SALK INSTITUTE FOR BIOLOGICAL STUDIES, SAN DIEGO, CALIFORNIA, THE	10010 N TORREY PINES RD	LA JOLLA	CA	92037	SAN DIEGO	USA	Human Genome Research	001	1	4/10/2024	NEW	$0
2024	2024	SALK INSTITUTE FOR BIOLOGICAL STUDIES, SAN DIEGO, CALIFORNIA, THE	10010 N TORREY PINES RD	LA JOLLA	CA	92037	SAN DIEGO	USA	Human Genome Research	002	1	5/30/2024	NEW	$1,280
														Subtotal = $44,401

Grand Total All Awards = $89,366


