Thursday, March 5, 2026

Rapid response for pandemics: single cell sequencing and deep learning to predict antibody sequences against an emerging antigen

Award Number: R01AI169543
ORGANIZATION: NATIONAL INSTITUTE OF ALLERGY & INFECTIOUS DISEASES
OPDIV: NIH
AWARD CLASS: DISCRETIONARY
AWARD ACTIVITY TYPE: SCIENTIFIC/HEALTH RESEARCH (INCLUDES SURVEYS)
PERIOD OF PERFORMANCE START DATE: 09/16/2021
PERIOD OF PERFORMANCE END DATE: 08/31/2025

ABSTRACT: One of the “holy grails” in immunology is to be able to directly predict tight-binding variable chain antibody sequences in silico against foreign or non-self “antigenic” proteins. Immunoglobulin chain rearrangement can potentially encode approximately 10^16 different variants of antibody heavy and light chain sequences. However, only a small fraction of the sequence space is generally accessed for evolving antibodies against foreign proteins. The computational challenge is to go from a model of the structure of an antigen to predicting a set of antibody chain sequences that can bind tightly to the antigen. If solved, it might be possible to move in less than 24 hours from the first cryo-electron-microscopic structure of a novel viral protein to advance a set of potent antibody-like molecular candidates for testing. Towards solving this problem, this project aims to develop a deep learning architecture that will take as input thermodynamic, quantum mechanical (density functional), and local structure-based network topographical features of the antigens and their cognate antibodies, and will output their respective binding affinity constants. We will design a generative adversarial network (GAN), which we think is uniquely suited for regression-based ML approaches for the immune system, to discover associations between the epitope and the variable chain features. This approach requires a large data stream of antigen and cognate antibody sequences, which until recently was difficult to obtain. A recently described single B-cell receptor (BCR) specific tagging method coupled with single cell deep sequencing (“linking B cell receptor to antigen specificity through sequencing” or LIBRA-seq) can rapidly isolate and sequence the BCR variable chain coding regions that can bind with high selectivity to antigenic epitopes.
Towards the specific project goals, in Task 1, LIBRA-seq will be used to rapidly identify and generate candidate immunoglobulin coding sequences in response to specific linear and nonlinear epitopes (against controls), chosen through computational/molecular modeling and prioritized with SARS-CoV-2 Spike protein epitopes (but not restricted to these), injected into a mouse model, to generate large training sets; in Task 2, these training sets, along with other data sets already available in public databases, will generate a series of structural features (described above), which will be used to train the GAN; in Task 3, the predicted epitope-antibody interactions will be validated by direct experiments with synthetic antibody and phage-display systems. Thus, the proposed strategy combines foundational principles in evolutionary biology, genomics, structural chemistry, and computer science to solve a general biological engineering problem. Results from this project are expected to lay the foundations for a rigorously tested and fully automated machine-learning system that could rapidly generate synthetic antibody candidates from the structure of a novel virus protein, which can enhance the rapid response ability against a future pandemic. The ability to develop targeted antibody therapy against non-infectious or chronic diseases, and to produce antibody-based industrial enzymes, will also be dramatically enhanced if this project is successful.

The team: The team-leads of this multi-institutional research project comprise a computer scientist, a protein crystallographer, an immunologist, and a molecular biologist.


Issue Date FY	Funding FY	Legal Entity Name	Legal Entity Address	Legal Entity City	Legal Entity State	Legal Entity Zip Code	Legal Entity COUNTY	Legal Entity COUNTRY	Assistance Listing	Award Code	Budget Year	Action Date	Action Type	Action Amount

Issue Date FY: 2026 ( Subtotal = $0 )
2026	2021	KECK GRADUATE INSTITUTE OF APPLIED LIFE SCIENCES	535 WATSON DR	CLAREMONT	CA	91711	LOS ANGELES	USA	Trans-NIH Research Support	000	1	2/11/2026	OTHER REVISION	$0
														Subtotal = $0

Issue Date FY: 2025 ( Subtotal = $0 )
2025	2023	KECK GRADUATE INSTITUTE OF APPLIED LIFE SCIENCES	535 WATSON DR	CLAREMONT	CA	91711	LOS ANGELES	USA	Trans-NIH Research Support	003	1	4/7/2025	OTHER REVISION	$0
2025	2023	KECK GRADUATE INSTITUTE OF APPLIED LIFE SCIENCES	535 WATSON DR	CLAREMONT	CA	91711	LOS ANGELES	USA	Trans-NIH Research Support	001	1	3/25/2025	TERMINATION	$0
2025	2021	KECK GRADUATE INSTITUTE OF APPLIED LIFE SCIENCES	535 WATSON DR	CLAREMONT	CA	91711	LOS ANGELES	USA	Trans-NIH Research Support	002	1	4/7/2025	OTHER REVISION	$0
2025	2021	KECK GRADUATE INSTITUTE OF APPLIED LIFE SCIENCES	535 WATSON DR	CLAREMONT	CA	91711	LOS ANGELES	USA	Trans-NIH Research Support	000	1	3/25/2025	TERMINATION	$0
														Subtotal = $0

Issue Date FY: 2024 ( Subtotal = $0 )
2024	2023	KECK GRADUATE INSTITUTE OF APPLIED LIFE SCIENCES	535 WATSON DR	CLAREMONT	CA	91711	LOS ANGELES	USA	Trans-NIH Research Support	001	1	8/28/2024	SUPPLEMENT FOR EXPANSION	$0
2024	2021	KECK GRADUATE INSTITUTE OF APPLIED LIFE SCIENCES	535 WATSON DR	CLAREMONT	CA	91711	LOS ANGELES	USA	Trans-NIH Research Support	000	1	8/27/2024	NEW	$0
														Subtotal = $0

Issue Date FY: 2023 ( Subtotal = $1,219,945 )
2023	2023	KECK GRADUATE INSTITUTE OF APPLIED LIFE SCIENCES	535 WATSON DR	CLAREMONT	CA	91711	LOS ANGELES	USA	Trans-NIH Research Support	002	1	7/12/2023	SUPPLEMENT FOR EXPANSION	$1,219,945
2023	2021	KECK GRADUATE INSTITUTE OF APPLIED LIFE SCIENCES	535 WATSON DR	CLAREMONT	CA	91711	LOS ANGELES	USA	Trans-NIH Research Support	001	1	7/11/2023	NEW	$0
2023	2021	KECK GRADUATE INSTITUTE OF APPLIED LIFE SCIENCES	535 WATSON DR	CLAREMONT	CA	91711	LOS ANGELES	USA	Trans-NIH Research Support	000	1	5/23/2023	NEW	$0
														Subtotal = $1,219,945

Issue Date FY: 2021 ( Subtotal = $1,851,627 )
2021	2021	KECK GRADUATE INSTITUTE OF APPLIED LIFE SCIENCES	535 WATSON DR	CLAREMONT	CA	91711	LOS ANGELES	USA	Trans-NIH Research Support	000	1	9/16/2021	NEW	$1,851,627
														Subtotal = $1,851,627

Grand Total All Awards = $3,071,572
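
The subtotals and grand total above can be cross-checked with a short script. The following is a minimal sketch, not part of the award record: the tuple layout and the names ACTIONS and subtotals_by_issue_fy are assumptions introduced for illustration, and the amounts are copied by hand from the Action Amount column of the table.

```python
from collections import defaultdict

# (issue_date_fy, action_date, action_type, action_amount_usd)
# Values transcribed from the transaction table; the tuple layout is an
# illustrative assumption, not a format defined by the source system.
ACTIONS = [
    (2026, "2/11/2026", "OTHER REVISION", 0),
    (2025, "4/7/2025", "OTHER REVISION", 0),
    (2025, "3/25/2025", "TERMINATION", 0),
    (2025, "4/7/2025", "OTHER REVISION", 0),
    (2025, "3/25/2025", "TERMINATION", 0),
    (2024, "8/28/2024", "SUPPLEMENT FOR EXPANSION", 0),
    (2024, "8/27/2024", "NEW", 0),
    (2023, "7/12/2023", "SUPPLEMENT FOR EXPANSION", 1_219_945),
    (2023, "7/11/2023", "NEW", 0),
    (2023, "5/23/2023", "NEW", 0),
    (2021, "9/16/2021", "NEW", 1_851_627),
]


def subtotals_by_issue_fy(actions):
    """Sum action amounts per Issue Date FY (hypothetical helper)."""
    totals = defaultdict(int)
    for issue_fy, _action_date, _action_type, amount in actions:
        totals[issue_fy] += amount
    return dict(totals)


subtotals = subtotals_by_issue_fy(ACTIONS)
grand_total = sum(subtotals.values())

# Print one line per fiscal year, newest first, then the overall total.
for fy in sorted(subtotals, reverse=True):
    print(f"Issue Date FY {fy}: Subtotal = ${subtotals[fy]:,}")
print(f"Grand Total All Awards = ${grand_total:,}")
```

Under these assumptions the only nonzero actions are the 9/16/2021 NEW award and the 7/12/2023 SUPPLEMENT FOR EXPANSION, so the grand total is simply their sum.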
