Project Summary/Abstract
Glaucoma is a multifactorial disease affecting millions worldwide, leading to substantial visual impairment and
blindness. The lack of reliable tools to predict disease risk and progression has resulted in preventable vision
loss and inefficient use of medical resources. In recent years, polygenic risk scores (PRS), which quantify the
cumulative risk of disease based on multiple genetic variants, offer a promising approach for personalized risk
prediction. However, the performance of these scores remains moderate at best limiting their clinical use. This
project aims to improve PRS performance using a multipronged approach combining innovative genomics and
computational strategies, thereby improving individualized glaucoma risk assessment and prognostication. We
will achieve this goal through the following the following 3 specific aims: 1) Improve PRS Performance through
Enhanced EHR-Based Disease Phenotyping: leveraging cutting-edge Natural Language Processing (NLP)
techniques and data from 68000+ patients across two clinical centers, we will train models to extract refined
glaucoma phenotypes from unstructured EHR data and demonstrate improved performance compared to
traditional ICD-based phenotyping. Results from genome-wide association study (GWAS) of our NLP
phenotypes from over 1 million participants across two biobanks will be used to develop improved PRS for
glaucoma. 2) Utilize ML-Derived Structural Phenotypes to Enhance PRS Performance: by conducting GWAS
on our previously developed machine learning-derived Optical Coherence Tomography (OCT) glaucoma
phenotypes from 3 biobanks including 53000+ subjects, we will pinpoint novel genetic variants and integrate
these variants with previously known variants to create PRS with improved performance. 3) Improve PRS
performance by Constructing Subtype and Pathway-Specific PRS Models: with genomic data from 312,944 All
of Us participants, we will develop specialized PRS models for open-angle glaucoma subtypes as well as
pathway-specific scores based on relevant biologic pathways. These models, informed by previous GWAS
results, will enhance our understanding of glaucoma's complex genetic risk, and better predict disease severity,
trajectory, and treatment response. The culmination of these innovative methodologies seeks to transform
glaucoma care, promoting precision in early detection and tailored management. This interdisciplinary
approach, merging computational expertise with robust genomic insights, holds promise to set new standards
in ophthalmologic precision medicine.