ABSTRACT
Diverticulitis is among the most common gastrointestinal conditions leading to emergency department visits and
hospitalizations. Uncomplicated diverticulitis is usually mild and self-limiting, but about 10% of patients will
develop complications including abscess, fistula, obstruction, bleeding, or perforation, and over 20-30% will
experience at least one recurrence or ongoing gastrointestinal complaints. Although up to 50% of patients with
complicated diverticulitis require emergent surgery, there remains considerable uncertainty about whether most
other diverticulitis patients should be medically managed or offered elective segmental colectomy. Recent
guidelines recommend a personalized approach, with consideration of segmental resection according to risk for
recurrence or future complications. However, little is known about factors predisposing patients to unfavorable
outcomes. Among patients with their first episode of diverticulitis, there is a significant unmet need to identify
those at higher risk of recurrence or future complications to guide decision-making about clinical management.
Thus, I propose to apply artificial intelligence (AI) techniques to clinical data including natural language
processing (NLP) for narrative notes (Aim 1) and deep learning for imaging (Aim 2) in a large-scale electronic
health record (EHR) database to predict recurrence and long-term complications of diverticulitis. Building upon
findings from the first two aims, in Aim 3, I will validate the prediction model for diverticulitis recurrence in an
external EHR dataset. My overarching goal is to develop a clinical decision support tool incorporating NLP of
clinical documentation, computed tomography (CT)-based deep learning, lifestyle, and clinical factors to predict
outcomes in diverticulitis. Findings from this project will enable the utilization of comprehensive information from
EHR for diverticulitis research, advance understanding of novel predictors of long-term outcomes of diverticulitis,
and contribute to the development of a clinical prediction tool to facilitate risk stratification and guide disease
management. Through this work, I will receive formal training in using NLP for unstructured clinical notes;
developing deep learning algorithms for imaging; and modeling for clinical risk prediction. I will be
mentored/advised by an interdisciplinary team that includes Dr. Andrew Chan, a gastroenterologist and
epidemiologist with expertise in diverticulitis; Dr. Katherine Andriole, an expert in radiology imaging informatics,
deep learning, and machine learning; Dr. Ashwin Ananthakrishnan, a gastroenterologist with expertise in utilizing
NLP in EHR; Dr. Lisa Strate, a gastroenterologist with expertise in diverticular disease; Dr. Avinash
Kambadakone-Ramesh, a radiologist with expertise in abdominal imaging and advanced CT applications; and
Dr. Alisa Manning, a biostatistician with expertise in clinical research using EHR data. The outstanding training
opportunities with key leaders in their respective fields will provide me with new skills in the application of AI
methods including NLP, machine learning, and deep learning to clinical data and risk prediction modeling,
positioning me for a successful, independent career in advancing precision medicine in digestive diseases.