Project Summary
Combinatorial screens characterize the molecular pathways of the cell by identifying interactions between
perturbations. Quantifying interactions requires a notion of non-interaction (i.e., a null interaction model), which
is well-established for combinatorial screens measuring cell survival and other simple phenotypes. However,
recent advances in single-cell omics and automated microscopy have enabled combinatorial screens with high-
dimensional phenotypic profiles. These phenotypic profiles consist of features with unknown, non-linear
behavior, such as gene expression or image features. As a result, there is no well-defined null interaction model
for combinatorial screens with high-dimensional phenotypes.
The goal of my proposal is to establish a null interaction model for high-dimensional phenotypes. My approach,
termed DENIM (Data-driven Estimation of Null Interaction Models), aims to use robust estimation models to learn
an additive representation for null perturbations and improve quantification of interactions across combinatorial
screen datasets. In addition, this framework has a key application in drug target identification by enabling
accurate prioritization of chemical-genetic interactions from microscopy-based phenotypic profiles.
My central hypothesis is that null interaction models which accurately model the diverse behavior of high-
dimensional features can substantially improve interaction detection. In Aim 1, I will develop and critically assess
DENIM using both simulated and real-world combinatorial datasets with gene expression or high-content imaging
readouts. I will evaluate DENIM against existing baselines on the ability to accurately interaction strength in
simulation, as well as recapitulate known interactions between gene pairs or drug pairs. Preliminary data from
simulation demonstrates that DENIM can accurately identify and learn non-linear perturbation trajectories, which
leads to improvement in performance relative to the standard baseline (i.e., vector addition). In Aim 2, I will
leverage the improved prioritization of interactions provided by DENIM to perform drug target identification. This
target identification platform requires large-scale genetic perturbations and the study of chemical-genetic
interactions. To this end, preliminary results show that large-scale genetic perturbations can be accurately
conducted using pooled genetic screening followed by in situ sequencing for guide demultiplexing. Moreover, I
analyzed a pilot screen on a well-established MEK1/2 inhibitor, where I accurately recovered target pathway
components, with similar performance using drug-relevant or drug-agnostic biomarkers. Together, completion of
this proposal will result in a rigorous framework for high-dimensional interactions, benchmarking of null
interaction models, and a novel platform for chemical-genetic target identification using microscopy-based
phenotypic profiles.