PROJECT SUMMARY
CRISPR-associated transposons (CAST) represent a class of recently discovered bacterial genetic
elements that can perform programmable transposition. Using a user-provided RNA sequence, CASTs can
insert kilobases of DNA into specific sites in the genome, a crucial function that cannot be achieved with
current state of the art genome editing tools. Thus, CASTs match an unmet need in therapeutics for genetic
disorders plagued by gene mutations like cystic fibrosis. To optimize CASTs for application in next-generation
genome editing tools, a comprehensive understanding of how it inserts DNA is required. Of particular interest
is its unidirectionality, referring to CAST’s propensity to insert DNA in a single orientation. While
unidirectionality is a coveted feature of CASTs, its mechanism remains uncharacterized. Components implied
to contribute to this mechanism are the signature ends flanking the element. These ends recruit proteins, TnsB
in model system shCAST, that excise the transposon from its original location to prepare it for insertion
elsewhere. However, these ends are different in sequence, described in this proposal as “asymmetric”,
suggesting that TnsB performs different functions at each end that could be used to achieve unidirectionality.
This project proposes an interdisciplinary project that will identify key drivers for unidirectionality.
Specific aim 1 will use high-resolution cryo-electron microscopy (cryo-EM) to structurally visualize the shCAST
paired-end complex, a stage of transposition involving interactions between the asymmetric ends facilitated by
TnsB. This structure will identify key features that distinguish the ends from one another from a 3-D standpoint.
Follow-up characterization of the transpososome, the next stage of transposition involving the assembly of the
paired-end complex with other CAST proteins, will recognize how the identified features convey
unidirectionality. Specific aim 2 will utilize bioinformatics to identify features conserved across all CAST
systems that establish orientation specificity. Metagenomic mining will scan the vast trove of sequence data to
compile a comprehensive list of CAST elements. This expansive set enables evolutionary analysis of CASTs to
determine global trends maintaining unidirectionality and will define each element’s mechanism based on
retention of conserved features. Biochemical assays will be used to validate features identified in either aim.
The PI will be extensively trained in both cryo-EM and bioinformatics skills along with critical thinking to
ensure seamless integration of experimental and computational results. The Kellogg and Feschotte labs at
Cornell University have access to a wide-ranging set of resources that ensure mastery of these skills and will
also emphasize soft skills like communication to ensure the PI’s development as a well-rounded scientist.
Together, this proposal sets forth a plan to comprehensively understand CAST unidirectionality and its
associated mechanisms, enabling its optimization and implementation in genome editing tools of interest.