Project Summary
Transposable elements (TEs) are mobile genetic elements that can propagate within the host DNA and have
a significant impact on the genome organization and function. TEs are ubiquitous across the tree of life and
occupy substantial fractions of animal genomes - about 20% of the D. melanogaster, and 45-50% of human and
mouse genomes consist of TE fragments that have accumulated throughout evolution, and active TEs are a
major source of genetic and epigenetic variation. Despite their prevalence, TEs’ unusual characteristics and high
copy numbers left them poorly annotated, and among the most enigmatic and understudied genetic elements.
TEs are typically seen as harmful, as their mobility causes DNA damage and can impact the host genome
and transcriptome by directly disrupting functional elements or introducing ectopic binding sites for transcriptional
and epigenetic regulators. To prevent their deleterious activities, TEs are targeted by various silencing
mechanisms. Among these, the piRNA pathway enforces transcriptional and post-transcription repression of TEs
in animal germlines, which is crucial for fertility and preserving the integrity of genetic information across
generations. On the other hand, TEs are an important source of evolutionary innovation and there is a growing
body of literature on TE-derived regulatory regions and functional products that became incorporated into host
regulatory networks. Interestingly, early embryogenesis of both vertebrates and invertebrates is characterized
by a spike of TE activity, and at least in mice, timely activation of specific TEs is essential for normal
developmental progression. However, the diversity of products from TEs and their functional roles in somatic
cells in both systems remain poorly characterized, largely owing to long-standing technical difficulties in the
genomic analysis of elements that exist in high copy numbers.
The molecular mechanisms of TE regulation and the functional implications of TE activity are central areas
of interest for my laboratory. I present a research program that addresses key questions within two major aspects
of TE biology: 1) the mechanism and regulation of piRNA-mediated TE silencing in the germline and 2) the
characteristics and functions of somatic TE expression during development, using the classic model Drosophila
as a paradigm. First, I propose a focused strategy to dissect the regulation and function of several key piRNA
pathway components, building on my previous findings that protein SUMOylation plays essential roles in piRNA
biogenesis and TE silencing. In parallel, I plan to leverage state-of-the-art long-read and single-cell sequencing
technologies to overcome historical limitations to the genomic analysis of the TE-derived transcriptome, with the
long-term goal of elucidating the molecular basis and functional consequences of somatic TE activity in the
developing organism.