Unexpected complexity in bacterial genomes - SUMMARY The textbook view of bacterial genomes shows a set of discrete genes, transcribed individually or as operons. Transcription initiates at promoters upstream of these genes/operons, producing mostly protein-coding mRNAs along with a smaller number of stable, functional RNAs (tRNAs, rRNAs, sRNAs). Transcription factors bind close to promoters and regulate transcription from those promoters. Transcription terminates downstream of genes, in 3’ UTRs. This view has been the basis for decades of work on gene expression and gene regulation, with enormous advances in our understanding of these processes. However, work from my group and others has shown that bacterial genomes are far more complex. We will leverage my expertise in genetics, genomics, and molecular biology, to continue productive lines of research on four overlapping topics that relate to the major research focus of my group: the unexpected complexity of bacterial genomes. My lab has been very productive on this topic, with 19 papers since 2016 directly relevant to the four themes described in this proposal. Topic #1. Pervasive transcription. We and others have shown that most bacterial promoters are not in intergenic regions, upstream of genes. Rather, they are located within genes, in sense or antisense orientations, and are involved in “pervasive transcription”, whereby short, non-coding RNAs are transcribed before being rapidly terminated by Rho and degraded. The majority of these RNAs are believed to be non-functional, and suppression of pervasive transcription is required to maintain cell fitness. Topic #2. Non-canonical transcription factor (TF) binding. We have mapped the direct and indirect regulatory targets of hundreds of TFs across a wide range of bacterial species. Most TF binding sites are located within genes, not intergenic regions. Moreover, most TF binding events are not associated with detectable regulation of a nearby gene. Our data also show that in vivo binding profiles are often not well explained by a DNA sequence motif, suggesting a role for other factors in determining the genomic sites of TF binding. Topic #3. Widespread gene regulation by attenuation. We have shown that transcription of many Escherichia coli genes is prematurely terminated by the conserved termination factor Rho, either in the 5’ UTR or ORF, a process commonly referred to as “attenuation”. Attenuation has been described previously, but our data indicate that it happens on a much larger scale than previously appreciated. We are interested in the mechanisms of attenuation involving Rho termination, with a particular focus on the role of upstream ORFs (uORFs) that function as cis-acting regulators, since we have identified large numbers of these ORFs in diverse bacterial species. Topic #4. Processive antitermination. RNA polymerase can be protected from the action of the Rho termination factor in a process known as “processive antitermination”. We will identify new regulatory targets of known antiterminator proteins, we will determine the mechanisms of antitermination, and we will discover new antiterminator proteins.