Novel Chemical Probes for Sequencing Multiple DNA Modifications at Single-Nucleotide Resolution - Human DNA is susceptible to chemical and physical agents from endogenous and environmental
sources, producing various DNA modifications. Research has documented a plethora of DNA modifications,
including more than 50 endogenous nucleobase modifications and many covalent adducts derived from
environmental chemicals. Certain DNA modifications function in gene regulation, whereas other lesions have
mutagenic and pathogenic effects. Recent sequencing data have revealed that the distribution of DNA
modifications in the genome is not uniform. Mapping DNA modifications on a genome-wide scale is critical for
clarifying their roles in genetic regulation, development, and pathogenesis. Unfortunately, current methods for
sequencing DNA modifications suffer from one or more drawbacks in terms of sensitivity, specificity, resolution,
and throughput. This proposal addresses these limitations by developing a novel DNA sequencing method on
Illumina sequencers to map more than 10 DNA modifications simultaneously. The successful completion of this
proposal will facilitate the PI’s long-term goal of deciphering the functional importance of DNA modifications in
mutagenesis and gene regulation. The research exploits the chemistry of DNA repair and develops highly
specific chemical probes for sequencing multiple DNA modifications at single-nucleotide resolution. These novel
chemicals capture and enrich abasic (AP) sites, a central intermediate in DNA repair. In addition, two chemical
probes serve as unique locator codes during amplification, allowing sequencing readout. Simultaneous mapping
of different DNA lesions will be achieved through coupling lesion-specific DNA repair enzymes with multiplex
sequencing. The proposal is grounded on our compelling data demonstrating the feasibility of two synthetic
probes to label and enrich AP DNA with high specificity and sensitivity. The proposed sequencing platform will
be further developed and optimized via two aims. Aim 1 is to optimize the workflow for simultaneous sequencing
multiple alkylated DNA modifications. Aim 2 is to synthesize another novel compound for sequencing cytosine
modifications and mispairs. The expected outcome is that the proposed method will address a major unmet need
in sequencing multiple DNA modifications on Illumina sequencers. In the long run, the developed technology will
aid the generation of single-nucleotide resolution genomic maps for various DNA modifications in a high-
throughput and cost-effective manner. The proposed research is significant because, compared to other Illumina-
based methods, the technology will allow greater than one order of magnitude improvement over existing
methods in the number of modifications sequenced, complementing the recent progress with PacBio and
Nanopore technologies. The innovation of the project lies in the development of novel chemical probes to
facilitate enrichment, creative use of multiple repair enzymes to ensure mapping accuracy, and the two unique
locator probes to allow amplification and sequencing readout. Together, the innovative method will achieve
unprecedented specificity and sensitivity, which reduce sequencing depth and costs.