PROJECT SUMMARY (from original grant)
Increasingly persuasive evidence suggests genomic variants driving derived features in humans and among
primates are enriched in regulatory elements, but the vast majority of these evolutionarily relevant variants
have yet to be discovered or characterized. This is unfortunate, as among the approximately 35 million single
nucleotide substitutions (SNPs), 5 million insertions or deletions (indels), and 90 megabases of structural
variants where the human and chimpanzee genomes differ are countless variants associated with
development, function, or disease. Identifying evolutionarily relevant genetic variants, as well as those
implicated in disease or function, can be guided by the analysis of species differences in intermediate
molecular phenotypes (e.g., transcriptomic and epigenomic signatures), which are most likely the primary
effects determined by genomic variation. In this proposal, we propose to perform primate comparative
functional genomics to uncover genetic variants explaining lineage-specific phenotypes affecting the human
and non-human primate (NHP) brain, an organ exhibiting pronounced molecular and functional differences
between species. To do so, in our first aim we will develop a taxonomy of gene expression and open chromatin
across primates, applying single nucleus RNA-seq and single nucleus ATAC-seq to study the mid- fetal and
neonatal (late fetal and early infancy) development of the post-mortem human and NHP brain, as well as brain
organoid co-cultures containing cells differentiated from multiple primate stem cells and fibroblasts and
lymphoblastoid cell lines. In our second aim, we will complement this atlas of species differences in gene
expression and open chromatin by cataloguing SNPs, indels, and large, complex structural variants in multiple
primate species. This will allow us to differentiate between lineage-specific (i.e., human versus chimpanzee and
macaque) and Hominidae-specific (i.e., human and chimpanzee versus macaque) genomic variants. Finally, in
our third aim we will integrate and functionally validate, using the Massively Parallel Reporter Assay,
CRISPR/Cas9 genome editing, human induced pluripotent stem cells, and mouse models of neural
development, key regulatory elements and de novo genes identified through these experiments. Through these
aims, we will identify and functionally validate genomic variants and patterns of gene expression and open
chromatin potentially driving derived phenotypes in the human and non-human brain and consequently
plausibly associated with human cognition, social behaviour, and neuropsychiatric disease.