Overture is a collection of modular, open-source software components designed to make the management of
big-data genomic projects both easier and more cost-effective. The Overture system promotes FAIR data
sharing of cancer genomic datasets by overcoming the major obstacles in storing, managing, exploring, and
distributing massive genome-scale datasets.
In this proposed project, we will implement a turnkey Overture Data Management System (DMS) to provide a
complete solution to manage cancer genomic data throughout the data life cycle, and to continue lowering the
technical barriers faced by researchers in the cancer informatics community. To extend the capabilities and
potential of the Overture DMS, we will establish a new software framework for integration with third-party
visualization applications, and for data exchange with analysis platforms. This framework will empower the
community to continue developing visualization applications tailored to specific research interests
and data types, and to contribute them as “apps” that integrate with the Overture DMS.
Over the next five years, we will: (1) combine the core set of Overture components into a scalable DMS
installation bundle that supports both local and cloud deployment for datasets ranging from small to consortium scale;
(2) extend the DMS with third-party visualization and analysis tools through a novel software data integration
framework; (3) enhance data access controls by implementing Global Alliance for Genomics and Health
(GA4GH) Passport and Data Use Ontology standards; (4) enable federated search of cancer datasets
across different Overture platforms; and (5) improve outreach and community resources to meet the needs of
bioinformaticians, data producers and managers, data scientists, and clinical researchers. The Overture DMS
will relieve researchers of the complications of large-scale data management and advance scientific
research by increasing data sharing and discovery.