Project Summary
Cryo-electron microscopy (cryo-EM) is now a widely established and indispensable method for determining the
high-resolution structures of biomedically important molecules. Given that thousands of images, often acquired
over the course of several days, are required to obtain such structures, automation software has played a critical
role in the large-scale adoption of this method by the scientific community. In just the past five years, cryo-EM
has revolutionized our understanding of entire biological systems, and in 2020 provided the first molecular
descriptions of SARS-CoV-2 interaction with neutralizing antibodies. The widespread adoption of cryo-EM
recently prompted the NIH to invest in three National Centers through the Transformative High Resolution Cryo-
Electron Microscopy Program, providing free, high-end electron microscope access to biologists across the
country. The exponential increase in the popularity of cryo-EM has led to an astonishing number of developments
in sample preparation methodologies and image processing algorithms, which have improved attainable
resolution of single particle reconstructions. However, comparatively little progress has been made in optimizing
the quality of the cryo-EM data being collected. The pioneering software packages Leginon and Appion
demonstrated the power of automated data acquisition and real-time processing (respectively), and there are
now numerous programs for automated data acquisition and real-time processing. Despite advances in
automation, optimally extracting the highest quality data from an EM sample still requires manual involvement of
an expert electron microscopist. User intervention and expertise is necessary to run the appropriate image
analyses, interpret the results, and make informed decisions on how the processed results relate to the ongoing
data collection. However, even experts must content with the fact that the “best grid regions” differ drastically
from sample to sample, and there are no established tools for automatically and quickly assessing the quality of
the specimen across the various microenvironments of an EM grid. Given the ever-increasing incorporation of
cryo-EM into labs’ research programs, it is imperative that data collection and processing be streamlined to
match the growing needs of the structural community. We propose to develop a second generation
Leginon/Appion software package, “Magellon”, to overcome existing bottlenecks and provide an avenue toward
fully automated data acquisition that bypasses need for user input during data collection. Importantly, this
software will support the computational infrastructure to enable real-time image processing results to inform on
and modify the ongoing data collection regime by learning where to acquire images in regions that will yield the
highest resolution structures. We will develop and incorporate new, fast image assessment routines, while also
providing an application programming interface to enable the incorporation of extensions and plugins from
developers in the community. Further, Magellon will enable straightforward, seamless import and export of data
from its database to accommodate remote data acquisition at any of the regional or national cryo-EM centers.