ABSTRACT
The ultimate goal of structural biology is to visualize biomolecules in action in their native environment and to
establish their structure-function relationship. Cellular cryo-electron and X-ray tomography have emerged as
powerful techniques for imaging complex biological samples such as intact cells, organelles, macromolecular
machines, and for quantifying the internal organization of biological objects in their native states in situ at
resolutions ranging from a few microns to tens of nanometers with X-rays to tens of angstroms with electrons.
However, compared to the mature techniques of X-ray crystallography and single-particle cryo-electron
microscopy, cellular tomography is yet to reach its potential due to a severe degradation in the resolution of
reconstructions because of the effects of mechanical misalignment and non-rigid sample deformation due to
radiation damage, missing-wedge artifacts, low signal-noise ratio in a crowded environment, and unresolved
conformational heterogeneity. To address these issues, we propose to leverage our new approach for automated
joint 3D alignment and regularized reconstruction that combines advances in iterative projection methods and
convex optimization to achieve better than state-of-the-art reconstruction resolution from severely misaligned
data. Infusing our framework with new advances in mathematical modeling and machine learning provides a
clear path to a host of new model-based and data-driven algorithms that could address current challenges and
bottlenecks in the analysis of cellular tomography data. In particular we propose to (1) develop techniques for
improved tilt-series alignment that account for rigid-body motion of the sample and recover the anisotropic
effects of radiation-induced warping by using optical flow alignment; (2) develop a decoder that leverages the
full frequency information contained in randomly oriented macromolecules in the cell volume to constrain the
effects of the missing-wedge; and (3) improve the resolution of subtomograms extracted from the refined volume
by developing a volume-encoder--deformation-decoder deep neural network to model conformational
heterogeneity. By developing new data-driven methods that constrain the missing-wedge information and treat
shape variability as a continuum of non-rigid deformations rather than discrete clusters, our algorithmic
framework will provide significant improvements in the resolution and quality of reconstructions over currently
existing methods for data analysis that neglect these effects. As the structural biology community is increasingly
focusing on cellular tomography, there is a growing need for easy to use, automated software amenable to both
experienced and novice users. (4) To this end, algorithms resulting from this proposal will be turned into GPU-
enabled open-source user-friendly software to accelerate the analysis of the growing pool of imaging data.
Ultimately, our algorithmic framework will be capable of yielding high-resolution structures from noisy,
incomplete and complex data, thereby enhancing the predictive power of cellular tomography to answer
important biological questions.