Deep Learning for 3-D reconstruction of heterogeneous molecular structures from Cryo-EM data
Lead Research Organisation:
Imperial College London
Department Name: Electrical and Electronic Engineering
Abstract
Determining the structure of biomolecules is the goal of structural
biology and is essential to understand biological mechanisms
responsible for life and in drug discovery. Biological macromolecules
can be thought of as complex machines that perform complex
operations in living cells. These dynamic machines pass through
various conformations in the course of their actions and a complete
understanding of their working requires the determination of multiple
conformations.
Single particle Electron Cryo-Microscopy (Cryo-EM) has emerged as a unique method to determine molecular structures at near-atomic resolution. Achieving high-resolution estimation of structures of dynamic protein complexes requires large numbers of images and computationally intensive algorithms. Such reconstruction problems have been often approached by devising methods that use information about the imaging procedure and the properties of the object that needs to be reconstructed and many remarkable breakthroughs have been achieved over the years. These computational approaches are called "model-based" methods and
have the advantage to be predictable and stable. However, they can be computationally expensive and do not always derive maximum value from complex data. In particular, they are often unable to resolve complex heterogeneous structures. In contrast "data-driven" methods like deep neural networks have demonstrated, in other contexts, a remarkable ability to improve the quality of biomedical images. The problem with many deep learning approaches is that they are not predictable in the sense that often even small deviations in the input
data can result in a huge deviation of the output, which can have devastating effects in bio-imaging applications. Moreover, it is often very difficult to interpret what a deep network machine is really optimizing.
This project will advance a new family of deep neural networks for the
3-D reconstruction of dynamic protein complexes from cryo-electron
data. By working closely with structural biologists, we will put forward
approaches that systematically embed prior knowledge and constraints
about the signal and the physics of the data formation process into the
deep neural network architectures. We will also collaborate with Prof.
M. Unser and his team from EPFL Switzerland. They will provide
expertise in the area of analysis of stability of deep neural networks
and will share their experience in developing AI-based methods for
molecule reconstruction from Cryo-EM data. We expect that this
approach and these collaborations will allow us to introduce stable and
interpretable neural networks able to resolve heterogeneous biological
structures at a resolution that current methods cannot. The methods
produced will be in open-source format, integrated in existing
computational suites like CCP-EM and made available to the broadest
possible community.
biology and is essential to understand biological mechanisms
responsible for life and in drug discovery. Biological macromolecules
can be thought of as complex machines that perform complex
operations in living cells. These dynamic machines pass through
various conformations in the course of their actions and a complete
understanding of their working requires the determination of multiple
conformations.
Single particle Electron Cryo-Microscopy (Cryo-EM) has emerged as a unique method to determine molecular structures at near-atomic resolution. Achieving high-resolution estimation of structures of dynamic protein complexes requires large numbers of images and computationally intensive algorithms. Such reconstruction problems have been often approached by devising methods that use information about the imaging procedure and the properties of the object that needs to be reconstructed and many remarkable breakthroughs have been achieved over the years. These computational approaches are called "model-based" methods and
have the advantage to be predictable and stable. However, they can be computationally expensive and do not always derive maximum value from complex data. In particular, they are often unable to resolve complex heterogeneous structures. In contrast "data-driven" methods like deep neural networks have demonstrated, in other contexts, a remarkable ability to improve the quality of biomedical images. The problem with many deep learning approaches is that they are not predictable in the sense that often even small deviations in the input
data can result in a huge deviation of the output, which can have devastating effects in bio-imaging applications. Moreover, it is often very difficult to interpret what a deep network machine is really optimizing.
This project will advance a new family of deep neural networks for the
3-D reconstruction of dynamic protein complexes from cryo-electron
data. By working closely with structural biologists, we will put forward
approaches that systematically embed prior knowledge and constraints
about the signal and the physics of the data formation process into the
deep neural network architectures. We will also collaborate with Prof.
M. Unser and his team from EPFL Switzerland. They will provide
expertise in the area of analysis of stability of deep neural networks
and will share their experience in developing AI-based methods for
molecule reconstruction from Cryo-EM data. We expect that this
approach and these collaborations will allow us to introduce stable and
interpretable neural networks able to resolve heterogeneous biological
structures at a resolution that current methods cannot. The methods
produced will be in open-source format, integrated in existing
computational suites like CCP-EM and made available to the broadest
possible community.
Publications
| Description | We have developed a neural network-based method to reconstruct the 3D structure of molecules from Cryo-EM data. The method can achieve higher resolution than existing state-of-the-art approaches and can operate with a small number of projections. |
| Exploitation Route | Structural biologist can use our method to analyse data that they produce with their own Cryo-EM devices. Experts in machine learning and signal processing can build on our method for further expansions in e.g., Cryo-ET data. |
| Sectors | Digital/Communication/Information Technologies (including Software) Education Pharmaceuticals and Medical Biotechnology |
| Description | Collaboration with EPFL and Professor Michael Unser |
| Organisation | Swiss Federal Institute of Technology in Lausanne (EPFL) |
| Country | Switzerland |
| Sector | Public |
| PI Contribution | The Team at Imperial is developing new Deep-learning methods for reconstruction of 3-D molecular structures from Cryo-EM data which was discussed during a visit in September 2024. |
| Collaborator Contribution | Prof. Michael Unser is a research leader in inverse problems and on stability of deep neural networks. He has also contributed significantly to the development of deep-learning methods for Cryo-EM. He shared his expertise in the area suggesting ways to improve our current prototype for 3-D reconstruction of molecules from Cryo-EM data. He is also director of the Center for Imaging at EPFL and so he organized a meeting with the executive director of the centre (Dr Laurene Donati) where they discussed how the center facilitate multi-disciplinary research in the imaging sector. The PI of this project (Prof. Dragotti) also have a working breakfast with the then President of EPFL, Prof. M. Vetterli. |
| Impact | Exchange of knowledge in the area ad the intersection of Deep-learning and Cryo-EM |
| Start Year | 2024 |
| Description | Inverse Problems in the Age of AI |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Postgraduate students |
| Results and Impact | Keynote talk given at the Ideal 2024 conference held in Valencia in November 2024. |
| Year(s) Of Engagement Activity | 2024 |
| Description | Model-Based Deep Learning for Inverse Problems in Imaging |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Postgraduate students |
| Results and Impact | Seminar given at the Center for Imaging at EPFL (and also broadcast on YouTube). Seminar covered new Deep-learning methods for inverse problems in Imaging that have been recently developed by the PI of this grant together with his team. |
| Year(s) Of Engagement Activity | 2024 |
| Description | Model-Based Deep Learning for Inverse Problems in Imaging |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Postgraduate students |
| Results and Impact | This is a seminar given at Based University on the research conducted by the PI and his team in the area of Deep Learning for Cryo-EM |
| Year(s) Of Engagement Activity | 2024 |