The UK High-End Computing Consortium for Biomolecular Simulation
Lead Research Organisation:
University of Southampton
Department Name: Sch of Chemistry
Abstract
There is tremendous future scope for biomolecular simulation to provide unprecedented insights into biomolecular systems. The level of detail afforded by these methods, along with their ability to rationalise experimental data and their predictive power are already enabling them to make significant contributions in a wide variety of areas that are crucial for healthcare, quality of life and the environment. The UK biomolecular simulation community has a strong international reputation, with world-leading efforts in in drug design and development, biocatalysis, bionano-technology, chemical biology and medicine. HECBioSim has already delivered outstanding research with impact in bionanotechology, drug design and AMR. But we have only just scratched the surface and there is currently huge room for expansion. Having access to the largest, most modern computing facilities is essential for this. Renewal of the Consortium will enable us to continue allocating time ARCHER for cutting-edge biomolecular simulations.
We will place a special emphasis on reaching out to experimentalists and scientists working in industry in order to foster interactions between computational and experimental scientists, and academia and industry to encourage integrated multidisciplinary studies of key problems.
Biomolecular simulation and modelling is an integral part of drug design and development. The pharmaceutical industry needs well-trained scientists in this area, as well as the development of new methods (e.g. for prediction of drug binding affinities, ligand selectivity and metabolism). Members of the consortium have a strong track record of collaboration with industry to deliver trained scientists and new methodologies. For example, PhD students trained by consortium members have recently taken up positions in UCB, Unilever, Oxford Nanoimaging and even Sky Broadcasting as software developer. Many of these academic-industry collaborations have been strengthened by work done through HECBioSim allocations.
The Consortium will continue to welcome new members from across the whole community. We will continue to develop computational tools and training for both experts and non-experts using biomolecular simulation on HEC resources. We propose to develop new tools that will enable inter-conversion between biomolecular systems at different levels of resolution thereby allowing users to tackle more ambitious 'grand challenges' than are currently feasible.
In summary HECBioSim will foster collaborations between computational and experimental scientists between scientists working in industry and academia in all disciplines within biomolecular simulation to maintain the UK as a world-leader in this field.
We will place a special emphasis on reaching out to experimentalists and scientists working in industry in order to foster interactions between computational and experimental scientists, and academia and industry to encourage integrated multidisciplinary studies of key problems.
Biomolecular simulation and modelling is an integral part of drug design and development. The pharmaceutical industry needs well-trained scientists in this area, as well as the development of new methods (e.g. for prediction of drug binding affinities, ligand selectivity and metabolism). Members of the consortium have a strong track record of collaboration with industry to deliver trained scientists and new methodologies. For example, PhD students trained by consortium members have recently taken up positions in UCB, Unilever, Oxford Nanoimaging and even Sky Broadcasting as software developer. Many of these academic-industry collaborations have been strengthened by work done through HECBioSim allocations.
The Consortium will continue to welcome new members from across the whole community. We will continue to develop computational tools and training for both experts and non-experts using biomolecular simulation on HEC resources. We propose to develop new tools that will enable inter-conversion between biomolecular systems at different levels of resolution thereby allowing users to tackle more ambitious 'grand challenges' than are currently feasible.
In summary HECBioSim will foster collaborations between computational and experimental scientists between scientists working in industry and academia in all disciplines within biomolecular simulation to maintain the UK as a world-leader in this field.
Planned Impact
Who might benefit from this research?
The direct beneficiaries of this research include academic and industrial scientists, including experimentalists working across a very wide range of disciplines including X-ray crystallography, NMR, electron microscopy, single molecule biophysics methods, mass spectrometry, hydrodynamics, enzyme kinetics, drug design medicinal chemistry, chemical and synthetic biology and design on biosensors. The two most obvious industrial beneficiaries are the pharmaceutical sector, which is of huge economic importance to the UK; and the biotechnology sector in which the UK is a world leader in e.g. nanopore DNA sequencing. Other industries (such as biocatalysis analysis and processing, IT hardware and software development) will also benefit from the UK High-End Computing Consortium for Biomolecular Simulation (HECBioSim) through our already established excellent links and new ones that will be fostered through the renewal of the Consortium. In the longer term, All of the work will impact on the general public through improvements in health and quality of life.
How will they benefit?
The focus of HECBioSim is to support access and use of high-end computing in the UK, both in terms of increased uptake and awareness by experimental colleagues, but also from experimental groups in both academia and industry. We also engage other interested parties and the general public.
Focused Training workshops and webinars:
We will undertake 3-4 workshops that will have a definitive focus on engaging more strongly with: i) industrial users, ii) experimental groups (mainly in academia but could also be from industry) and iii) emerging technology areas (including cryo-EM, and machine learning). We propose to deliver an annual webinar, showcasing HECBioSim outputs. Webinars offer the distinct advantage that they are virtual and can be accessed by anyone worldwide for zero cost to participants.
Making it easier for industry and academics to adopt.
In industry the use of electronic notebooks is widespread and the capture of metadata for experiments and simulations is absolutely critical. In order to make the use and adoption of HPC simulation data more viable, we plan to develop quality control metrics for simulation data and integrate capturing this information seamlessly into electronic notebooks. These electronic notebooks will be of benefit for industrial and academic scientists and will facilitate multidisciplinary projects.
Engagement to a more general audience.
We propose to put in a proposal for major science festivals including the Summer Science Exhibition at the Royal Society or the Cheltenham Science Festival to reach out to the wider community. Furthermore we will also generate a short promotional video that highlights the role of the HECBioSim and the impact that the work it supports has on UK science and the benefit to the general public. This will be hosted on our main website but we will disseminate via as many other routes as well (for example, youtube). We will highlight key results from the consortium via the website but also via social media, mainly via Twitter.
Maximizing code development potential
We plan to continue to develop our existing codes, which are already freely available Longbow and FESetup in addition to a new project which will focus around providing an easy-to-use interface for running coarse-grain and atomistic simulations and quantum mechanics calculations and interconverting between them, on HEC resources. This will be provided free of charge to the academic community along with training material.
Commitment to Diversity
We are committed to a philosophy of diversity within our science. To ensure this, we have made contact with Women in high Performance Computing ), in order to work closely with them to ensure a fair gender representation across all of our activities.
The direct beneficiaries of this research include academic and industrial scientists, including experimentalists working across a very wide range of disciplines including X-ray crystallography, NMR, electron microscopy, single molecule biophysics methods, mass spectrometry, hydrodynamics, enzyme kinetics, drug design medicinal chemistry, chemical and synthetic biology and design on biosensors. The two most obvious industrial beneficiaries are the pharmaceutical sector, which is of huge economic importance to the UK; and the biotechnology sector in which the UK is a world leader in e.g. nanopore DNA sequencing. Other industries (such as biocatalysis analysis and processing, IT hardware and software development) will also benefit from the UK High-End Computing Consortium for Biomolecular Simulation (HECBioSim) through our already established excellent links and new ones that will be fostered through the renewal of the Consortium. In the longer term, All of the work will impact on the general public through improvements in health and quality of life.
How will they benefit?
The focus of HECBioSim is to support access and use of high-end computing in the UK, both in terms of increased uptake and awareness by experimental colleagues, but also from experimental groups in both academia and industry. We also engage other interested parties and the general public.
Focused Training workshops and webinars:
We will undertake 3-4 workshops that will have a definitive focus on engaging more strongly with: i) industrial users, ii) experimental groups (mainly in academia but could also be from industry) and iii) emerging technology areas (including cryo-EM, and machine learning). We propose to deliver an annual webinar, showcasing HECBioSim outputs. Webinars offer the distinct advantage that they are virtual and can be accessed by anyone worldwide for zero cost to participants.
Making it easier for industry and academics to adopt.
In industry the use of electronic notebooks is widespread and the capture of metadata for experiments and simulations is absolutely critical. In order to make the use and adoption of HPC simulation data more viable, we plan to develop quality control metrics for simulation data and integrate capturing this information seamlessly into electronic notebooks. These electronic notebooks will be of benefit for industrial and academic scientists and will facilitate multidisciplinary projects.
Engagement to a more general audience.
We propose to put in a proposal for major science festivals including the Summer Science Exhibition at the Royal Society or the Cheltenham Science Festival to reach out to the wider community. Furthermore we will also generate a short promotional video that highlights the role of the HECBioSim and the impact that the work it supports has on UK science and the benefit to the general public. This will be hosted on our main website but we will disseminate via as many other routes as well (for example, youtube). We will highlight key results from the consortium via the website but also via social media, mainly via Twitter.
Maximizing code development potential
We plan to continue to develop our existing codes, which are already freely available Longbow and FESetup in addition to a new project which will focus around providing an easy-to-use interface for running coarse-grain and atomistic simulations and quantum mechanics calculations and interconverting between them, on HEC resources. This will be provided free of charge to the academic community along with training material.
Commitment to Diversity
We are committed to a philosophy of diversity within our science. To ensure this, we have made contact with Women in high Performance Computing ), in order to work closely with them to ensure a fair gender representation across all of our activities.
Organisations
- University of Southampton (Lead Research Organisation)
- Columbia University (Collaboration)
- University of Sheffield (Collaboration)
- European Cooperation in Science and Technology (COST) (Collaboration)
- Case Western Reserve University (Collaboration)
- University of Virginia (UVa) (Collaboration)
- Henry Royce Institute (Collaboration)
- UCB SA (Collaboration)
- Trinity College Dublin (Collaboration)
- Complutense University of Madrid (Collaboration)
- Argonne National Laboratory (Collaboration)
- University of Oxford (Collaboration)
- JOHN INNES CENTRE (Collaboration)
- Medical Research Council (MRC) (Collaboration)
- AstraZeneca (Project Partner)
- Vernalis plc (Project Partner)
- UCB Pharma (Project Partner)
Publications
Abraham M
(2019)
Sharing Data from Molecular Simulations
Abraham M
(2019)
Sharing Data from Molecular Simulations.
in Journal of chemical information and modeling
Abramiuk M
(2024)
Biocatalytic pathways, cascades, cells and systems: general discussion.
in Faraday discussions
Acevedo-Rocha C
(2024)
Artificial, biomimetic and hybrid enzymes: general discussion.
in Faraday discussions
Acevedo-Rocha C
(2024)
Enzyme evolution, engineering and design: mechanism and dynamics: general discussion.
in Faraday discussions
Akter F
(2023)
Binding pocket dynamics along the recovery stroke of human ß-cardiac myosin
in PLOS Computational Biology
Al-Otaibi NS
(2020)
A new spin on flagellar rotation.
in Nature microbiology
Ali HS
(2019)
Entropy of Simulated Liquids Using Multiscale Cell Correlation.
in Entropy (Basel, Switzerland)
Related Projects
| Project Reference | Relationship | Related To | Start | End | Award Value |
|---|---|---|---|---|---|
| EP/R029407/1 | 01/02/2019 | 30/07/2021 | £321,432 | ||
| EP/R029407/2 | Transfer | EP/R029407/1 | 31/07/2021 | 29/04/2023 | £132,343 |
| Description | Collaboration of Khalid with Rommie Amaro (UCSD, USA) and Michael Feig (Michigan, USA) New collaboration initiated in April 2020 to model the M protein of the SARS-Cov2 virus. The structure of this protein (the most abundant protein in the viral membrane) is not known yet it is essential for the structural integrity of the viral particle. Khalid, Amaro and Feig are using bioinformatics, molecular modelling and multiscale simulations to predict the structure of the protein and its arrangement within the viral membrane including around the spike protein. The coarse-grained simulations are being run on Archer while the atomistic simulations are being run on Frontera at the Texas Advanced Supercomputer Center. A manuscript is in preparation. The collaboration of Khalid with Amaro's team and other US teams resulted in the award of the Gordon-Bell Special Prize for Covid19 related work. This made mainstream news in the USA. A preprint is available here: doi: https://doi.org/10.1101/2020.11.19.390187 Collaboration of Khalid with Wonpil Im (Lehigh, USA) Khalid has been collaborating with Wonpil Im (Lehigh, USA) to use the hydrogen mass repartitioning technique to enable faster simulations of bacterial membranes. The preprint was uploaded during the reporting period but the paper came out just after so will be included in the next annual report. This work represents an important advancement as it enables users on ARCHER to use longer time steps with the CHARMM force-field thereby speeding up simulations. We expect this study to be highly cited given its methodological utility and that the time-step can be set-up via the free online server CHARMMGUI Collaboration of Khalid with Gizela Storz (NIH, USA) This is a collaboration that is also involves Ben Luisi (Cambridge) and brings together multiple experimental methods with computer simulations to understand protein-protein and protein-lipid interactions in a bacterial membrane environment. • Du D, Neuberger A, Orr MW, Newman CE, Hsu PC, Samsudin F, Szewczak-Harris A, Ramos LM, Debela M, Khalid S, Storz G, Luisi BF (2020), Interactions of a Bacterial RND Transporter with a Transmembrane Small Protein in a Lipid Environment. Structure Collaboration of Sansom with Sudha Chakrapani, (Case Western U, Cleveland, USA), ARCHER allocations through HECBioSim were used for a number of joint studies of ion channel structure function relationships with both collaborators leading to the following publications: • Kumar, A., Basak, S., Rao, S., Gicheru, Y., Mayer, M.L., Sansom, M.S.P. & Chakrapani, S. (2020) Mechanisms of activation and desensitization of full-length glycine receptor in membranes. Nature Communicatons. Collaboration of Sansom, Kalli and Stansfeld with Eiji Yamamoto (Keio University, Japan) and Robert Best (NIH, USA). Further details are in one of our case studies. • Yamamoto E, Domanski J, Naughton FB, Best RB, Kalli AC, Stansfeld PJ, Sansom MSP. (2020) Multiple lipid binding sites determine the affinity of PH domains for phosphoinositide-containing membranes, Science Advances Collaboration of Sansom with David Baker (University of Washington, USA) This is a newly established collaboration in which the combined expertise of the UK and USA groups are being used for de novo design of membrane protein pores. A PhD student (Shanlin Rao) from the Sansom group is being trained as part of the project, which from the UK side relies on HPC provided via HECBioSim. Stansfeld: multiple international collaborations: Jochen Zimmer (Virginia, US) Linda Columbus (Virginia, US) Inga Hanelt (Frankfurt, Germany) • Structural basis of the proton-coupled potassium transport in the KUP family Igor Tascón, Joana Sousa, Robin Corey, Deryck Mills, David Griwatz, Nadine Aumüller, Vedrana Mikusevic, Phillip J. Stansfeld, Janet Vonck, Inga Hanelt, Nature Communications Cristina Paulino (Groningen, Netherlands) John Parkinson (Utah, US) Irina Gutsche (Grenoble, France) • Complete structure of the core signalling unit of the E. coli chemosensory array in an optimised minicell strain. Alister Burt, CK Cassidy, Peter Ames, Maria Bacia-Verloop, Megghane Baulard, Karine Huard, Zaida Luthey-Schulten, Ambroise Desfosses, Phillip J. Stansfeld, William Margolin, John Parkinson, Irina Gutsche. Nature Communications, 2020 Filippo Mancia (Columbia, US) Jonathan Dworkin (Columbia, US) Ed Yu (Case Western, US) Multiple international authors from UK, Slovenia, USA • Structure and mechanism of bactericidal mammalian perforin-2, an ancient agent of innate immunity, Tao Ni, Fang Jiao, Xiulian Yu, SaÅ¡a Aden, Lucy Ginger, Sophie Williams, Fangfang Bai, Vojtech Pražák, Dimple Karia, Phillip Stansfeld, Peijun Zhang, George Munson, Gregor Anderluh, Simon Scheuring, Robert J. C. Gilbert, Science Advances, 2020 Gervasio with Karl-Heinz Krause (Geneva): Collaboration with Prof. Karl-Heinz Krause, faculty of Medicine University of Geneva. Understanding the effect of Mutations in SARS-CoV2 Spike protein. Preprint available at https://www.medrxiv.org/content/10.1101/2020.08.23.20180281v1, submitted for publication). Mulholland with Amaro (UCSD) • Biomolecular simulations for covid applications, highlighting HECBioSim activity, are reviewed in: 'Biomolecular Simulations in the Time of COVID-19, and After' R.E. Amaro and A.J. Mulholland Computing in Science & Engineering Nov.-Dec. 2020, pp. 30-36, vol. 22 DOI: 10.1109/MCSE.2020.3024155 • COVID19 - Computational Chemists Meet the Moment, Adrian J. Mulholland and Rommie E. Amaro J. Chem. Inf. Model. 2020. • Simulations support the interaction of the SARS-CoV-2 spike protein with nicotinic acetylcholine receptors A. Sofia F. Oliveira, Amaurys Avila Ibarra, Isabel Bermudez, Lorenzo Casalino, ZiedGaieb, Deborah K. Shoemark, Timothy Gallagher, Richard B. Sessions, Rommie E. Amaro, Adrian J. Mulholland https://www.biorxiv.org/content/10.1101/2020.07.16.206680v3 Mulholland with multiple international authors: • Free fatty acid binding pocket in the locked structure of SARS-CoV-2 spike protein Science 370 725-730 (2020) https://science.sciencemag.org/content/370/6517/725 Additional time has been provided where needed leading to some large, and important collaborations, for example Vendruscolo is working with the Rosalind Franklin Institute to study the dynamics of antibodies that bind the viral spike protein (a follow on of the project described here: https://www.rfi.ac.uk/engineered-llama-antibodies-neutralise-covid-19-virus/). Both Vendruscolo and Mulholland are currently using time on Bede and ARCHER2 respectively allocated via evaluation by HECBioSim to study the newly emerged mutations in the spike protein (UK and South Africa strains). New industrial and overseas collaborations have been established in which simulations performed on ARCHER are facilitating the development of anti-virals that target the SARS-CoV2 virus. These include: • Gervasio (UCL): collaboration with Mone Zaidi, Mount Sinai School of Medicine and DeepMind - London (Designing therapeutic peptides for COVID). • Biggin (Oxford): collaboration with IBM on designing antivirals targeting SARS-CoV2 enzymes. • Mulholland (Bristol) collaboration with: Geneva Biotech Sàrl and Imophoron Ltd to characterise druggable sites on the SARS-CoV2 Spike protein. |
| Exploitation Route | Our work has resulted in new insights into how and where molecules may bind to SARS-Cov2 proteins which can be used by others to: Develop drugs Look allosteric effects of mutations We have uncovered many details of protein-lipid interactions (see publications) which can be used by those working on imaging membrane proteins to help interpret their imaging data. We have provided design principles of nanopores which are advantageous for applications in bionanotechnology which can be used by others to translate into new pores or to help understand microscopic behaviour of existing pores. |
| Sectors | Chemicals Healthcare Pharmaceuticals and Medical Biotechnology |
| URL | https://doi.org/10.1016/j.jmb.2020.01.008 |
| Description | Sansom (Oxford): UCB have supported a number of BBSRC iCASE studentships in the general area of membrane protein/lipid interactions, all of which have benefitted from ARCHER access. These molecular interactions play a key role in modulating the function of membrane proteins implicated in many diseases, and therefore are of importance to pharmaceutical companies such as UCB. Sansom (Oxford) and Stansfeld (Warwick): IBM support an EPSRC iCASE studentship to examine peptide /lipid interactions which also benefits from ARCHER access. This has resulted in a preprint within the funding period: (which is now published, but just after this reporting period): Biggin (Oxford) We have been extending our work on absolute binding free energies to fragment design with Boehringer Ingelheim, via a fully-funded PDRA. New code has been developed as part of this collaboration, which will be made available soon. An iCASE studentship on membrane proteins is funded by Vertex. Both projects have been built on the successful usage of ARCHER via HECBioSim. Khalid (Oxford) A studentship funded through Oxford Nanopore Technologies Ltd (ONT) relies upon access to ARCHER via HECBioSim. We are simulating the protein used in the commercial devices made by the world leading nanopore DNA sequencing company. Much of the work cannot be published due to the highly sensitive nature of the details of the engineered protein being used. However a paper on a model system has been published as well as second on the wildtype protein complex CsgG/F with the Steve Matthews and Sarah Rouse groups at Imperial College (also HECBioSim members). Discussion is ongoing with ONT for future funding. Essex (Southampton) A PhD student has been exploring the conformations of the recognition loops of antibodies to understand how they achieve such high affinity and specificity. This work is supported by UCB Another PhD student is using ARCHER2 to simulate the protein complex apoferritin in very large volumes of water, to provide model data to explore the accuracy of cryo-EM image reconstruction algorithms. This is in collaboration with the Rosalind Franklin Institute. Publications from both of these projects are in the pipeline. Michel (Edinburgh) PhD student funded by Cresset. The Michel group and Cresset are using HPC provided through HECBioSim to develop machine-learned (ML) models of the difficulty of protein ligand binding Free Energy Perturbation (FEP) calculations. The ML models are used to power new automated FEP workflows for high-throughput studies that support industrial drug design R&D efforts. Fraternali (King's College London). A novel antimicrobial nanocapsule construct able of self assembly. The results are presented in the manuscript under revision "Nanocapsule Designs for Antimicrobial Resistance". The project carried on previous work performed in collaboration with the National Physics Laboratory (UK), and prompted a collaboration with Unilever (UK) for the investigation of analogous antimicrobial peptides interactions and self-assembly. Unilever provided funding for a PDRA, access to ARCHER via HECBioSim played a role in securing the funding and setting up the collaboration. Such molecules are of interest for the company, aiming at incorporating antimicrobial elements into its products. Two manuscripts are resulting from this collaboration, one is in second revision and the other is in preparation. Stansfeld (Warwick) PhD studentship funded by OMASS has yielded the paper: Ligand induced conformational dynamics of the LPS translocon LptDE. Francesco Fiorentino, Joshua B. Sauer, Xing Yu Qiu, Robin A. Corey, C. Keith Cassidy, Benjamin Mynors-Wallis, Shahid Mehmood, Jani Reddy Bolla, Phillip J. Stansfeld, Carol V. Robinson (2020) Nature Chemical Biology. The simulations reported here were performed on ARCHER with access provided by HECBioSim Gervasio (UCL) Collaboration with Ben Cossins (UCB, Slough) on developing accurate algorithms for ligand binding free energy calculations Collaboration with Astra Zeneca (who are co-sponsoring an industrial student with EPSRC): understating the effects of cancer-causing mutations on the structure and dynamics of EGFR. |
| Sector | Chemicals,Healthcare,Pharmaceuticals and Medical Biotechnology |
| Impact Types | Societal |
| Description | Invited contribution to DiRAC workshop on future UKRI Digital Research Infrastructure. |
| Geographic Reach | National |
| Policy Influence Type | Participation in a guidance/advisory committee |
| Description | Presentation to HPC-AI Advisory Council |
| Geographic Reach | Multiple continents/international |
| Policy Influence Type | Participation in a guidance/advisory committee |
| Description | Research data and policy |
| Geographic Reach | National |
| Policy Influence Type | Contribution to new or improved professional practice |
| URL | https://www.chemistryworld.com/news/ukri-finds-itself-in-hot-water-too-over-researchfish-cyberbullyi... |
| Description | UKRI research data capture approaches |
| Geographic Reach | National |
| Policy Influence Type | Contribution to new or improved professional practice |
| URL | https://www.researchprofessionalnews.com/rr-news-uk-research-councils-2023-1-researchfish-tweets-aga... |
| Description | participation at digital research infrastructure meeting |
| Geographic Reach | National |
| Policy Influence Type | Participation in a guidance/advisory committee |
| Description | AMR Global Development Award 2017 |
| Amount | £88,000 (GBP) |
| Funding ID | MR/R014922/1 |
| Organisation | Medical Research Council (MRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 09/2017 |
| End | 03/2018 |
| Description | Action! Modelling DNA nano-machines for deciphering their molecular mechanisms |
| Amount | £62,000 (GBP) |
| Funding ID | EP/N509802/1 |
| Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 09/2019 |
| End | 09/2022 |
| Description | BBSRC Tools and Techniques: Computational tools for enzyme engineering: bridging the gap between enzymologists and expert simulation |
| Amount | £146,027 (GBP) |
| Funding ID | BB/L018756/1 |
| Organisation | Biotechnology and Biological Sciences Research Council (BBSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 06/2014 |
| End | 01/2016 |
| Description | BBSRC sLoLa: Innovative Routes to Monoterpene Hydrocarbons and Their High Value Derivatives |
| Amount | £3,038,984 (GBP) |
| Funding ID | BB/M000354/1 |
| Organisation | Biotechnology and Biological Sciences Research Council (BBSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 09/2010 |
| End | 09/2019 |
| Description | BEORHN: Bacterial Enzymatic Oxidation of Reactive Hydroxylamine in Nitrification via Combined Structural Biology and Molecular Simulation |
| Amount | £181,398 (GBP) |
| Funding ID | BB/V016768/1 |
| Organisation | Biotechnology and Biological Sciences Research Council (BBSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 01/2022 |
| End | 12/2024 |
| Description | Biocatalysis and Biotransformation: A 5th Theme for the National Catalysis Hub |
| Amount | £3,053,639 (GBP) |
| Funding ID | EP/M013219/1 |
| Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 01/2015 |
| End | 12/2019 |
| Description | BristolBridge has contributed to the University of Bristol being the UK's largest recipient of RCUK AMR cross-council funding awards both in number (7) and value (£5.268M). 6 related AMR grants awarded with BristolBridge PIs/Co-Is include MRC-led AMR. |
| Amount | £5,268,000 (GBP) |
| Organisation | Medical Research Council (MRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 09/2016 |
| End | 09/2019 |
| Description | CCP-BioSim: Biomolecular Simulation at the Life Sciences Interface |
| Amount | £235,706 (GBP) |
| Funding ID | EP/M022609/1 |
| Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 06/2015 |
| End | 04/2021 |
| Description | CCP-BioSim: Biomolecular simulation at the life sciences interface |
| Amount | £287,541 (GBP) |
| Funding ID | EP/J010588/1 |
| Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 09/2011 |
| End | 09/2015 |
| Description | CCPBioSim: Biomolecular Simulation at the Life Science Interface |
| Amount | £345,687 (FKP) |
| Funding ID | EP/T026308/1 |
| Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 11/2020 |
| End | 10/2025 |
| Description | Carbapenem Antibiotic Resistance in Enterobacteriaceae: Understanding Interactions of KPC Carbapenemases with Substrates and Inhibitors |
| Amount | £668,396 (GBP) |
| Funding ID | MR/T016035/1 |
| Organisation | Medical Research Council (MRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 01/2020 |
| End | 01/2023 |
| Description | Confidence in Concept 'Developing a mobile device for rapid antimicrobial resistance detection in primary care' |
| Amount | £74,685 (GBP) |
| Organisation | Medical Research Council (MRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 05/2017 |
| End | 05/2018 |
| Description | Development of physics-based computational models for predicting the spatial architecture of bacterial genomes |
| Amount | £61,000 (GBP) |
| Funding ID | EP/R513386/1 |
| Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 09/2017 |
| End | 09/2020 |
| Description | EPSRC |
| Amount | £188,950 (GBP) |
| Funding ID | E/EP/G007705/1 |
| Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 09/2013 |
| End | 03/2014 |
| Description | IAA: Nucleoid-associated proteins in biofilms |
| Amount | £17,000 (GBP) |
| Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 01/2022 |
| End | 07/2022 |
| Description | Inquire: Software for real-time analysis of binding |
| Amount | £105,748 (GBP) |
| Funding ID | BB/K016601/1 |
| Organisation | Biotechnology and Biological Sciences Research Council (BBSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 08/2013 |
| End | 09/2014 |
| Description | Nicotinic Ligand Development to Target Smoking Cessation and Gain a Molecular Level Understanding of Partial Agonism |
| Amount | £724,553 (GBP) |
| Funding ID | EP/N024117/1 |
| Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 05/2016 |
| End | 11/2019 |
| Description | Oracle for Research Cloud Fellowship |
| Amount | $100,000 (USD) |
| Organisation | Oracle Corporation |
| Sector | Private |
| Country | United States |
| Start | 02/2023 |
| End | 12/2023 |
| Description | PREDACTED Predictive computational models for Enzyme Dynamics, Antimicrobial resistance, Catalysis and Thermoadaptation for Evolution and Desig |
| Amount | € 2,482,332 (EUR) |
| Funding ID | 101021207 |
| Organisation | European Research Council (ERC) |
| Sector | Public |
| Country | Belgium |
| Start | 09/2021 |
| End | 09/2026 |
| Description | Synthetic Biology Research Centre. BrisSynBio: Bristol Centre for Synthetic Biology |
| Amount | £13,528,180 (GBP) |
| Funding ID | BB/L01386X/1 |
| Organisation | Biotechnology and Biological Sciences Research Council (BBSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 06/2014 |
| End | 07/2019 |
| Description | UKRI CoA Extension |
| Amount | £24,675 (GBP) |
| Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 09/2020 |
| End | 01/2021 |
| Description | Understanding bacterial DNA gyrase for the development of novel antibiotics |
| Amount | £65,000 (GBP) |
| Funding ID | EP/W524657/1 |
| Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
| Sector | Public |
| Country | United Kingdom |
| Start | 08/2022 |
| End | 09/2022 |
| Description | https://gtr.ukri.org/person/2A2990B1-E1E1-4888-8848-7C256C3A3B43 |
| Amount | £20,009,000 (GBP) |
| Funding ID | https://gtr.ukri.org/person/2A2990B1-E1E1-4888-8848-7C256C3A3B43 |
| Organisation | United Kingdom Research and Innovation |
| Sector | Public |
| Country | United Kingdom |
| Start | 01/2006 |
| End | 02/2033 |
| Title | An expandable, modular de novo protein platform for precision redox engineering |
| Description | Pioneering study signals new era of environment-friendly programmable bioelectronics Image shows the design of a protein nanowire, with the green arrow indicating electron flow. Ross Anderson Image shows structural analysis of the protein-based wire, comparing the model of the designed protein (shown in red) with the experimentally determined structure (in grey). Ross Anderson Press release issued: 25 July 2023 Researchers have created a unique microscopic toolkit of 'green' tuneable electrical components, paving the way for a new generation of bioelectronic devices and sensors. The University of Bristol-led study, published today in The Proceedings of the National Academy of Sciences (PNAS), demonstrates how to make conductive, biodegradable wires from designed proteins. These could be compatible with conventional electronic components made from copper or iron, as well as the biological machinery responsible for generating energy in all living organisms. The miniscule wires are the size of transistors on silicon chips or one thousandth of the breadth of the finest human hair. They are made completely of natural amino acids and heme molecules, found in proteins such as hemoglobin, which transports oxygen in red blood cells. Harmless bacteria were used for their manufacture, eliminating the need for potentially complex and environmentally damaging procedures commonly used in the production of synthetic molecules. Lead author Ross Anderson, Professor of Biological Chemistry at the University of Bristol, said: "While our designs take inspiration from the protein-based electronic circuits necessary for all life on Earth, they are free from much of the complexity and instability that can prevent the exploitation of their natural equivalents on our own terms. We can also build these minute electronic components to order, specifying their properties in a way that is not possible with natural proteins." Leading experts in biomolecular engineering and simulation worked together to produce this unique new method of designing tailor-made proteins with tuneable electronic properties. The multidisciplinary team used advanced computational tools to design simple building blocks that could be combined into longer, wire-like protein chains for conducting electrons. They were able to visualise the structures of these wires using protein X-ray crystallography and electron cryo-microscopy (cryo-EM), techniques which allow structures to be viewed in the finest detail. Pushing the technical boundaries of cryo-EM, images of the smallest individual protein ever studied were obtained with this technique. Ultimately, these nanoscale designer wires have the potential to be used in a wide range of applications, including biosensors for the diagnosis of diseases and detection of environmental pollutants. It is also hoped this invention will form the foundation of new electrical circuits for creating tailor-made catalysts for green industrial biotechnology and artificial photosynthetic proteins for capturing solar energy. The breakthrough was possible thanks to a £4.9 million grant from the Biotechnology and Biological Science Research Council (BBSRC), the UK's largest bioscience funder, which resulted in a five-year project entitled 'The Circuits of Life' involving the Universities of Bristol, Portsmouth, East Anglia, and University College London (UCL). The team harnessed their expertise in protein design, electron transfer, biomolecular simulation, structural biology and spectroscopy, gaining insight into how electrons flow through natural biological molecules, a fundamental process which underpins cellular respiration and photosynthesis. Further advances are expected as the project, which began last year, progresses, presenting significant opportunities to help meet the transition to net zero and more sustainable industrial processes. Co-author Adrian Mulholland, Professor of Chemistry at the University of Bristol, said: "These proteins show how protein design is increasingly delivering practically useful tools. They offer exciting possibilities as components for engineering biology and also are great systems for investigating the fundamental mechanisms of biological electron transfer." Paper 'An expandable, modular de novo protein platform for precision redox engineering' by George H. Hutchins, Claire E.M. Noble, Adrian Bunzel et al published in PNAS |
| Type Of Material | Technology assay or reagent |
| Year Produced | 2023 |
| Provided To Others? | Yes |
| Impact | Pioneering study signals new era of environment-friendly programmable bioelectronics Image shows the design of a protein nanowire, with the green arrow indicating electron flow. Ross Anderson Image shows structural analysis of the protein-based wire, comparing the model of the designed protein (shown in red) with the experimentally determined structure (in grey). Ross Anderson Press release issued: 25 July 2023 Researchers have created a unique microscopic toolkit of 'green' tuneable electrical components, paving the way for a new generation of bioelectronic devices and sensors. The University of Bristol-led study, published today in The Proceedings of the National Academy of Sciences (PNAS), demonstrates how to make conductive, biodegradable wires from designed proteins. These could be compatible with conventional electronic components made from copper or iron, as well as the biological machinery responsible for generating energy in all living organisms. The miniscule wires are the size of transistors on silicon chips or one thousandth of the breadth of the finest human hair. They are made completely of natural amino acids and heme molecules, found in proteins such as hemoglobin, which transports oxygen in red blood cells. Harmless bacteria were used for their manufacture, eliminating the need for potentially complex and environmentally damaging procedures commonly used in the production of synthetic molecules. Lead author Ross Anderson, Professor of Biological Chemistry at the University of Bristol, said: "While our designs take inspiration from the protein-based electronic circuits necessary for all life on Earth, they are free from much of the complexity and instability that can prevent the exploitation of their natural equivalents on our own terms. We can also build these minute electronic components to order, specifying their properties in a way that is not possible with natural proteins." Leading experts in biomolecular engineering and simulation worked together to produce this unique new method of designing tailor-made proteins with tuneable electronic properties. The multidisciplinary team used advanced computational tools to design simple building blocks that could be combined into longer, wire-like protein chains for conducting electrons. They were able to visualise the structures of these wires using protein X-ray crystallography and electron cryo-microscopy (cryo-EM), techniques which allow structures to be viewed in the finest detail. Pushing the technical boundaries of cryo-EM, images of the smallest individual protein ever studied were obtained with this technique. Ultimately, these nanoscale designer wires have the potential to be used in a wide range of applications, including biosensors for the diagnosis of diseases and detection of environmental pollutants. It is also hoped this invention will form the foundation of new electrical circuits for creating tailor-made catalysts for green industrial biotechnology and artificial photosynthetic proteins for capturing solar energy. The breakthrough was possible thanks to a £4.9 million grant from the Biotechnology and Biological Science Research Council (BBSRC), the UK's largest bioscience funder, which resulted in a five-year project entitled 'The Circuits of Life' involving the Universities of Bristol, Portsmouth, East Anglia, and University College London (UCL). The team harnessed their expertise in protein design, electron transfer, biomolecular simulation, structural biology and spectroscopy, gaining insight into how electrons flow through natural biological molecules, a fundamental process which underpins cellular respiration and photosynthesis. Further advances are expected as the project, which began last year, progresses, presenting significant opportunities to help meet the transition to net zero and more sustainable industrial processes. Co-author Adrian Mulholland, Professor of Chemistry at the University of Bristol, said: "These proteins show how protein design is increasingly delivering practically useful tools. They offer exciting possibilities as components for engineering biology and also are great systems for investigating the fundamental mechanisms of biological electron transfer." Paper 'An expandable, modular de novo protein platform for precision redox engineering' by George H. Hutchins, Claire E.M. Noble, Adrian Bunzel et al published in PNAS |
| URL | https://www.bristol.ac.uk/news/2023/july/protein-nanowires.html |
| Title | COVID is airborne |
| Description | Multscale model of SARS-CoV-2 in respiratory aerosol |
| Type Of Material | Improvements to research infrastructure |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| Impact | HPC2021 Gordon Bell Special Prize Finalist Skip to main content U.S. flagAn official website of the United States government Here's how you know NIH NLM Logo Log in Access keysNCBI HomepageMyNCBI HomepageMain ContentMain Navigation Search PMC Full-Text Archive Search in PMC Run this search in PubMed Advanced Search User Guide Journal List SAGE Choice PMC9527558 Logo of sageopen The International Journal of High Performance Computing Applications Int J High Perform Comput Appl. 2023 Jan; 37(1): 28-44. Published online 2022 Oct 2. doi: 10.1177/10943420221128233 PMCID: PMC9527558 PMID: 36647365 #COVIDisAirborne: AI-enabled multiscale computational microscopy of delta SARS-CoV-2 in a respiratory aerosol Monitoring Editor: Mark Parsons Abigail Dommer,1,†Lorenzo Casalino,1,†Fiona Kearns,1,†Mia Rosenfeld,1 Nicholas Wauer,1 Surl-Hee Ahn,1 John Russo,2 Sofia Oliveira,3 Clare Morris,1 Anthony Bogetti,4 Anda Trifan,5,6 Alexander Brace,5,7 Terra Sztain,1,8 Austin Clyde,5,7 Heng Ma,5 Chakra Chennubhotla,4 Hyungro Lee,9 Matteo Turilli,9 Syma Khalid,10 Teresa Tamayo-Mendoza,11 Matthew Welborn,11 Anders Christensen,11 Daniel GA Smith,11 Zhuoran Qiao,12 Sai K Sirumalla,11 Michael O'Connor,11 Frederick Manby,11 Anima Anandkumar,12,13 David Hardy,6 James Phillips,6 Abraham Stern,13 Josh Romero,13 David Clark,13 Mitchell Dorrell,14 Tom Maiden,14 Lei Huang,15 John McCalpin,15 Christopher Woods,3 Alan Gray,13 Matt Williams,3 Bryan Barker,16 Harinda Rajapaksha,16 Richard Pitts,16 Tom Gibbs,13 John Stone,6,13 Daniel M. Zuckerman,2 Adrian J. Mulholland,3 Thomas Miller, III,11,12 Shantenu Jha,9 Arvind Ramanathan,5 Lillian Chong,4 and Rommie E Amaro1 Author information Copyright and License information Disclaimer Previous version available: This article is based on a previously available preprint: "#COVIDisAirborne: AI-Enabled Multiscale Computational Microscopy of Delta SARS-CoV-2 in a Respiratory Aerosol". Go to: Abstract We seek to completely revise current models of airborne transmission of respiratory viruses by providing never-before-seen atomic-level views of the SARS-CoV-2 virus within a respiratory aerosol. Our work dramatically extends the capabilities of multiscale computational microscopy to address the significant gaps that exist in current experimental methods, which are limited in their ability to interrogate aerosols at the atomic/molecular level and thus obscure our understanding of airborne transmission. We demonstrate how our integrated data-driven platform provides a new way of exploring the composition, structure, and dynamics of aerosols and aerosolized viruses, while driving simulation method development along several important axes. We present a series of initial scientific discoveries for the SARS-CoV-2 Delta variant, noting that the full scientific impact of this work has yet to be realized. Keywords: molecular dynamics, deep learning, multiscale simulation, weighted ensemble, computational virology, SARS-CoV-2, aerosols, COVID-19, HPC, AI, GPU, Delta Go to: Justification We develop a novel HPC-enabled multiscale research framework to study aerosolized viruses and the full complexity of species that comprise them. We present technological and methodological advances that bridge time and length scales from electronic structure through whole aerosol particle morphology and dynamics. Performance attributes Performance attribute Our submission Category of achievement Scalability, Time-to-solution Type of method used Explicit, Deep Learning Results reported on the basis of Whole application including I/O Precision reported Mixed Precision System scale Measured on full system Measurement mechanism Hardware performance counters Application timers Performance Modeling Open in a separate window Overview of the problem Respiratory pathogens, such as SARS-CoV-2 and influenza, are the cause of significant morbidity and mortality worldwide. These respiratory pathogens are spread by virus-laden aerosols and droplets that are produced in an infected person, exhaled, and transported through the environment (Wang et al., 2021) (Figure 1). Medical dogma has long focused on droplets as the main transmission route for respiratory viruses, where either a person has contact with an infected surface (fomites) or direct droplet transmission by close contact with an infected individual. However, as we continue to observe with SARS-CoV-2, airborne transmission also plays a significant role in spreading disease. We know this from various super spreader events, for example, during a choir rehearsal (Miller et al., 2021). Intervention and mitigation decisions, such as the relative importance of surface cleaning or whether and when to wear a mask, have unfortunately hinged on a weak understanding of aerosol transmission, to the detriment of public health. An external file that holds a picture, illustration, etc. Object name is 10.1177_10943420221128233-fig1.jpg Figure 1. Overall schematic depicting the construction and multiscale simulations of Delta SARS-CoV-2 in a respiratory aerosol. (N.B.: The size of di-valent cations has been increased for visibility.) A central challenge to understanding airborne transmission has been the inability of experimental science to reliably probe the structure and dynamics of viruses once they are inside respiratory aerosol particles. Single particle experimental methods have poor resolution for smaller particles (<1 micron) and are prone to sample destruction during collection. Airborne viruses are present in low concentrations in the air and are similarly prone to viral inactivation during sampling. In addition, studies of the initial infection event, for example, in the deep lung, are limited in their ability to provide a detailed understanding of the myriad of molecular interactions and dynamics taking place in situ. Altogether, these knowledge gaps hamper our collective ability to understand mechanisms of infection and develop novel effective antivirals, as well as prevent us from developing concrete, science-driven mitigation measures (e.g., masking and ventilation protocols). Here, we aim to reconceptualize current models of airborne transmission of respiratory viruses by providing never-before-seen views of viruses within aerosols. Our approach relies on the use of all-atom molecular dynamics (MD) simulations as a multiscale "computational microscope." MD simulations can synthesize multiple types of biological data (e.g., multiresolution structural datasets, glycomics, lipidomics, etc.) into cohesive, biologically "accurate" structural models. Once created, we then approximate the model down to its many atoms, creating trajectories of its time dependent dynamics under cell-like (or in this case, aerosol-like) conditions. Critically, MD simulations are more than just "pretty movies." MD equations are solved in a theoretically rigorous manner, allowing us to compute experimentally testable macroscopic observables from time-averaged microscopic properties. What this means is that we can directly connect MD simulations with experiments, each validating and providing testable hypotheses to the other, which is the real power of the approach. An ongoing challenge to the successful application of such methods, however, is the need for technological and methodological advances that make it possible to access length scales relevant to the study of large, biologically complex systems (spanning nanometers to ~one micron in size) and, correspondingly, longer timescales (microseconds to seconds). Such challenges and opportunities manifest in the study of aerosolized viruses. Aerosols are generally defined as being less than 5 microns in diameter, able to float in the air for hours, travel significant distances (i.e., can fill a room, like cigarette smoke), and be inhaled. Fine aerosols < 1 micron in size can stay in the air for over 12 h and are enriched with viral particles (Fennelly 2020; Coleman et al., 2021). Our work focuses on these finer aerosols that travel deeper into the respiratory tract. Several studies provide the molecular recipes necessary to reconstitute respiratory aerosols according to their actual biologically relevant composition (Vejerano and Marr 2018; Walker et al., 2021). These aerosols can contain lipids, cholesterol, albumin (protein), various mono- and di-valent salts, mucins, other surfactants, and water (Figure 1). Simulations of aerosolized viruses embody a novel framework for the study of aerosols: they will allow us and others to tune different species, relative humidity, ion concentrations, etc. to match experiments that can directly and indirectly connect to and inform our simulations, as well as test hypotheses. Some of the species under study here, for example, mucins, have not yet been structurally characterized or explored with simulations and thus the models we generate are expected to have impact beyond their roles in aerosols. In addition to varying aerosol composition and size, the viruses themselves can be modified to reflect new variants of concern, where such mutations may affect interactions with particular species in the aerosol that might affect its structural dynamics and/or viability. The virion developed here is the Delta variant (B.1.617.2 lineage) of SARS-CoV-2 (Figure 2), which presents a careful integration of multiple biological datasets: (1) a complete viral envelope with realistic membrane composition, (2) fully glycosylated full-length spike proteins integrating 3D structural coordinates from multiple cryoelectron microscopy (cryoEM) studies (McCallum et al., 2021; Wrapp et al., 2020; Walls et al., 2020; Bangaru et al., 2020) (3) all biologically known features (post-translational modifications, palmitoylation, etc.), (4) any other known membrane proteins (e.g. the envelope (E) and membrane (M) proteins), and (5) virion size and patterning taken directly from cryoelectron tomography (cryoET). Each of the individual components of the virus are built up before being integrated into the composite virion, and thus represent useful molecular-scale scientific contributions in their own right (Casalino et al., 2020; Sztain et al., 2021). An external file that holds a picture, illustration, etc. Object name is 10.1177_10943420221128233-fig2.jpg Figure 2. Individual protein components of the SARS-CoV-2 Delta virion. The spike is shown with the surface in cyan and with Delta's mutated residues and deletion sites highlighted in pink and yellow, respectively. Glycans attached to the spike are shown in blue. The E protein is shown in yellow and the M-protein is shown in silver and white. Visualized with VMD. Altogether in this work, we dramatically extend the capabilities of data-driven, multiscale computational microscopy to provide a new way of exploring the composition, structure, and dynamics of respiratory aerosols. While a seemingly limitless number of putative hypotheses could result from these investigations, the first set of questions we expect to answer are: How does the virus exist within a droplet of the same order of magnitude in size, without being affected by the air-water interface, which is known to destroy molecular structure (D'Imprima et al. 2019)? How does the biochemical composition of the droplet, including pH, affect the structural dynamics of the virus? Are there species within the aerosols that "buffer" the viral structure from damage, and are there particular conditions under which the impact of those species changes? Our simulations can also provide specific parameters that can be included in physical models of aerosols, which still assume a simple water or water-salt composition even though it is well known that such models, for example, using kappa-Kohler theory, break down significantly as the molecular species diversify (Petters and Kreidenweis 2007). Go to: Current state of the art Current experimental methods are unable to directly interrogate the atomic-level structure and dynamics of viruses and other molecules within aerosols. Here we showcase computational microscopy as a powerful tool capable to overcome these significant experimental limitations. We present the major elements of our multiscale computational microscope and how they come together in an integrated manner to enable the study of aerosols across multiple scales of resolution. We demonstrate the impact such methods can bring to bear on scientific challenges that until now have been intractable, and present a series of new scientific discoveries for SARS-CoV-2. Parallel molecular dynamics All-atom molecular dynamics simulation has emerged as an increasingly powerful tool for understanding the molecular mechanisms underlying biophysical behaviors in complex systems. Leading simulation engines, NAMD (Phillips et al., 2020), AMBER (Case et al. [n. d.]), and GROMACS (Páll et al., 2020), are broadly useful, with each providing unique strengths in terms of specific methods or capabilities as required to address a particular biological question, and in terms of their support for particular HPC hardware platforms. Within the multiscale computational microscopy platform developed here, we show how each of these different codes contributes different elements to the overall framework, oftentimes utilizing different computing modalities/architectures, while simultaneously extending on state-of-the-art for each. Structure building, simulation preparation, visualization, and post hoc trajectory analysis are performed using VMD on both local workstations and remote HPC resources, enabling modeling of the molecular systems studied herein (Humphrey et al., 1996; Stone et al., 2013a,b, 2016b; Sener et al., 2021). We show how further development of each of these codes, considered together within the larger-scale collective framework, enables the study of SARS-CoV-2 in a wholly novel manner, with extension to numerous other complex systems and diseases. AI-enhanced WE simulations Because the virulence of the Delta variant of SARS-CoV-2 may be partly attributable to spike protein (S) opening, it is of pressing interest to characterize the mechanism and kinetics of the process. Although S-opening in principle can be studied via conventional MD simulations, in practice the system complexity and timescales make this wholly intractable. Splitting strategies that periodically replicate promising MD trajectories, among them the weighted ensemble (WE) method (Huber and Kim 1996; Zuckerman and Chong 2017), have enabled simulations of the spike opening of WT SARS-CoV-2 (Sztain et al., 2021; Zimmerman et al., 2021). WE simulations can be orders of magnitude more efficient than conventional MD in generating pathways and rate constants for rare events (e.g. protein folding (Adhikari et al., 2019) and binding (Saglam and Chong 2019)). The WESTPA software for running WE (Zwier et al., 2015) is well-suited for high-performance computing with nearly perfect CPU/GPU scaling. The software is interoperable with any dynamics engine, including the GPU-accelerated AMBER dynamics engine (Salomon-Ferrer et al., 2013) that is used here. As shown below, major upgrades to WESTPA (v. 2.0) have enabled a dramatic demonstration of spike opening in the Delta variant (Figures 5 and and6)6) and exponentially improved analysis of spike-opening kinetics (Russo et al., 2022). An external file that holds a picture, illustration, etc. Object name is 10.1177_10943420221128233-fig5.jpg Figure 5. Delta variant spike opening from WE simulations, and AI/haMSM analysis. A) The integrated workflow. B) Snapshots of the "down," "up," and "open" states for Delta S-opening from a representative pathway generated by WE simulation, which represents ~ 105 speedup compared to conventional MD. C) Rate constant estimation with haMSM analysis of WE data (purple lines) significantly improves direct WE computation (red), by comparison to experimental measurement (black dashed). Varying haMSM estimates result from different featurizations which will be individually cross-validated. D) The first three dimensions of the ANCA-AE embeddings depict a clear separation between the closed (darker purple) and open (yellow) conformations of the Delta spike. A sub-sampled landscape is shown here where each sphere represents a conformation from the WE simulations and colored with the root-mean squared deviations (Å) with respect to the closed state. Visualized with VMD. An external file that holds a picture, illustration, etc. Object name is 10.1177_10943420221128233-fig6.jpg Figure 6. WE simulations reveal a dramatic opening of the Delta S (cyan), compared to WT S (white). While further investigation is needed, this super open state seen in the Delta S may indicate increased capacity for binding to human host-cell receptors. The integration of AI techniques with WE can further enhance the efficiency of sampling rare events (Noe 2020; Brace et al., 2021b; Casalino et al., 2021). One frontier area couples unsupervised linear and non-linear dimensionality reduction methods to identify collective variables/progress coordinates in high-dimensional molecular systems (Bhowmik et al., 2018; Clyde et al., 2021). Such methods may be well suited for analyzing the aerosolized virus. Integrating these approaches with WE simulations is advantageous in sampling the closed ? open transitions in the Delta S landscape (Figure 5) as these unsupervised AI approaches automatically stratify progress coordinates (Figure 5(D)). Dynamical non-equilibrium MD Aerosols rapidly acidify during flight via reactive uptake of atmospheric gases, which is likely to impact the opening/closing of the S protein (Vejerano and Marr 2018; Warwicker 2021). Here, we describe the extension of dynamical non-equilibrium MD (D-NEMD) (Ciccotti and Ferrario 2016) to investigate pH effects on the Delta S. D-NEMD simulations (Ciccotti and Ferrario 2016) are emerging as a useful technique for identifying allosteric effects and communication pathways in proteins (Galdadas et al., 2021; Oliveira et al., 2019), including recently identifying effects of linoleic acid in the WT spike (Oliveira et al., 2021b). This approach complements equilibrium MD simulations, which provide a distribution of configurations as starting points for an ensemble of short non-equilibrium trajectories under the effect of the external perturbation. The response of the protein to the perturbation introduced can then be determined using the Kubo-Onsager relation (Oliveira et al., 2021a; Ciccotti and Ferrario 2016) by directly tracking the change in atomic positions between the equilibrium and non-equilibrium simulations at equivalent points in time (Oliveira et al., 2021a). OrbNet Ca2+ ions are known to play a key role in mucin aggregation in epithelial tissues (Hughes et al., 2019). Our RAV simulations would be an ideal case-study to probe such complex interactions between Ca2+, mucins, and the SARS-CoV-2 virion in aerosols. However, Ca2+ binding energies can be difficult to capture accurately due to electronic dispersion and polarization, terms which are not typically modeled in classical mechanical force fields. Quantum mechanical (QM) methods are uniquely suited to capture these subtle interactions. Thus, we set out to estimate the correlation in Ca2+ binding energies between CHARMM36m and quantum mechanical estimates enabled via AI with OrbNet. Calculation of energies with sufficient accuracy in biological systems can, in many cases, be adequately described with density functional theory (DFT). However, its high cost limits the applicability of DFT in comparison to fixed charge force fields. To capture quantum quality energetics at a fraction of the computational expense, we employ a novel approach (OrbNet) based on the featurization of molecules in terms of symmetry-adapted atomic orbitals and the use of graph neural network methods for deep learning quantum-mechanical properties (Qiao et al., 2020). Our method outperforms existing methods in terms of its training efficiency and transferable accuracy across diverse molecular systems, opening a new pathway for replacing DFT in large-scale scientific applications such as those explored here. (Christensen et al., 2021). Innovations realized Construction and simulation of SARS-CoV-2 in a respiratory aerosol Our approach to simulating the entire aerosol follows a composite framework wherein each of the individual molecular pieces is refined and simulated on its own before it is incorporated into the composite model. Simulations of each of the components are useful in their own right, and often serve as the basis for biochemical and biophysical validation and experiments (Casalino et al., 2020). Throughout, we refer to the original circulating SARS-CoV-2 strain as "WT," whereas all SARS-CoV-2 proteins constructed in this work represent the Delta variant (Figure 2). All simulated membranes reflect mammalian ER-Golgi intermediate compartment (ERGIC) mimetic lipid compositions. VMD (Humphrey et al., 1996; Stone et al., 2016a), psfgen (Phillips et al., 2005), and CHARMM-GUI (Park et al., 2019) were used for construction and parameterization. Topologies and parameters for simulations were taken from CHARMM36m all-atom additive force fields (Guvench et al., 2009; Huang and Mackerell 2013; Huang et al., 2017; Klauda et al., 2010; Beglov and Roux 1994; Han et al., 2018; Venable et al., 2013). NAMD was used to perform MD simulations (Phillips et al., 2020), adopting similar settings and protocols as in (Casalino et al., 2020). All systems underwent solvation, charge neutralization, minimization, heating, and equilibration prior to production runs. Refer to Table 1 for Abbreviations, PBC dimensions, total number of atoms, and total equilibration times for each system of interest. Table 1. Summary of all systems constructed in this work. See Figure 3 for illustration of aerosol construction. asystems bAbb c(Å × Å × Å) dNa e (ns) fM dimers M 125 × 125 × 124 164,741 700 fE pentamers E 123 × 125 × 102 136,775 41 Spikes f (Open) S 206 × 200 × 410 1,692,444 330 f (Closed) S 204 × 202 × 400 1,658,224 330 g (Closed, head) SH 172 × 184 × 206 615,593 73µs Mucins fshort mucin 1 m1 123 × 104 × 72 87,076 25 fshort mucin 2 m2 120 × 101 × 72 82,155 25 flong mucin 1 m3 810 × 104 × 115 931,778 23 flong mucin 2 m4 904 × 106 × 109 997,029 15 flong mucin 3 m5 860 × 111 × 113 1,040,215 18 fS+m1/m2+ALB SMA 227 × 229 × 433 2,156,689 840 fVirion V 1460 × 1460 × 1460 305,326,834 41 fResp.Aero.+Vir RAV 2834 × 2820 × 2828 1,016,813,441 2.42 Total FLOPS 2.4 ZFLOPS Open in a separate window aM, E, S, SH, and V models represent SARS-CoV-2 Delta strain. bAbbreviations used throughout document. cPeriodic boundary dimensions. dTotal number of atoms. eTotal aggregate simulation time, including heating and equilibration runs. fSimulated with NAMD. gSimulated with NAMD, AMBER, and GROMACS. Simulating the SARS-CoV-2 structural proteins Fully glycosylated Delta spike (S) structures in open and closed conformations were built based on WT constructs from Casalino et al. (Casalino et al., 2020) with the following mutations: T19R, T95I, G142D, E156G, ?157-158, L452R, T478K, D614G, P681R, and D950N (McCallum et al., 2021; Kannan et al., 2021). Higher resolved regions were grafted from PDB 7JJI (Bangaru et al., 2020). Additionally, coordinates of residues 128-167-accounting for a drastic conformational change seen in the Delta variant S-graciously made available to us by the Veesler Lab, were similarly grafted onto our constructs (McCallum et al., 2021). Finally, the S proteins were glycosylated following work by Casalino et al. (Casalino et al., 2020). By incorporating the Veesler Lab's bleeding-edge structure (McCallum et al., 2021) and highly resolved regions from 7JJI (Bangaru et al., 2020), our models represent the most complete and accurate structures of the Delta S to date. The S proteins were inserted into membrane patches and equilibrated for 3 × 110 ns. For non-equilibrium and weighted ensemble simulations, a closed S head (SH, residues 13-1140) was constructed by removing the stalk from the full-length closed S structure, then resolvated, neutralized, minimized, and subsequently passed to WE and D-NEMD teams. The M-protein was built from a structure graciously provided by the Feig Lab (paper in prep). The model was inserted into a membrane patch and equilibrated for 700 ns. RMSD-based clustering was used to select a stable starting M-protein conformation. From the equilibrated and clustered M structure, VMD's Mutator plugin (Humphrey et al., 1996) was used to incorporate the I82T mutation onto each M monomer to arrive at the Delta variant M. To construct the most complete E protein model to-date, the structure was patched together by resolving incomplete PDBs 5X29 (Surya et al., 2018), 7K3G (Mandala et al., 2020) and 7M4R (Chai et al., 2021). To do so, the transmembrane domain (residues 8-38) from 7K3G were aligned to the N-terminal domain (residues 1-7) and residues 39 to 68 of 5X29 and residues 69 to 75 of 7M4R by their Ca atoms. E was then inserted into a membrane patch and equilibrated for 40 ns. Constructing the SARS-CoV-2 Delta virion The SARS-CoV-2 Delta virion (V) model was constructed following Casalino et al. (Casalino et al., 2021) using CHARMM-GUI (Lee et al., 2016), LipidWrapper (Durrant and Amaro 2014), and Blender (Blender Online Community 2020), using a 350 Å lipid bilayer with an equilibrium area per lipid of 63 Å2 and a 100 nm diameter Blender icospherical surface mesh (Turonova et al., 2020). The resulting lipid membrane was solvated in a 1100 Å3 waterbox and subjected to four rounds of equilibration and patching (Casalino et al., 2021). 360 m dimers and 4 E pentamers were then tiled onto the surface, followed by random placement of 29 full-length S proteins (9 open, 20 closed) according to experimentally observed S protein density (Ke et al., 2020). M and E proteins were oriented with intravirion C-termini. After solvation in a 1460 Å waterbox, the complete V model tallied >305 million atoms (Table 1). V was equilibrated for 41 ns prior to placement in the respiratory aerosol (RA) model. The equilibrated membrane was 90 nm in diameter and remains in close structural agreement with the experimental studies (Ke et al., 2020). Building and simulating the respiratory aerosol Respiratory aerosols contain a complex mixture of chemical and biological species. We constructed a respiratory aerosol (RA) fluid based on a composition from artificial saliva and surrogate deep lung fluid recipes (Walker et al., 2021). This recipe includes 0.7 mm DPPG, 6.5 mm DPPC, 0.3 mm cholesterol, 1.4 mm Ca2+, 0.8 mm Mg2+, and 142 mm Na+ (Vejerano and Marr 2018; Walker et al., 2021), human serum albumin (ALB) protein, and a composition of mucins (Figure 3). Mucins are long polymer-like structures that are decorated by dense, heterogeneous, and complex regions of O-glycans. This work represents the first of its kind as, due to their complexity, the O-glycosylated regions of mucins have never before been constructed for molecular simulations. Two short (m1, m2, ~5 nm) and three long (m3, m4, m5 ~55 nm) mucin models were constructed following known experimental compositions of protein and glycosylation sequences (Symmes et al., 2018; Hughes et al., 2019; Markovetz et al., 2019; Thomsson et al., 2005; Mariethoz et al., 2018) with ROSETTA (Raveh et al., 2010) and CHARMM-GUI Glycan Modeller (Jo et al., 2011). Mucin models (short and long) were solvated, neutralized by charge matching with Ca2+ ions, minimized, and equilibrated for 15-25 ns each (Table 1). Human serum albumin (ALB), which is also found in respiratory aerosols, was constructed from PDB 1AO6 (Sugio et al., 1999). ALB was solvated, neutralized, minimized, and equilibrated for 7ns. Equilibrated structures of ALB and the three long mucins were used in construction of the RAV with m3+m4+m5 added at 6 g/mol and ALB at 4.4 g/mol. An external file that holds a picture, illustration, etc. Object name is 10.1177_10943420221128233-fig3.jpg Figure 3. Image of RAV with relative mass ratios of RA molecular components represented in the colorbar. Water content is dependent on the relative humidity of the environment and is thus omitted from the molecular ratios. Constructing the respiratory aerosolized virion model A 100 nm cubic box with the RA fluid recipe specified above was built with PACKMOL (Martínez et al., 2009), minimized, equilibrated briefly on TACC Frontera, then replicated to form a 300 nm cube. The RA box was then carved into a 270 nm diameter sphere. To make space for the placement of V within the RA, a spherical selection with volume corresponding to that of the V membrane + S crown (radius 734 Å) was deleted from the center of the RA. The final equilibrated V model, including surrounding equilibrated waters and ions (733 Å radius), was translated into the RA. Atom clashes were resolved using a 1.2 Å cutoff. Hydrogen mass repartitioning (Hopkins et al., 2015) was applied to the structure to improve performance. The simulation box was increased to 2800 Å per side to provide a 100 Å vacuum atmospheric buffer. The RAV simulation was conducted in an NVT ensemble with a 4 fs timestep. After minimizing, the RAV was heated to 298 K with 0.1 kcal/mol Å2 restraints on the viral lipid headgroups, then equilibrated for 1.5 ns. Finally, a cross-section of the RAV model-including and open S, m1/m2, and ALB (called the SMA system)-was constructed with PACKMOL to closely observe atomic scale interactions within the RAV model (Figure 4). An external file that holds a picture, illustration, etc. Object name is 10.1177_10943420221128233-fig4.jpg Figure 4. SMA system captured with multiscale modeling from classical MD to AI-enabled quantum mechanics. For all panels: S protein shown in cyan, S glycans in blue, m1/m2 shown in red, ALB in orange, Ca2+ in yellow spheres, viral membrane in purple. A) Interactions between mucins and S facilitated by glycans and Ca2+. B) Snapshot from SMA simulations. C) Example Ca2+ binding site from SMA simulations (1800 sites, each 1000+ atoms) used for AI-enabled quantum mechanical estimates from OrbNet Sky. D) Quantification of contacts between S and mucin from SMA simulations. E) OrbNet Sky energies versus CHARMM36m energies for each sub-selected system, colored by total number of atoms. Performance of OrbNet Sky versus DFT in subplot (?B97x-D3/def-TZVP, R2=0.99, for 17 systems of peptides chelating Ca2+ (Hu et al., 2021)). Visualized with VMD. Parameter evaluation with OrbNet Comparison to quantum methods reveals significant polarization effects, and shows that there is opportunity to improve the accuracy of fixed charge force fields. For the large system sizes associated with solvated Ca2+-protein interaction motifs (over 1000 atoms, even in aggressively truncated systems), conventional quantum mechanics methods like density functional theory (DFT) are impractical for analyzing a statistically significant ensemble of distinct configurations (see discussion in Performance Results). In contrast, OrbNet allows for DFT accuracy with over 1000-fold speedup, providing a useful method for benchmarking and refining the force field simulation parameters with quantum accuracy (Christensen et al., 2021). To confirm the accuracy of OrbNet versus DFT (?B97X-D/def2-TZVP), the inset of Figure 4(E) correlates the two methods for the Ca2+-binding energy in a benchmark dataset of small Ca2+-peptide complexes (Hu et al., 2021). The excellent correlation of OrbNet and DFT for the present use case is clear from the inset figure; six datapoints were removed from this plot on the basis of a diagnostic applied to the semi-empirical GFN-xTB solution used for feature generation of OrbNet (Christensen et al., 2021). Figure 4 presents a comparison of the validated OrbNet method with the CHARMM36m force field for 1800 snapshots taken from the SMA MD simulations. At each snapshot, a subsystem containing a solvated Ca2+-protein complex was extracted (Figure 4(E)), with protein bonds capped by hydrogens. For both OrbNet and the force field, the Ca2+-binding energy was computed and shown in the correlation plot. Lack of correlation between OrbNet and the force field identifies important polarization effects, absent in a fixed charge description. Similarly, the steep slope of the best-fit line in Figure 4(E) reflects the fact that some of the configurations sampled using MD with the CHARMM36m force field are relatively high in energy according to the more accurate OrbNet potential. This approach allows us to test and quantify limitations of empirical force fields, such as lack of electronic polarization. The practicality of OrbNet for these simulation snapshots with 1000+ atoms offers a straightforward multiscale strategy for refining the accuracy of the CHARMM36m force field. By optimizing the partial charges and other force field parameters, improved correlation with OrbNet for the subtle Ca2+-protein interactions could be achieved, leading to near-quantum accuracy simulations with improved configurational sampling. The calculations presented here present a proof-of-concept of this iterative strategy. AI-WE simulations of delta spike opening While our previous WE simulations of the WT SARS-CoV-2 S-opening (Sztain et al., 2021) were notable in generating pathways for a seconds-timescale process of a massive system, we have made two critical technological advancements in the WESTPA software that greatly enhance the efficiency and analysis of WE simulations. These advances enabled striking observations of Delta variant S opening (Figures 5 and and6).6). First, in contrast to prior manual bins for controlling trajectory replication, we have developed automated and adaptive binning that enables more efficient surmounting of large barriers via early identification of "bottleneck" regions (Torrillo et al., 2021). Second, we have parallelized, memory-optimized, and implemented data streaming for the history-augmented Markov state model (haMSM) analysis scheme (Copperman and Zuckerman 2020) to enable application to the TB-scale S-opening datasets. The haMSM approach estimates rate constants from simulations that have not yet reached a steady state (Suarez et al., 2014). Our WE simulations generated >800 atomically detailed, Delta variant S-opening pathways (Figures 5(B) and and6)6) of the receptor binding domain (RBD) switching from a glycan-shielded "down" to an exposed "up" state using 72 µs of total simulation time within 14 days using 192 NVIDIA V100 GPUs at a time on TACC's Longhorn supercomputer. Among these pathways, 83 reach an "open" state that aligns with the structure of the human ACE2-bound WT S protein (Benton et al., 2020) and 18 reach a dramatically open state (Figure 6). Our haMSM analysis of WT WE simulations successfully provided long-timescale (steady state) rate constants for S-opening based on highly transient information (Figure 5(C)). We also leveraged a simple, yet powerful unsupervised deep learning method called Anharmonic Conformational Analysis enabled Autoencoders (ANCA-AE) Clyde et al. (2021) to extract conformational states from our long-timescale WE simulations of Delta spike opening (Figures 5(A) and (D)). ANCA-AE first minimizes the fourth order correlations in atomistic fluctuations from MD simulation datasets and projects the data onto a low dimensional space where one can visualize the anharmonic conformational fluctuations. These projections are then input to an autoencoder that further minimizes non-linear correlations in the atomistic fluctuations to learn an embedding where conformations are automatically clustered based on their structural and energetic similarity. A visualization of the first three dimensions from the latent space articulates the RBD opening motion from its closed state (Figure 5(D)). It is notable that while other deep learning techniques need special purpose hardware (such as GPUs), the ANCA-AE approach can be run with relatively modest CPU resources and can therefore scale to much larger systems (e.g., the virion within aerosol) when optimized. D-NEMD explores pH effects on delta spike We performed D-NEMD simulations of the SH system with GROMACS (Abraham et al., 2015) using a ?pH=2.0 (from 7.0 to 5.0) as the external perturbation. We ran 3200-ns equilibrium MD simulations of SH to generate 87 configurations (29 configurations per replicate) that were used as the starting points for multiple short (10 ns) D-NEMD trajectories under the effect of the external perturbation (?pH=2.0). The effect of a ?pH was modeled by changing the protonation state of histidines 66, 69, 146, 245, 625, 655, 1064, 1083, 1088, and 1101 (we note that other residues may also become protonated (Lobo and Warwicker 2021); the D-NEMD approach can also be applied to examine those). The structural response of the S to the pH decrease was investigated by measuring the difference in the position for each Ca atom between the equilibrium and corresponding D-NEMD simulation at equivalent points in time (Oliveira et al., 2021a), namely after 0, 0.1, 1, 5, and 10 ns of simulation. The D-NEMD simulations reveal that pH changes, of the type expected in aerosols, affect the dynamics of functionally important regions of the spike, with potential implications for viral behavior (Figure 7). As this approach involves multiple short independent non-equilibrium trajectories, it is well suited for cloud computing. All D-NEMD simulations were performed using Oracle Cloud. An external file that holds a picture, illustration, etc. Object name is 10.1177_10943420221128233-fig7.jpg Figure 7. D-NEMD simulations reveal changes in key functional regions of the S protein, including the receptor binding domain, as the result of a pH decrease. Color scale and ribbon thickness indicate the degree of deviation of Ca atoms from their equilibrium position. Red spheres indicate the location of positively charged histidines. How performance was measured WESTPA For the WE simulations of spike opening using WESTPA, we defined the time-to-solution as the total simulation time required to generate the first spike opening event. Spike opening is essentially impossible to observe via conventional MD. WESTPA simulations were run using the AMBER20 dynamics engine and 192 NVIDIA V100 GPUs at a time on TACC's Longhorn supercomputer. NAMD NAMD performance metrics were collected using hardware performance counters for FLOPs/step measurements, and application-internal timers for overall simulation rates achieved by production runs including all I/O for simulation trajectory and checkpoint output. NAMD FLOPs/step measurements were conducted on TACC Frontera, by querying hardware performance counters with the rdmsr utility from Intel msr-tools1 and the "TACC stats" system programs.2 For each simulation, FLOP counts were measured for NAMD simulation runs of two different step counts. The results of the two simulation lengths were subtracted to eliminate NAMD startup operations, yielding an accurate estimate of the marginal FLOPs per step for a continuing simulation (Phillips et al., 2002). Using the FLOPs/step values computed for each simulation, overall FLOP rates were computed by dividing the FLOPs/step value by seconds/step performance data reported by NAMD internal application timers during production runs. GROMACS GROMACS 2020.4 benchmarking was performed on Oracle Cloud Infrastructure (OCI)3 compute shape BM.GPU4.8 consisting of 8×NVIDIA A100 tensor core GPUs, and 64 AMD Rome CPU cores. The simulation used for benchmarking contained 615,563 atoms and was run for 500,000 steps with 2 fs time steps. The simulations were run on increasing numbers of GPUs, from 1 to 8, using eight CPU cores per GPU, running for both the production (Nose-Hoover) and GPU-accelerated (velocity rescaling) thermostats. Particle-mesh Ewald (PME) calculations were pinned to a single GPU, with additional GPUs for multi-GPU jobs used for particle-particle calculations. Performance data (ns/day and average single-precision TFLOPS, calculated as total number of TFLOPs divided by total job walltime) were reported by GROMACS itself. Each simulation was repeated four times and average performance figures reported. Performance results Table 2. Table 2. MD simulation floating point ops per timestep. MD Simulation Code Atoms aFLOPs/step Spike, head AMBER, GROMACS 0.6 m 62.14 GFLOPs/step Spike NAMD 1.7 m 43.05 GFLOPs/step S+m1/m2+ALB NAMD 2.1 m 54.86 GFLOPs/step Resp. Aero.+Vir NAMD 1B 25.81 TFLOPs/step Open in a separate window aFLOPs/step data were computed by direct FLOP measurements from hardware performance counters for NAMD simulations, or by using the application-reported FLOP rates and ns/day simulation performance in the case of GROMACS. NAMD performance NAMD was used to perform all of the simulations listed in Table 1, except for the closed spike "SH" simulations described further below. With the exception of the aerosol and virion simulation, the other NAMD simulations used conventional protocols and have performance and parallel scaling characteristics that closely match the results reported in our previous SARS-CoV-2 research Casalino et al. (2021). NAMD 2.14 scaling performance for the one billion-atom respiratory aerosol and virion simulation run on ORNL Summit is summarized in Tables 3 and and4.4. A significant performance challenge associated with the aerosol virion simulation relates to the roughly 50% reduction in particle density as compared with a more conventional simulation with a fully populated periodic cell. The reduced particle density results in large regions of empty space that nevertheless incur additional overheads associated with both force calculations and integration, and creates problems for the standard NAMD load balancing scheme that estimates the work associated with the cubic "patches" used for parallel domain decomposition. The PME electrostatics algorithm and associated 3-D FFT and transpose operations encompass the entire simulation unit cell and associated patches, requiring involvement in communication and reduction operations despite the inclusion of empty space. Enabling NAMD diagnostic output on a 512-node 1B-atom aerosol and virion simulation revealed that ranks assigned empty regions of the periodic cell had 66 times the number of fixed-size patches as ranks assigned dense regions. The initial load estimate for an empty patch was changed from a fixed 10 atoms to a runtime parameter with a default of 40 atoms, which reduced the patch ratio from 66 to 19 and doubled performance on 512 nodes. Table 3. NAMD performance: Respiratory Aerosol + Virion, 1B atoms, 4 fs timestep w/HMR, and PME every three steps. Nodes Summit Speedup Efficiency CPU + GPU 256 4.18 ns/day ~1.0× ~100% 512 7.68 ns/day 1.84× 92% 1024 13.64 ns/day 3.27× 81% 2048 23.10 ns/day 5.53× 69% 4096 34.21 ns/day 8.19× 51% Open in a separate window Table 4. Peak NAMD FLOP rates, ORNL Summit. NAMD Simulation Atoms, B Nodes Sim rate Performance Resp. Aero.+Vir 1 4096 34.21 ns/day 2.55 PFLOPS Open in a separate window WESTPA performance Our time to solution for WE simulations of spike opening (to the "up" state) (Figure 5) using the WESTPA software and AMBER20 was 14 µs of total simulation time, which was completed in 4 days using 192 NVIDIA V100 GPUs at a time on TACC's Longhorn supercomputer. For reference, conventional MD would require an expected ~5 orders of magnitude more computing. The WESTPA software is highly scalable, with nearly perfect scaling out to >1000 NVIDIA V100 GPUs and this scaling is expected to continue until the filesystem is saturated. Thus, WESTPA makes optimal use of large supercomputers and is limited by filesystem I/O due to the periodic restarting of trajectories after short time intervals. AI-enhanced WE simulations DeepDriveMD is a framework to coordinate the concurrent execution of ensemble simulations and drive them using AI models Brace et al. (2021a); Lee et al. (2019). DeepDriveMD has been shown to improve the scientific performance of diverse problems: from-protein folding to conformation of protein-ligand complexes. We coupled WESTPA to DeepDriveMD, which is responsible for resource dynamism and concurrent heterogeneous task execution (ML and AMBER). The coupled workflow was executed on 1024 nodes on Summit (OLCF), and, in spite of the spatio-temporal heterogeneity of tasks involved, the resource utilization was in the high 90%. Consistent with earlier studies, the coupling of WESTPA to DeepDriveMD results in a 100x improvement in the exploration of phase space. GROMACS performance Figure 8 shows GROMACS parallelizes well across the eight NVIDIA A100 GPUs available on each BM.GPU4.8 instance used in the Cluster in the Cloud4 running on OCI. There is a performance drop for two GPUs due to inefficient division of the PME and particle-particle tasks. Methods to address this exist for the two GPU case Páll et al. (2020), but were not adopted as we were targeting maximum raw performance across all eight GPUs. Production simulations achieved 27% of the peak TFLOPS available from the GPUs. Multiple simulations were run across 10 such compute nodes, enabling the ensemble to run at an average combined speed of 425 TFLOPS and sampling up to 1µs/day. We note that the calculations will be able to run 20%-40% faster once the Nose-Hoover thermostat that is required for the simulation is ported to run on the GPU. Benchmarking using a velocity rescaling thermostat that has been ported to GPU shows that this would enable the simulation to extract 34% of the peak TFLOPS from the cards, enabling each node to achieve an average speed of 53.4 TFLOPS, and 125 ns/day. A cluster of 10 nodes would enable GROMACS to run at an average combined speed of over 0.5 PFLOPs, simulating over 1.2 µs/day. An external file that holds a picture, illustration, etc. Object name is 10.1177_10943420221128233-fig8.jpg Figure 8. GROMACS performance across 1-8 A100 GPUs in ns/day (thicker, blue lines) and the fraction of maximum theoretical TFLOPS (thinner, green lines); production setup shown with solid line, and runs with the GPU-accelerated thermostat in dashed. A significant innovation is that this power is available on demand: Cluster in the Cloud with GPU-optimized GROMACS was provisioned and benchmarked within 1 day of inception of the project. This was handed to the researcher, who submitted the simulations. Automatically, up to 10 BM.GPU4.8 compute nodes were provisioned on-demand based on requests from the Slurm scheduler. These simulations were performed on OCI, using Cluster in the Cloud Williams (2021) to manage automatic scaling. Cluster in the Cloud was configured to dynamically provision and terminate computing nodes based on the workload. Simulations were conducted using GROMACS 2020.4 compiled with CUDA support. Multiple simultaneous simulations were conducted, with each simulation utilizing a single BM.GPU4.8 node without multinode parallelism. This allowed all production simulations to be completed within 2 days. The actual compute cost of the project was less than $6125 USD (on-demand OCI list price). The huge reduction in "time to science" that low-cost cloud enables changes the way that researchers can access and use HPC facilities. In our opinion, such a setup enables "exclusive on-demand" HPC capabilities for the scientific community for rapid advancement in science. OrbNet performance Prior benchmarking reveals that OrbNet provides over 1000-fold speedup compared to DFT (Christensen et al., 2021). For the calculations presented here, the cost of corresponding high quality range-separated DFT calculations (?B97X-D/def2-TZVP) can be estimated. In Figure 4(E), we consider system sizes which would require 14,000-47,000 atomic orbitals for ?B97X-D/def2-TZVP, exceeding the range of typical DFT evaluations. Estimation of the DFT computational cost of the 1811 configurations studied in Figure 4(E) suggests a total of 115M core-hours on NERSC Cori Haswell nodes; in contrast, the OrbNet calculations for the current study require only 100k core-hours on the same nodes. DFT cost estimates were based on extrapolation from a dataset of over 1M ChEMBL molecules ranging in size from 40 to 107 atom systems considering only the cubic cost component of DFT (Christensen et al., 2021). Go to: Implications Our major scientific achievements are 1. We showcase an extensible AI-enabled multiscale computational framework that bridges time and length scales from electronic structure through whole aerosol particle morphology and dynamics. 2. We develop all-atom simulations of respiratory mucins, and use these to understand the structural basis of interaction with the SARS-CoV-2 spike protein. This has implications for viral binding in the deep lung, which is coated with mucins. We expect the impact of our mucin simulations to be far reaching, as malfunctions in mucin secretion and folding have been implicated in progression of severe diseases such as cancer and cystic fibrosis. 3. We present a significantly enhanced all-atom model and simulation of the SARS-CoV-2 Delta virion, which includes the hundreds of tiled M-protein dimers and the E-protein ion channels. This model can be used as a basis to understand why the Delta virus is so much more infectious than the WT or alpha variants. 4. We develop an ultra-large (1 billion+) all-atom simulation capturing massive chemical and biological complexity within a respiratory aerosol. This simulation provides the first atomic-level views of virus-laden aerosols and is already serving as a basis to develop an untold number of experimentally testable hypotheses. An immediate example suggests a mechanism through which mucins and other species, for example, lipids, which are present in the aerosol, arrange to protect the molecular structure of the virus, which otherwise would be exposed to the air-water interface. This work also opens the door for developing simulations of other aerosols, for example, sea spray aerosols, that are involved in regulating climate. 5. We evidence how changes in pH, which are expected in the aerosol environment, may alter dynamics and allosteric communication pathways in key functional regions of the Delta spike protein. 6. We characterize atomically detailed pathways for the spike-opening process of the Delta variant using WE simulations, revealing a dramatically open state that may facilitate binding to human host cells. 7. We demonstrate how parallelized haMSM analysis of WE data can provide physical rate estimates of spike opening, improving prior estimates by many orders of magnitude. The pipeline can readily be applied to the any variant spike protein or other complex systems of interest. 8. We show how HPC and cloud resources can be used to significantly drive down time-to-solution for major scientific efforts as well as connect researchers and greatly enable complex collaborative interactions. 9. We demonstrate how AI coupled to HPC at multiple levels can result in significantly improved effective performance, for example, with AI-driven WESTPA, and extend the reach and domain of applicability of tools ordinarily restricted to smaller, less complex systems, for example, with OrbNet. 10. While our work provides a successful use case, it also exposes weaknesses in the HPC ecosystem in terms of support for key steps in large/complex computational science campaigns. We find lack of widespread support for high performance remote visualization and interactive graphical sessions for system preparation, debugging, and analysis with diverse science tools to be a limiting factor in such efforts. Go to: Acknowledgements We thank Prof. Kim Prather for inspiring and informative discussions about aerosols and for her commitment to convey the airborne nature of SARS-CoV-2. We thank D. Veesler for sharing the Delta spike NTD coordinates in advance of publication. We thank B. Messer, D. Maxwell, and the Oak Ridge Leadership Computing Facility at Oak Ridge National Laboratory supported by the DOE under Contract DE-AC05-00OR22725. We thank the Texas Advanced Computing Center Frontera team, especially D. Stanzione and T. Cockerill, and for compute time made available through a Director's Discretionary Allocation (NSF OAC-1818253). We thank the Argonne Leadership Computing Facility supported by the DOE under DE-AC02-06CH11357. We thank the Pittsburgh Supercomputer Center for providing priority queues on Bridges-2 through the XSEDE allocation NSF TG-CHE060063. We thank N. Kern and J. Lee of the CHARMM-GUI support team for help converting topologies between NAMD and GROMACS. We thank J. Copperman, G. Simpson, D. Aristoff, and J. Leung for valuable discussions and support from NIH grant GM115805. NAMD and VMD are funded by NIH P41-GM104601. This work was supported by the NSF Center for Aerosol Impacts on Chemistry of the Environment (CAICE), National Science Foundation Center for Chemical Innovation (NSF CHE-1801971), as well as NIH GM132826, NSF RAPID MCB-2032054, an award from the RCSA Research Corp., a UC San Diego Moore's Cancer Center 2020 SARS-CoV-2 seed grant, to R.E.A. This work was also supported by Oracle Cloud credits and related resources provided by the Oracle for Research program. AJM and ASFO receive funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (PREDACTED Advanced Grant, Grant agreement No.: 101021207). Go to: Notes 1. https://github.com/intel/msr-tools 2. https://github.com/TACC/tacc_stats 3. https://www.oracle.com/cloud/ 4. https://cluster-in-the-cloud.readthedocs.io/ Go to: Footnotes The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article. Funding: The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by National Science Foundation (CHE- 1801971); National Science Foundation (MCB- 2032054); National Science Foundation (OAC-1818253); National Science Foundation (TG-CHE060063); U.S. Department of Energy (DE-AC02-06CH11357); U.S. Department of Energy (DE-AC05- 00OR22725); National Institutes of Health (P41-GM104601); National Institutes of Health (R01-GM132826). Go to: ORCID iDs Lorenzo Casalino https://orcid.org/0000-0003-3581-1148 Mia Rosenfeld https://orcid.org/0000-0002-8961-8231 Surl-Hee Ahn https://orcid.org/0000-0002-3422-805X John Russo https://orcid.org/0000-0002-2813-6554 Sofia Oliveira https://orcid.org/0000-0001-8753-4950 Clare Morris https://orcid.org/0000-0002-4314-5387 Alexander Brace https://orcid.org/0000-0001-9873-9177 Hyungro Lee https://orcid.org/0000-0002-4221-7094 Zhuoran Qiao https://orcid.org/0000-0002-5704-7331 Anima Anandkumar https://orcid.org/0000-0002-6974-6797 James Phillips https://orcid.org/0000-0002-2296-3591 John McCalpin https://orcid.org/0000-0002-2535-1355 Christopher Woods https://orcid.org/0000-0001-6563-9903 Matt Williams https://orcid.org/0000-0003-2198-1058 Richard Pitts https://orcid.org/0000-0002-2037-3360 Daniel Zuckerman https://orcid.org/0000-0001-7662-2031 Adrian Mulholland https://orcid.org/0000-0003-1015-4567 Arvind Ramanathan https://orcid.org/0000-0002-1622-5488 Lillian Chong https://orcid.org/0000-0002-0590-483X Rommie E Amaro https://orcid.org/0000-0002-9275-9553 Go to: References Abraham MJ, Murtola T, Schulz R, et al. (2015) GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1-2: 19-25. DOI: 10.1016/j.softx.2015.06.001 10.1016/j.softx.2015.06.001. [CrossRef] [CrossRef] [Google Scholar] Adhikari U, Mostofian B, Copperman J, et al. (2019) Computational estimation of ms-sec atomistic folding times. Journal of the American Chemical Society 141: 6519-6526. DOI: 10.1101/427393 10.1101/427393. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Bangaru S, Gabriel O, Turner HL, et al. (2020) Structural analysis of full-length SARS-CoV-2 spike protein from an advanced vaccine candidate. Science 370: 65201089-65201094. DOI: 10.1126/science.abe1502 10.1126/science.abe1502. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Beglov D, Roux B. (1994) Finite representation of an infinite bulk system: Solvent boundary potential for computer simulations. The Journal of Chemical Physics 100(12). DOI: 10.1063/1.466711 10.1063/1.466711. [CrossRef] [CrossRef] [Google Scholar] Benton DJ, Wrobel AG, Xu P, et al. (2020) Receptor binding and priming of the spike protein of SARS-CoV-2 for membrane fusion. Nature 588: 7837327-7837330. DOI: 10.1038/s41586-020-2772-0 10.1038/s41586-020-2772-0. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Bhowmik D, Gao S, Young MT, et al. (2018) Deep clustering of protein folding simulations. BMC Bioinformatics 19(18): 484. DOI: 10.1186/s12859-018-2507-5 10.1186/s12859-018-2507-5. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Blender Online Community (2020) Blender - a 3D modelling and rendering package. http://www.blender.org. Brace A, Lee H, Ma H, et al. (2021. a) Achieving 100X Faster Simulations of Complex Biological Phenomena by Coupling ML to HPC Ensembles. arXiv: cs.DC/2104.04797. [Google Scholar] Brace A, Michael S, Subbiah V, et al. (2021. b) Stream-AI-MD: Streaming AI-Driven Adaptive Molecular Simulations for Heterogeneous Computing Platforms. New York, NY, USA: Association for Computing Machinery. DOI: 10.1145/3468267.3470578 10.1145/3468267.3470578. [CrossRef] [CrossRef] [Google Scholar] Casalino L, C Dommer A, Gaieb Z, et al. (2021) AI-driven multiscale simulations illuminate mechanisms of SARS-CoV-2 spike dynamics. The International Journal of High Performance Computing Applications 35(5): 432-451. DOI: 10.1177/10943420211006452 10.1177/10943420211006452. [CrossRef] [CrossRef] [Google Scholar] Casalino L, Gaieb Z, Goldsmith JA, et al. (2020) Beyond Shielding: The Roles of Glycans in the SARS-CoV-2 Spike Protein. ACS Central Science 6(10): 1722-1734. DOI: 10.1021/acscentsci.0c01056 10.1021/acscentsci.0c01056. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Case DA, Cheatham TE, III, Darden TA, et al. (n.d.). San Francisco: Publisher: University of California. Amber16. ([n. d.]). [Google Scholar] Chai J, Cai Y, Pang C, et al. (2021) Structural basis for SARS-CoV-2 envelope protein recognition of human cell junction protein PALS1. Nature Communications 12(1): 3433. DOI: 10.1038/s41467-021-23533-x 10.1038/s41467-021-23533-x. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Christensen AS, Krishna Sirumalla S, Qiao Z, et al. (2021) OrbNet Denali: A Machine Learning Potential for Biological and Organic Chemistry with Semi-empirical Cost and DFT Accuracy. arXiv: physics.chem-ph/2107.00299. [PubMed] [Google Scholar] Ciccotti G, Ferrario M. (2016) Non-equilibrium by molecular dynamics: a dynamical approach. Molecular Simulation 42(16): 1385-1400. DOI: 10.1080/08927022.2015.1121543 10.1080/08927022.2015.1121543. [CrossRef] [CrossRef] [Google Scholar] Clyde A, Galanie S, Kneller DW, et al. (2021) High Throughput Virtual Screening and Validation of a SARS-CoV-2 Main Protease Non-covalent Inhibitor. bioRxiv. arXiv: DOI: 10.1101/2021.03.27.437323. https://www.biorxiv.org/content/early/2021/04/02/2021.03.27.437323.full.pdf [PMC free article] [PubMed] [CrossRef] [Google Scholar] Coleman KK, Wen Tay DJ, Tan KS, et al. (2021) Viral Load of SARS-CoV-2 in Respiratory Aerosols Emitted by COVID-19 Patients while Breathing, Talking, and Singing. Clinical Infectious Diseases. DOI: 10.1093/cid/ciab691 10.1093/cid/ciab691. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Copperman J, Zuckerman DM. (2020) Accelerated estimation of long-timescale kinetics from weighted ensemble simulation via non-Markovian "microbin" analysis. Journal of Chemical Theory and Computation 16(11): 6763-6775. [PMC free article] [PubMed] [Google Scholar] D'Imprima E, Floris D, Joppe M, et al. (2019) Protein denaturation at the air-water interface and how to prevent it. eLife 8: e42747. DOI: 10.7554/eLife.42747 10.7554/eLife.42747. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Durrant JD, Amaro RE. (2014) LipidWrapper: An Algorithm for Generating Large-Scale Membrane Models of Arbitrary Geometry. PLoS Computational Biology 10: 7. DOI: 10.1371/journal.pcbi.1003720 10.1371/journal.pcbi.1003720. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Fennelly KP. (2020) Particle sizes of infectious aerosols: implications for infection control. The Lancet Respiratory Medicine 8(9): 914-924. DOI: 10.1016/S2213-2600(20)30323-4 10.1016/S2213-2600(20)30323-4. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Galdadas I, Shen Q, F Oliveira AS, et al. (2021) Allosteric communication in class A ß-lactamases occurs via cooperative coupling of loop dynamics. eLife 10: e66567. DOI: 10.7554/eLife.66567 10.7554/eLife.66567. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Guvench O, Hatcher E, Venable RM, et al. (2009) CHARMM additive all-atom force field for glycosidic linkages between hexopyranoses. Journal of Chemical Theory and Computation 5: 2353-2370. DOI: 10.1021/ct900242e 10.1021/ct900242e. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Han K, Richard M, VenableBryant A-M, et al. (2018) Graph-Theoretic Analysis of Monomethyl Phosphate Clustering in Ionic Solutions. The Journal of Physical Chemistry B 122(4): 1484-1494. PMID: 29293344 DOI: 10.1021/acs.jpcb.7b10730 10.1021/acs.jpcb.7b10730. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Hopkins CW, ScottGrand Le, Walker RC, et al. (2015) Long-Time-Step Molecular Dynamics through Hydrogen Mass Repartitioning. Journal of Chemical Theory and Computation 11(1): 1864-1874. DOI: 10.1021/ct5010406 10.1021/ct5010406. [PubMed] [CrossRef] [CrossRef] [Google Scholar] Hu X, Lenz-Himmer M-O, Baldauf C. (2021) Better Force Fields Start with Better Data - A Data Set of Cation Dipeptide Interactions. arXiv:q-bio.BM/2107.08855. [PMC free article] [PubMed] [Google Scholar] Huang J, Mackerell AD. (2013) CHARMM36 all-atom additive protein force field: Validation based on comparison to NMR data. Journal of Computational Chemistry 34(25): 2135-2145. DOI: 10.1002/jcc.23354 10.1002/jcc.23354. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Huang J, Rauscher S, Nawrocki G, et al. (2017) CHARMM36m: An Improved Force Field for Folded and Intrinsically Disordered Proteins. Nature Methods 14(1): 71-73. DOI: 10.1038/nmeth.4067 10.1038/nmeth.4067. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Huber GA, Kim S. (1996) Weighted-ensemble Brownian dynamics simulations for protein association reactions. Biophysical Journal 70(1): 97-110. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1224912/ [PMC free article] [PubMed] [Google Scholar] Hughes GW, Ridley C, Collins R, et al. (2019) The MUC5B mucin polymer is dominated by repeating structural motifs and its topology is regulated by calcium and pH. Scientific Reports 9(1): 17350. DOI: 10.1038/s41598-019-53768-0 10.1038/s41598-019-53768-0. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Humphrey W, Dalke A, Schulten K. (1996) VMD - Visual Molecular Dynamics. J. Mol. Graphics 14(1): 33-38. DOI: 10.1016/0263-7855(96)00018-5 10.1016/0263-7855(96)00018-5. [PubMed] [CrossRef] [CrossRef] [Google Scholar] Jo S, Song KC, Desaire H, et al. (2011) Glycan reader: Automated sugar identification and simulation preparation for carbohydrates and glycoproteins. Journal of Computational Chemistry 32(14): 3135-3141. DOI: 10.1002/jcc.21886 10.1002/jcc.21886. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Kannan SR, Spratt AN, Cohen AR, et al. (2021) Evolutionary analysis of the Delta and Delta Plus variants of the SARS-CoV-2 viruses. Journal of Autoimmunity 124(2021): 102715. DOI: 10.1016/j.jaut.2021.102715 10.1016/j.jaut.2021.102715. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Ke Z, Oton J, Qu K, et al. (2020) Structures and distributions of SARS-CoV-2 spike proteins on intact virions. Nature 588(2020): 1-7. DOI: 10.1038/s41586-020-2665-2 10.1038/s41586-020-2665-2. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Klauda JB, Venable RM, Alfredo Freites J, et al. (2010) Update of the CHARMM All-Atom Additive Force Field for Lipids: Validation on Six Lipid Types. The Journal of Physical Chemistry B 114(23): 7830-7843. PMID: 20496934 DOI: 10.1021/jp101759q 10.1021/jp101759q. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Lee H, Turilli M, Jha S, et al. (2019) DeepDriveMD: Deep-Learning Driven Adaptive Molecular Simulations for Protein Folding.In: 2019 IEEE/ACM Third Workshop on Deep Learning on Supercomputers (DLS), pp. 12-19. [Google Scholar] Lee J, Cheng Xi, Jason M, et al. (2016) CHARMM-GUI Input Generator for NAMD, GROMACS, AMBER, OpenMM, and CHARMM/OpenMM Simulations Using the CHARMM36 Additive Force Field. Journal of Chemical Theory and Computation 12(1): 405-413. PMID: 26631602 DOI: 10.1021/acs.jctc.5b00935 10.1021/acs.jctc.5b00935. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Lobo VR, Warwicker J. (2021) Predicted pH-dependent stability of SARS-CoV-2 spike protein trimer from interfacial acidic groups. Computational and Structural Biotechnology Journal 19(2021): 5140-5148. DOI: 10.1016/j.csbj.2021.08.049 10.1016/j.csbj.2021.08.049. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Mandala VS, McKay MJ, Shcherbakov AA, et al. (2020) Structure and drug binding of the SARS-CoV-2 envelope protein transmembrane domain in lipid bilayers. Nature Structural & Molecular Biology 27(12): 1202-1208. DOI: 10.1038/s41594-020-00536-8 10.1038/s41594-020-00536-8. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Mariethoz J, Alocci D, Gastaldello A, et al. (2018) Glycomics@ExPASy: Bridging the Gap*. Molecular and Cellular Proteomics 17(11): 2164-2176. DOI: 10.1074/mcp.RA118.000799 10.1074/mcp.RA118.000799. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Markovetz MR, Subramani DB, Kissner WJ, et al. (2019) Endotracheal tube mucus as a source of airway mucus for rheological study. American Journal of Physiology-Lung Cellular and Molecular Physiology 317(4): L498-L509. PMID: 31389736 DOI: 10.1152/ajplung.00238.2019 10.1152/ajplung.00238.2019. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Martínez L, Andrade R, Birgin EG, et al. (2009) PACKMOL: A package for building initial configurations for molecular dynamics simulations. Journal of Computational Chemistry 30: 132157-132164. arXiv: DOI: 10.1002/jcc.21224. [PubMed] [CrossRef] [Google Scholar] McCallum M, Walls AC, Sprouse KR, et al. (2021) Molecular Basis of Immune Evasion by the Delta and Kappa SARS-CoV-2 Variants. bioRxiv. arXiv: DOI: 10.1101/2021.08.11.455956. https://www.biorxiv.org/content/early/2021/08/12/2021.08.11.455956.full.pdf [PubMed] [CrossRef] [Google Scholar] Miller SL, Nazaroff WW, Jimenez JL, et al. (2021) Transmission of SARS-CoV-2 by inhalation of respiratory aerosol in the Skagit Valley Chorale superspreading event. Indoor Air 31(2): 314-323. DOI: 10.1111/ina.12751 10.1111/ina.12751. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Noe F. (2020) Machine Learning for Molecular Dynamics on Long Timescales. Cham: Springer International Publishing, pp. 331-372. DOI: 10.1007/978-3-030-40245-7_16 10.1007/978-3-030-40245-7_16. [CrossRef] [CrossRef] [Google Scholar] Oliveira ASF, Ciccotti G, Haider S, et al. (2021. a) Dynamical nonequilibrium molecular dynamics reveals the structural basis for allostery and signal propagation in biomolecular systems. The European Physical Journal B 94(7): 144. DOI: 10.1140/epjb/s10051-021-00157-0 10.1140/epjb/s10051-021-00157-0. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Oliveira ASF, Edsall CJ, Woods CJ, et al. (2019) A General Mechanism for Signal Propagation in the Nicotinic Acetylcholine Receptor Family. Journal of the American Chemical Society 141(51): 19953-19958. PMID: 31805762 DOI: 10.1021/jacs.9b09055 10.1021/jacs.9b09055. [PubMed] [CrossRef] [CrossRef] [Google Scholar] Oliveira ASF, Shoemark DK, Avila Ibarra A, et al. (2021. b) The fatty acid site is coupled to functional motifs in the SARS-CoV-2 spike protein and modulates spike allosteric behaviour. bioRxiv 20arXiv: DOI: 10.1101/2021.06.07.447341. https://www.biorxiv.org/content/early/2021/06/09/2021.06.07.447341.full.pdf [PMC free article] [PubMed] [CrossRef] [Google Scholar] Park SJ, Lee J, Qi Y, et al. (2019) CHARMM-GUI Glycan Modeler for modeling and simulation of carbohydrates and glycoconjugates. Glycobiology 29(4): 320-331. DOI: 10.1093/glycob/cwz003 10.1093/glycob/cwz003. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Petters MD, Kreidenweis SM. (2007) A single parameter representation of hygroscopic growth and cloud condensation nucleus activity. Atmospheric Chemistry and Physics 7(8): 1961-1971. DOI: 10.5194/acp-7-1961-2007 10.5194/acp-7-1961-2007. [CrossRef] [CrossRef] [Google Scholar] Phillips J, Zheng G, Kumar S, et al. (2002) NAMD: Biomolecular Simulation on Thousands of Processors. In:Proceedings of the IEEE/ACM SC2002 Conference. Baltimore, MD: IEEE Press, pp. 1-18. Technical Paper 277. [Google Scholar] Phillips J., Braun R, Wang W, et al. (2005) Scalable Molecular Dynamics with NAMD. DOI: 10.1002/jcc.20289 10.1002/jcc.20289. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Phillips JC, Hardy DJ, Maia JDC, et al. (2020) Scalable molecular dynamics on CPU and GPU architectures with NAMD. J. Chem. Phys 153: 044130. DOI: 10.1063/5.0014475 10.1063/5.0014475. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Páll S, Zhmurov A, Bauer P, et al. (2020) Heterogeneous parallelization and acceleration of molecular dynamics simulations in GROMACS. The Journal of Chemical Physics 153: 13134110. DOI: 10.1063/5.0018516 10.1063/5.0018516. [PubMed] [CrossRef] [CrossRef] [Google Scholar] Qiao Z, Welborn M, Anandkumar A, et al. (2020) OrbNet: Deep learning for quantum chemistry using symmetry-adapted atomic-orbital features. The Journal of Chemical Physics 153: 12124111. DOI: 10.1063/5.0021955 10.1063/5.0021955. [PubMed] [CrossRef] [CrossRef] [Google Scholar] Raveh B, London N, Schueler-Furman O. (2010) Sub-angstrom modeling of complexes between flexible peptides and globular proteins. Proteins: Structure, Function, and Bioinformatics 78(9): 2029-2040. arXiv: DOI: 10.1002/prot.22716. https://onlinelibrary.wiley.com/doi/pdf/10.1002/prot.22716 [PubMed] [CrossRef] [Google Scholar] Russo JD, Zhang S, Leung JMG, et al. (2022) WESTPA 2.0: High-Performance Upgrades for Weighted Ensemble Simulations and Analysis of Longer-Timescale Applications. Journal of Chemical Theory and Computation 18: 638-649. DOI: 10.1021/acs.jctc.1c01154. [PMC free article] [PubMed] [CrossRef] [Google Scholar] Saglam AS, Chong LT. (2019) Protein-protein binding pathways and calculations of rate constants using fully-continuous, explicit-solvent simulations. Chemical Science 10(8): 2360-2372. DOI: 10.1039/c8sc04811h 10.1039/c8sc04811h. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Salomon-Ferrer R, Götz AW, Duncan P, et al. (2013) Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 2. Explicit Solvent Particle Mesh Ewald. Journal of Chemical Theory and Computation 9(9): 3878-3888. DOI: 10.1021/ct400314y 10.1021/ct400314y. [PubMed] [CrossRef] [CrossRef] [Google Scholar] Sener Melih, Levy Stuart, Stone John E., et al. (2021) Multiscale Modeling and Cinematic Visualization of Photosynthetic Energy Conversion Processes from Electronic to Cell Scales. Parallel Comput., p. 102698. [PMC free article] [PubMed] [Google Scholar] Stone JE, Hynninen A-P, Phillips JC, et al. (2016. a) Early Experiences Porting the NAMD and VMD Molecular Simulation and Analysis Software to GPU-Accelerated OpenPOWER Platforms. International Workshop on OpenPOWER for HPC, pp. 188-206. (IWOPH'16). [PMC free article] [PubMed] [Google Scholar] Stone JE, Barry I, Schulten K. (2013. a) Early Experiences Scaling VMD Molecular Visualization and Analysis Jobs on Blue Waters. In: Extreme Scaling Workshop (XSW), pp. 43-50. DOI: 10.1109/XSW.2013.10 10.1109/XSW.2013.10. [CrossRef] [CrossRef] [Google Scholar] Stone JE, Sener M, Vandivort KL, et al. (2016. b) Atomic Detail Visualization of Photosynthetic Membranes with GPU-Accelerated Ray Tracing. Parallel Comput 55: 17-27. DOI: 10.1016/j.parco.2015.10.015 10.1016/j.parco.2015.10.015. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Stone JE, Vandivort KL, Schulten K. (2013. b) GPU-Accelerated Molecular Visualization on Petascale Supercomputing Platforms. In: Proceedings of the 8th International Workshop on Ultrascale Visualization (UltraVis '13). New York, NY, USA: ACM, p. 8. Article 6. [Google Scholar] Suarez E, Lettieri S, Zwier MC, et al. (2014) Simultaneous computation of dynamical and equilibrium information using a weighted ensemble of trajectories. Journal of Chemical Theory and Computation 10(7): 2658-2667. [PMC free article] [PubMed] [Google Scholar] Sugio S, Kashima A, Mochizuki S, et al. (1999) Crystal structure of human serum albumin at 2.5 Å resolution. Protein Engineering, Design and Selection 12(6): 439-446. arXiv: DOI: 10.1093/protein/12.6.439 10.1093/protein/12.6.439. https://academic.oup.com/peds/article-pdf/12/6/439/18543407/120439.pdf [PubMed] [CrossRef] [CrossRef] [Google Scholar] Surya W, Li Y, Torres J. (2018) Structural model of the SARS coronavirus E channel in LMPG micelles. Biochimica et Biophysica Acta (BBA) - Biomembranes 1860(6): 1309-1317. DOI: 10.1016/j.bbamem.2018.02.017 10.1016/j.bbamem.2018.02.017. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Symmes BA, Stefanski AL, Magin CM, et al. (2018) Role of mucins in lung homeostasis: regulated expression and biosynthesis in health and disease. Biochemical Society Transactions 46(3): 707-719. arXiv: DOI: 10.1042/BST20170455 10.1042/BST20170455. https://portlandpress.com/biochemsoctrans/article-pdf/46/3/707/479418/bst-2017-0455c.pdf [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Sztain T, Ahn S-H, Bogetti AT, et al. (2021) A glycan gate controls opening of the SARS-CoV-2 spike protein. Nature Chemistry 13(10): 963-968. DOI: 10.1038/s41557-021-00758-3 10.1038/s41557-021-00758-3. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Thomsson KA, Schulz BL, Packer NH, et al. (2005) MUC5B glycosylation in human saliva reflects blood group and secretor status. Glycobiology 15(8): 791-804. arXiv: DOI: 10.1093/glycob/cwi059. https://academic.oup.com/glycob/article-pdf/15/8/791/1787060/cwi059.pdf [PubMed] [CrossRef] [Google Scholar] Torrillo PA, Bogetti AT, Chong LT. (2021) A Minimal, Adaptive Binning Scheme for Weighted Ensemble Simulations. The Journal of Physical Chemistry A 125(7): 1642-1649. DOI: 10.1021/acs.jpca.0c10724 10.1021/acs.jpca.0c10724. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Turonova B, Sikora M, Schürmann C, et al. (2020) In situ structural analysis of SARS-CoV-2 spike reveals flexibility mediated by three hinges. Science 370(6513): 203-208. DOI: 10.1126/science.abd5223 10.1126/science.abd5223. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Vejerano EP, Marr LC. (2018) Physico-chemical characteristics of evaporating respiratory fluid droplets. Journal of the Royal Society Interface 15(139): 1-10. DOI: 10.1098/rsif.2017.0939 10.1098/rsif.2017.0939. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Venable RM, Luo Y, Gawrisch K, et al. (2013) Simulations of Anionic Lipid Membranes: Development of Interaction-Specific Ion Parameters and Validation Using NMR Data. The Journal of Physical Chemistry B 117(35): 10183-10192. PMID: 23924441 DOI: 10.1021/jp401512z 10.1021/jp401512z. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Walker JS, Archer J, Florence K, et al. (2021) Accurate Representations of the Microphysical Processes Occurring during the Transport of Exhaled Aerosols and Droplets. ACS Central Science 7(1). DOI: 10.1021/acscentsci.0c01522 10.1021/acscentsci.0c01522. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Walls AC, YoungPark J, Alejandra Tortorici M, et al. (2020) Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell 181(2): 281-292. DOI: 10.1016/j.cell.2020.02.058 10.1016/j.cell.2020.02.058. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Wang CC, Prather KA, Sznitman J, et al. (2021) Airborne transmission of respiratory viruses. Science 373(6558): eabd9149. DOI: 10.1126/science.abd9149 10.1126/science.abd9149. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Warwicker J. (2021) A model for pH coupling of the SARS-CoV-2 spike protein open/closed equilibrium. Briefings in Bioinformatics 22(2): 1499-1507. arXiv: 10.1093/bib/bbab056. https://academic.oup.com/bib/article-pdf/22/2/1499/36654668/bbab056.pdf [PMC free article] [PubMed] [CrossRef] [Google Scholar] Williams M. (2021) Cluster in the Cloud. https://cluster-in-the-cloud.readthedocs.io [Google Scholar] Wrapp D, Wang N, Corbett KS, et al. (2020) Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science 367(6483): 1260-1263. DOI: 10.1126/science.abb2507 10.1126/science.abb2507. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Zimmerman MI, Porter JR, Ward MD, et al. (2021) SARS-CoV-2 simulations go exascale to predict dramatic spike opening and cryptic pockets across the proteome. Nature Chemistry 13(7): 651-659. DOI: 10.1038/s41557-021-00707-0 10.1038/s41557-021-00707-0. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Zuckerman DM, Chong LT. (2017) Weighted Ensemble Simulation: Review of Methodology, Applications, and Software. Annual Review of Biophysics 46: 43-57. DOI: 10.1146/annurev-biophys-070816-033834 10.1146/annurev-biophys-070816-033834. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] Zwier MC, Adelman JL, Kaus JW, et al. (2015) WESTPA: An Interoperable, Highly Scalable Software Package for Weighted Ensemble Simulation and Analysis. Journal of Chemical Theory and Computation 11(2): 800-809. DOI: 10.1021/ct5010615 10.1021/ct5010615. [PMC free article] [PubMed] [CrossRef] [CrossRef] [Google Scholar] OTHER FORMATS PubReader PDF (2.8M) ACTIONS Cite Collections SHARE RESOURCES Similar articles Cited by other articles Links to NCBI Databases FOLLOW NCBI Connect with NLM National Library of Medicine 8600 Rockville Pike Bethesda, MD 20894 Web Policies FOIA HHS Vulnerability Disclosure Help Accessibility Careers NLM NIH HHS USA.gov Tell us what you think!Close |
| Title | Atomic force microscopy and atomistic molecular dynamics simulation data to resolve structures of negatively-supercoiled DNA minicircles at base-pair resolution. |
| Description | Simulations results and data source of Figures published on Nature Communications 2021 |
| Type Of Material | Database/Collection of data |
| Year Produced | 2021 |
| Provided To Others? | Yes |
| Impact | Available publicly. It is of interest of the research community |
| URL | https://springernature.figshare.com/articles/dataset/Atomic_force_microscopy_and_atomistic_molecular... |
| Title | Atomic force microscopy and atomistic molecular dynamics simulation data to resolve structures of negatively-supercoiled DNA minicircles at base-pair resolution. |
| Description | This data record consists 2 zipped folders: Full AFM raw data set.zip, and Source data .zip.The zipped folder Full AFM raw data set.zip contains all raw AFM data including repeats and experiments carried out in alternative conditionsThe primary subfolder names correspond to the method of DNA immobilisation:Nickel - use of 3 mM NiCl2 in Ph7.4 20 mM HEPES bufferPLLNaOAc - use of PLL and pH 5.4 50 mM NaOAc bufferHR images - high resolution images, obtained also using the nickel conditions.The secondary subfolder names correspond to the superhelical density as shown in figure 3 in the article, and these contain the raw AFM images as .spm isles, the sub folders within those are created by the program TopoStats, and are processed data from the raw AFM images. File formats included in the zipped folder: .spm, .tiff, .json, .txt and .pdf.The zipped folder Source data .zip comprises all relevant data, pdbs of all the structures depicted in the paper obtained from simulations and AFM. See below for details on each sub folder within Source data 2.zip. Each folder contains the data used to generate each figure ad supplementary figure in the article. Figure 1: AFM data: the AFM raw files for the high-resolution images shown in figure 1, and calculations of their aspect ratios as aspectratiomanual.xlsxAFM movie: the AFM raw files for the time-lapse images shown in figure 1.MD data: the MD images used for the high-resolution images shown in figure 1 and .tar files - the MD files used to generate the snapshots MD movie: the MD snapshots files for the time-lapse images shown in figure 1 and .tar files - the MD files used to generate the snapshotsFile formats included in the Figure 1 sub folder: 0## files where ## represent numbers, .gwy, .txt, .eps, .mpg and .xlsx.Figure 2: Kink and defect measurements - the measured bend angles shown in Fig 2 and an AFM image showing how the FAM bends were measuredMD Radgyr Writhe - measurements of radius of gyrations and writhe for each topoisomer.tar files - the MD files used to generate the snapshots in 2a.txt file - the profile shown in fig 2bFile formats included in the Figure 2 sub folder: .tiff, .txt and .datFigure 3: The subfolder names correspond to the superhelical density as shown in figure 3, and these contain the raw AFM images as '.spm' isles, the sub folders within those are created by the program TopoStats, and are processed data from the raw AFM images. The '.json' file contains the data used to make the plots shown in Figure 3File formats included in the Figure 3 sub folder: .spm, .tiff, .txt, .json and .pdfFigure 4: '.dat' files contain information from MD simulations used to create the subfigure they are labelled with.The '.spm' and '.037' files are the raw AFM images used in this figure.The .tar files are MD simulations data used to generate the snapshots shown in figure 4.File formats included in the Figure 4 sub folder: .spm, .txt, .pdf and .datFigure S1: Simulations data generated using the SerraLine program, showing the average and maximum deviations from planarity in relative and absolute numbers.Data were plotted suing the distributions_plot.py script.File formats included in the Figure S1 sub folder: .csv, .pdf, and .txtFigure S2a: MD measurement of the writhe over time as a '.dat file' and snapshots as '.pdb' files. File formats included in the Figure S2a sub folder: .pdb and .dat.Figure S2b: MD measurement of the writhe over time as a '.dat file' and snapshots as '.pdb' files.File formats included in the Figure S2b sub folder: .pdb and .dat.Figure S3: The AFM and MD measurements of bending angles including all profiles for MD simulations, generated using Serraline A, FM images and measurements in the form '251angles' '339 angles'.File formats included in the Figure S3 sub folder: .tiff, .txt and .pdb.Figure S4: AFM length analysis of the position of the triplex on linearised minicircles. 'Csv' file contains the length data measured by hand using the IMOD software.Plots: plots of the data raw AFM data: AFM data files used in the analysisFile formats included in the Figure S4 sub folder: .csv, .xlsx, .pdf and 0## files where ## represent numbers.Figure S5: Surface plasmon resonance (SPR) data show the effect of ions on the affinity of the triplex for varying superhelical densities of DNA minicircles, plotted using the script 'sprplot'. '.pdf' files are the plots of the various excel files.File formats included in the Figure S5 sub folder: .json, .pdf and .xlsx.Figure S6: SPR data in showing the affinity of the triplex for varying superhelical densities of DNA minicircles, plotted using the script 'sprplot'. '.pdf' files are the plots of the various excel files.File formats included in the Figure S6 sub folder: .json, .pdf, .xlsx and .pdf.Figure S7: An MS '.tar' file containing the snapshots shown in figure S7File formats included in the Figure S7 sub folder: .pdbFigure S8: AFM data used in figure s8, the '.gwy' files are AFM images of the wide view, and each of the time-lapse images. The '.txt' files are the profiles taken in those images and plotted in the figure.File formats included in the Figure S8 sub folder: .gwy and .txt.Figure S9: Simulations data showing the difference between the OL$ and BSC1 forcefields.File formats included in the Figure S9 sub folder: .datSimulations: The simulations data File formats included in the Simulations sub folder: .gro and .xtcSupp videos: The supplementary videosFile formats included in the SuppVideods sub folder: .pdb and .mpgSoftware needed to access data: 20151103_251_NAT_17ng_Ni_20mm_052DX.058 or AFM_339_TFO_HR_cs.037, spm files & all files included in the "Raw AFM data" sub folder - Gwyddion, Nanoscope Analysiseps files - illustrator/ pdf software.mpg - any movie player.gro - gromacs files- GRO files may be viewed on a computer using a supporting HP calculator emulator, such as Emu48.xtc files - gromacs files- a suitable software like XTrkCADsee http://manual.gromacs.org/documentation/2018/user-guide/file-formats.html for more information on gromacs files.Study aims and methodology: In the cell, DNA is arranged into highly-organised and topologically-constrained (supercoiled) structures. It remains unclear how this supercoiling affects the detailed double-helical structure of DNA, largely because of limitations in spatial resolution of the available biophysical tools. In this study, the authors combined high-resolution atomic force microscopy (AFM) with molecular dynamics (MD) simulations to reveal how supercoiling affects global and local DNA conformation, structure and dynamics in DNA minicircles of length 250-340 bp. The following procedures are described in more detail in the related article: generation and purification of small DNA circles, preparation and analysis of different topological species of minicircles, S1 nuclease digestions, atomic force microscopy, atomistic simulations and surface plasmon resonance. |
| Type Of Material | Database/Collection of data |
| Year Produced | 2021 |
| Provided To Others? | Yes |
| URL | https://springernature.figshare.com/articles/dataset/Atomic_force_microscopy_and_atomistic_molecular... |
| Title | Benchmark set inputs for absolute binding free energy calculations of fragment optimisations |
| Description | Supplementary Information: "Evaluating the use of absolute binding free energy in the fragment optimization process" Provided here are the various scripts, input files, and results necessary to reproduce the outcomes of the above mentioned publication. Please see the provided README.md files for further information on the contents of this dataset. |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| URL | https://zenodo.org/record/5913468 |
| Title | Benchmark set inputs for absolute binding free energy calculations of fragment optimisations |
| Description | Supplementary Information: "Evaluating the use of absolute binding free energy in the fragment optimization process" Provided here are the various scripts, input files, and results necessary to reproduce the outcomes of the above mentioned publication. Please see the provided README.md files for further information on the contents of this dataset. |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| URL | https://zenodo.org/record/5913469 |
| Title | Cyclic di-AMP traps proton-coupled K+ transporters of the KUP family in an inward-occluded conformation |
| Description | Raw data of whole-cell K+ transport data and DSF measurements, detailed information provided in the according paper |
| Type Of Material | Database/Collection of data |
| Year Produced | 2023 |
| Provided To Others? | Yes |
| URL | https://springernature.figshare.com/articles/dataset/Cyclic_di-AMP_traps_proton-coupled_K_sup_sup_tr... |
| Title | Cyclic di-AMP traps proton-coupled K+ transporters of the KUP family in an inward-occluded conformation |
| Description | Raw data of whole-cell K+ transport data and DSF measurements, detailed information provided in the according paper |
| Type Of Material | Database/Collection of data |
| Year Produced | 2023 |
| Provided To Others? | Yes |
| URL | https://springernature.figshare.com/articles/dataset/Cyclic_di-AMP_traps_proton-coupled_K_sup_sup_tr... |
| Title | Dataset for CDL vs MLCL in mitochondrial Complex III - Complex IV supercomplexes |
| Description | Coarse-grained structure (gro) and trajectory (xtc) files for yeast, mouse, and human respiratory supercomplexes in a membrane containing either cardiolipin (CL) or monolysocardiolipin (MLCL). Atomistic structure (gro) and trajectory (xtc) files for the yeast respiratory supercomplex in a membrane containing either CL or MLCL. Three repeats of each simulation are present in each case. |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| URL | https://zenodo.org/record/7261107 |
| Title | Dataset for CDL vs MLCL in mitochondrial Complex III - Complex IV supercomplexes |
| Description | Coarse-grained structure (gro) and trajectory (xtc) files for yeast, mouse, and human respiratory supercomplexes in a membrane containing either cardiolipin (CL) or monolysocardiolipin (MLCL). Atomistic structure (gro) and trajectory (xtc) files for the yeast respiratory supercomplex in a membrane containing either CL or MLCL. Three repeats of each simulation are present in each case. |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| URL | https://zenodo.org/record/7261108 |
| Title | Experimental and molecular modelling data from the study of 46mer DNAzyme |
| Description | Here we are providing access to experimental and molecular dynamics (MD) simulation data from the study of 46mer DNAzyme. Experimental data: NMR experiments for the determination of the diffusion coefficients of the c4, s4, c4s4 without and with Copper(II). (NMR.zip) EPR/ESR experiments, both 1D and 2D, for the determination of g-factors and hyperfine couplings with Cu2+ Mass spectroscopy for the analysis of DNA fragments before and after the cleavage reactions Molecular dynamics data: Atomic pdb models of 46mer DNAzyme with (46m_WithCU.pdb) and without Cu2+ ions (46m_WithoutCU.pdb) Ten trajectories/replicas of 46mer DNAzyme with Cu2+ ions (1-10 46m-WithCU-500ns_dt100ps.xtc) Three trajectories/replicas of 46mer DNAzyme without Cu2+ ions (1-3 46m-WithoutCU-500ns_dt100ps.xtc) Each replica contains 500 nanaoseconds of simulation data sampled at 100 picosecond intervals. Simulations were done using GROMACS2020 package using parmbsc1 forcefield on JADE2 cluster. This project made use of time on HPC granted via the UK High-End Computing Consortium for Biomolecular Simulation, HECBioSim (http://hecbiosim.ac.uk), supported by EPSRC (grant no. EP/R029407/1). EPR experiments were supported by financial support from the IR INFRANALYTICS FR2054. |
| Type Of Material | Database/Collection of data |
| Year Produced | 2023 |
| Provided To Others? | Yes |
| URL | https://brunel.figshare.com/articles/dataset/Experimental_and_molecular_modelling_data_from_the_stud... |
| Title | Experimental and molecular modelling data from the study of 46mer DNAzyme |
| Description | Here we are providing access to experimental and molecular dynamics (MD) simulation data from the study of 46mer DNAzyme. Experimental data: NMR experiments for the determination of the diffusion coefficients of the c4, s4, c4s4 without and with Copper(II). (NMR.zip) EPR/ESR experiments, both 1D and 2D, for the determination of g-factors and hyperfine couplings with Cu2+ Mass spectroscopy for the analysis of DNA fragments before and after the cleavage reactions Molecular dynamics data: Atomic pdb models of 46mer DNAzyme with (46m_WithCU.pdb) and without Cu2+ ions (46m_WithoutCU.pdb) Ten trajectories/replicas of 46mer DNAzyme with Cu2+ ions (1-10 46m-WithCU-500ns_dt100ps.xtc) Three trajectories/replicas of 46mer DNAzyme without Cu2+ ions (1-3 46m-WithoutCU-500ns_dt100ps.xtc) Each replica contains 500 nanaoseconds of simulation data sampled at 100 picosecond intervals. Simulations were done using GROMACS2020 package using parmbsc1 forcefield on JADE2 cluster. This project made use of time on HPC granted via the UK High-End Computing Consortium for Biomolecular Simulation, HECBioSim (http://hecbiosim.ac.uk), supported by EPSRC (grant no. EP/R029407/1). EPR experiments were supported by financial support from the IR INFRANALYTICS FR2054. |
| Type Of Material | Database/Collection of data |
| Year Produced | 2023 |
| Provided To Others? | Yes |
| URL | https://brunel.figshare.com/articles/dataset/Experimental_and_molecular_modelling_data_from_the_stud... |
| Title | Experimental and molecular modelling data from the study of 46mer DNAzyme |
| Description | Here we are providing access to experimental and molecular dynamics (MD) simulation data from the study of 46mer DNAzyme.Experimental data:NMR experiments for the determination of the diffusion coefficients of the c4, s4, c4s4 without and with Copper(II). (NMR.zip)EPR/ESR experiments, both 1D and 2D, for the determination of g-factors and hyperfine couplings with Cu2+Mass spectroscopy for the analysis of DNA fragments before and after the cleavage reactionsMolecular dynamics data:Atomic pdb models of 46mer DNAzyme with (46m_WithCU.pdb) and without Cu2+ ions (46m_WithoutCU.pdb)Ten trajectories/replicas of 46mer DNAzyme with Cu2+ ions (1-10 46m-WithCU-500ns_dt100ps.xtc)Three trajectories/replicas of 46mer DNAzyme without Cu2+ ions (1-3 46m-WithoutCU-500ns_dt100ps.xtc)Each replica contains 500 nanaoseconds of simulation data sampled at 100 picosecond intervals. Simulations were done using GROMACS2020 package using parmbsc1 forcefield on JADE2 cluster.This project made use of time on HPC granted via the UK High-End Computing Consortium for Biomolecular Simulation, HECBioSim (http://hecbiosim.ac.uk), supported by EPSRC (grant no. EP/R029407/1). EPR experiments were supported by financial support from the IR INFRANALYTICS FR2054. |
| Type Of Material | Database/Collection of data |
| Year Produced | 2023 |
| Provided To Others? | Yes |
| URL | https://brunel.figshare.com/articles/dataset/Experimental_and_molecular_modelling_data_from_the_stud... |
| Title | HECBioSim UK HPC Performance Benchmark Database |
| Description | This database contains a large number of performance benchmarks taken across the UK Tier1 and Tier2 HPC facilities with AMBER, GROMACS, LAMMPS and NAMD. These benchmarks are able to run across the wide variety of architectures available across the UK and report typical performance figures for a number of commonly used simulation protocols. |
| Type Of Material | Database/Collection of data |
| Year Produced | 2021 |
| Provided To Others? | Yes |
| Impact | This dataset assists PIs with applications for time across the many HPC facilities across the UK and gives the computing centers and the HECBioSim RAP a yard stick to measure requests for resource against the science case provided. These benchmarks have been used during the commissioning process of JADE2 to illustrate performance realised during the installation phase of the machine and were incorporated into the acceptance test schedule for Oxford University. |
| URL | http://www.hecbiosim.ac.uk |
| Title | IHF bends and bridges DNA in a multiplicity of states |
| Description | A representative pdb for each of the 4 topological states IHF induces in linear DNA |
| Type Of Material | Database/Collection of data |
| Year Produced | 2020 |
| Provided To Others? | Yes |
| Impact | Structures obtained from simulations publicly available |
| URL | https://pure.york.ac.uk/portal/en/datasets/ihf-bends-and-bridges-dna-in-a-multiplicity-of-states(0b6... |
| Title | MemProtMD |
| Description | A collection of all membrane protein structures in their native lipid membranes. |
| Type Of Material | Database/Collection of data |
| Year Produced | 2015 |
| Provided To Others? | Yes |
| Impact | Many fruitful collaborations with other scientists and workshops. Through further funding you have enabled us to keep this website live and fully functional. Sustainable research is essential! |
| URL | http://memprotmd.bioch.ox.ac.uk/ |
| Title | Sampled ?H/?? and ?H data from ABFE calculations (using standard atomic masses) of 10 ligands bound to Cyclophilin D |
| Description | Supplementary Information: "Evaluating the use of absolute binding free energy in the fragment optimization process" Included are the ABFE raw free energy samples for multiple replicates (labelled by `run` number) of 10 ligands to bound Cyclophilin D. These ligands are originally detailed by Grädler et al. (https://doi.org/10.1016/j.bmcl.2019.126717). Unlike other datasets in this work, which employed hydrogen mass repartitioning, the ligands here were calculated using standard atomic masses. All samples are provided as a set of `.xvg` files as generated by GROMACS 2021 (https://doi.org/10.5281/zenodo.5849961). The `.xvg` files are labelled as dhdl.N.xvg where N represents the ? state the free energy values were sampled from. The `.xvg` files contain both ?H/?? and ?H values, please see the header of each files for more information. Samples detailing the partial decoupling of the ligand from the protein-ligand complex are contained within the `complex` folder. These consist of an orientational restraint addition step (found within the `restraints-xvg` folders), charge annihilation step (found within the `coul-xvg` folders), and Van der Waals decoupling step (found within the `vdw-xvg` folders). Samples detailing the partial decoupling of the ligand from solvent are contained within the `ligand` folder and consist of a charge annihilation step (found within the individual `coul-xvg` folders) and a Van der Waals decoupling step (found within the individual `vdw-xvg` folders). |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| URL | https://zenodo.org/record/5906018 |
| Title | Sampled ?H/?? and ?H data from ABFE calculations (using standard atomic masses) of 10 ligands bound to Cyclophilin D |
| Description | Supplementary Information: "Evaluating the use of absolute binding free energy in the fragment optimization process" Included are the ABFE raw free energy samples for multiple replicates (labelled by `run` number) of 10 ligands to bound Cyclophilin D. These ligands are originally detailed by Grädler et al. (https://doi.org/10.1016/j.bmcl.2019.126717). Unlike other datasets in this work, which employed hydrogen mass repartitioning, the ligands here were calculated using standard atomic masses. All samples are provided as a set of `.xvg` files as generated by GROMACS 2021 (https://doi.org/10.5281/zenodo.5849961). The `.xvg` files are labelled as dhdl.N.xvg where N represents the ? state the free energy values were sampled from. The `.xvg` files contain both ?H/?? and ?H values, please see the header of each files for more information. Samples detailing the partial decoupling of the ligand from the protein-ligand complex are contained within the `complex` folder. These consist of an orientational restraint addition step (found within the `restraints-xvg` folders), charge annihilation step (found within the `coul-xvg` folders), and Van der Waals decoupling step (found within the `vdw-xvg` folders). Samples detailing the partial decoupling of the ligand from solvent are contained within the `ligand` folder and consist of a charge annihilation step (found within the individual `coul-xvg` folders) and a Van der Waals decoupling step (found within the individual `vdw-xvg` folders). |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| URL | https://zenodo.org/record/5906019 |
| Title | Sampled ?H/?? and ?H data from ABFE calculations of 10 ligands bound to Cyclophilin D |
| Description | Supplementary Information: "Evaluating the use of absolute binding free energy in the fragment optimization process" Included are the ABFE raw free energy samples for multiple replicates (labelled by `run` number) of 10 ligands bound to Cyclophilin D. These ligands are originally detailed by Grädler et al. (https://doi.org/10.1016/j.bmcl.2019.126717). Unlike the other Cyclophilin D dataset provided in this work, simulations for these ABFE calculations were carried out using hydrogen mass repartitioning (HMR). The MCL-1, HSP90, and PWWP1 also used HMR. All samples are provided as a set of `.xvg` files as generated by GROMACS 2021 (https://doi.org/10.5281/zenodo.5849961). The `.xvg` files are labelled as dhdl.N.xvg where N represents the ? state the free energy values were sampled from. The `.xvg` files contain both ?H/?? and ?H values, please see the header of each files for more information. Samples detailing the partial decoupling of the ligand from the protein-ligand complex are contained within the `complex` folder. These consist of an orientational restraint addition step (found within the `restraints-xvg` folders), charge annihilation step (found within the `coul-xvg` folders), and Van der Waals decoupling step (found within the `vdw-xvg` folders). Samples detailing the partial decoupling of the ligand from solvent are contained within the `ligand` folder and consist of a charge annihilation step (found within the individual `coul-xvg` folders) and a Van der Waals decoupling step (found within the individual `vdw-xvg` folders). |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| URL | https://zenodo.org/record/5904110 |
| Title | Sampled ?H/?? and ?H data from ABFE calculations of 10 ligands bound to Cyclophilin D |
| Description | Supplementary Information: "Evaluating the use of absolute binding free energy in the fragment optimization process" Included are the ABFE raw free energy samples for multiple replicates (labelled by `run` number) of 10 ligands bound to Cyclophilin D. These ligands are originally detailed by Grädler et al. (https://doi.org/10.1016/j.bmcl.2019.126717). Unlike the other Cyclophilin D dataset provided in this work, simulations for these ABFE calculations were carried out using hydrogen mass repartitioning (HMR). The MCL-1, HSP90, and PWWP1 also used HMR. All samples are provided as a set of `.xvg` files as generated by GROMACS 2021 (https://doi.org/10.5281/zenodo.5849961). The `.xvg` files are labelled as dhdl.N.xvg where N represents the ? state the free energy values were sampled from. The `.xvg` files contain both ?H/?? and ?H values, please see the header of each files for more information. Samples detailing the partial decoupling of the ligand from the protein-ligand complex are contained within the `complex` folder. These consist of an orientational restraint addition step (found within the `restraints-xvg` folders), charge annihilation step (found within the `coul-xvg` folders), and Van der Waals decoupling step (found within the `vdw-xvg` folders). Samples detailing the partial decoupling of the ligand from solvent are contained within the `ligand` folder and consist of a charge annihilation step (found within the individual `coul-xvg` folders) and a Van der Waals decoupling step (found within the individual `vdw-xvg` folders). |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| URL | https://zenodo.org/record/5904109 |
| Title | Sampled ?H/?? and ?H data from ABFE calculations of 12 ligands bound to PWWP1 |
| Description | Supplementary Information: "Evaluating the use of absolute binding free energy in the fragment optimization process" Included are the ABFE raw free energy samples for multiple replicates (labelled by `run` number) of 12 ligands to bound PWWP1. These ligands are originally detailed by Böttcher et al. (https://doi.org/10.1038/s41589-019-0310-x). All samples are provided as a set of `.xvg` files as generated by GROMACS 2021 (https://doi.org/10.5281/zenodo.5849961). The `.xvg` files are labelled as dhdl.N.xvg where N represents the ? state the free energy values were sampled from. The `.xvg` files contain both ?H/?? and ?H values, please see the header of each files for more information. Samples detailing the partial decoupling of the ligand from the protein-ligand complex are contained within the `complex` folder. These consist of an orientational restraint addition step (found within the `restraints-xvg` folders), charge annihilation step (found within the `coul-xvg` folders), and Van der Waals decoupling step (found within the `vdw-xvg` folders). Samples detailing the partial decoupling of the ligand from solvent are contained within the `ligand` folder and consist of a charge annihilation step (found within the individual `coul-xvg` folders) and a Van der Waals decoupling step (found within the individual `vdw-xvg` folders). |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| URL | https://zenodo.org/record/5906262 |
| Title | Sampled ?H/?? and ?H data from ABFE calculations of 12 ligands bound to PWWP1 |
| Description | Supplementary Information: "Evaluating the use of absolute binding free energy in the fragment optimization process" Included are the ABFE raw free energy samples for multiple replicates (labelled by `run` number) of 12 ligands to bound PWWP1. These ligands are originally detailed by Böttcher et al. (https://doi.org/10.1038/s41589-019-0310-x). All samples are provided as a set of `.xvg` files as generated by GROMACS 2021 (https://doi.org/10.5281/zenodo.5849961). The `.xvg` files are labelled as dhdl.N.xvg where N represents the ? state the free energy values were sampled from. The `.xvg` files contain both ?H/?? and ?H values, please see the header of each files for more information. Samples detailing the partial decoupling of the ligand from the protein-ligand complex are contained within the `complex` folder. These consist of an orientational restraint addition step (found within the `restraints-xvg` folders), charge annihilation step (found within the `coul-xvg` folders), and Van der Waals decoupling step (found within the `vdw-xvg` folders). Samples detailing the partial decoupling of the ligand from solvent are contained within the `ligand` folder and consist of a charge annihilation step (found within the individual `coul-xvg` folders) and a Van der Waals decoupling step (found within the individual `vdw-xvg` folders). |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| URL | https://zenodo.org/record/5906261 |
| Title | Sampled ?H/?? and ?H data from ABFE calculations of 18 ligands bound to HSP90 |
| Description | Supplementary Information: "Evaluating the use of absolute binding free energy in the fragment optimization process" Included are the ABFE raw free energy samples for multiple replicates (labelled by `run` number) of 18 ligands bound to HSP90. These ligands are originally detailed by Murray et al. (https://doi.org/10.1021/jm100059d). All samples are provided as a set of `.xvg` files as generated by GROMACS 2021 (https://doi.org/10.5281/zenodo.5849961). The `.xvg` files are labelled as dhdl.N.xvg where N represents the ? state the free energy values were sampled from. The `.xvg` files contain both ?H/?? and ?H values, please see the header of each files for more information. Samples detailing the partial decoupling of the ligand from the protein-ligand complex are contained within the `complex` folder. These consist of an orientational restraint addition step (found within the `restraints-xvg` folders), charge annihilation step (found within the `coul-xvg` folders), and Van der Waals decoupling step (found within the `vdw-xvg` folders). Samples detailing the partial decoupling of the ligand from solvent are contained within the `ligand` folder and consist of a charge annihilation step (found within the individual `coul-xvg` folders) and a Van der Waals decoupling step (found within the individual `vdw-xvg` folders). |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| URL | https://zenodo.org/record/5906109 |
| Title | Sampled ?H/?? and ?H data from ABFE calculations of 18 ligands bound to HSP90 |
| Description | Supplementary Information: "Evaluating the use of absolute binding free energy in the fragment optimization process" Included are the ABFE raw free energy samples for multiple replicates (labelled by `run` number) of 18 ligands bound to HSP90. These ligands are originally detailed by Murray et al. (https://doi.org/10.1021/jm100059d). All samples are provided as a set of `.xvg` files as generated by GROMACS 2021 (https://doi.org/10.5281/zenodo.5849961). The `.xvg` files are labelled as dhdl.N.xvg where N represents the ? state the free energy values were sampled from. The `.xvg` files contain both ?H/?? and ?H values, please see the header of each files for more information. Samples detailing the partial decoupling of the ligand from the protein-ligand complex are contained within the `complex` folder. These consist of an orientational restraint addition step (found within the `restraints-xvg` folders), charge annihilation step (found within the `coul-xvg` folders), and Van der Waals decoupling step (found within the `vdw-xvg` folders). Samples detailing the partial decoupling of the ligand from solvent are contained within the `ligand` folder and consist of a charge annihilation step (found within the individual `coul-xvg` folders) and a Van der Waals decoupling step (found within the individual `vdw-xvg` folders). |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| URL | https://zenodo.org/record/5906110 |
| Title | Sampled ?H/?? and ?H data from ABFE calculations of 19 ligands bound to MCL-1 |
| Description | Supplementary Information: "Evaluating the use of absolute binding free energy in the fragment optimization process" Included are the ABFE raw free energy samples for multiple replicates (labelled by `run` number) of 19 ligands to bound MCL-1. These ligands are originally detailed by Friberg et al. (https://doi.org/10.1021/jm301448p). All samples are provided as a set of `.xvg` files as generated by GROMACS 2021 (https://doi.org/10.5281/zenodo.5849961). The `.xvg` files are labelled as dhdl.N.xvg where N represents the ? state the free energy values were sampled from. The `.xvg` files contain both ?H/?? and ?H values, please see the header of each files for more information. Samples detailing the partial decoupling of the ligand from the protein-ligand complex are contained within the `complex` folder. These consist of an orientational restraint addition step (found within the `restraints-xvg` folders), charge annihilation step (found within the `coul-xvg` folders), and Van der Waals decoupling step (found within the `vdw-xvg` folders). Samples detailing the partial decoupling of the ligand from solvent are contained within the `ligand` folder and consist of a charge annihilation step (found within the individual `coul-xvg` folders) and a Van der Waals decoupling step (found within the individual `vdw-xvg` folders). |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| URL | https://zenodo.org/record/5906804 |
| Title | Sampled ?H/?? and ?H data from ABFE calculations of 19 ligands bound to MCL-1 |
| Description | Supplementary Information: "Evaluating the use of absolute binding free energy in the fragment optimization process" Included are the ABFE raw free energy samples for multiple replicates (labelled by `run` number) of 19 ligands to bound MCL-1. These ligands are originally detailed by Friberg et al. (https://doi.org/10.1021/jm301448p). All samples are provided as a set of `.xvg` files as generated by GROMACS 2021 (https://doi.org/10.5281/zenodo.5849961). The `.xvg` files are labelled as dhdl.N.xvg where N represents the ? state the free energy values were sampled from. The `.xvg` files contain both ?H/?? and ?H values, please see the header of each files for more information. Samples detailing the partial decoupling of the ligand from the protein-ligand complex are contained within the `complex` folder. These consist of an orientational restraint addition step (found within the `restraints-xvg` folders), charge annihilation step (found within the `coul-xvg` folders), and Van der Waals decoupling step (found within the `vdw-xvg` folders). Samples detailing the partial decoupling of the ligand from solvent are contained within the `ligand` folder and consist of a charge annihilation step (found within the individual `coul-xvg` folders) and a Van der Waals decoupling step (found within the individual `vdw-xvg` folders). |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| URL | https://zenodo.org/record/5906805 |
| Title | Simulations of circular RNAs from the Rolling Circle RNA synthesis |
| Description | All-atom trajectories from simulations done over the different stages of Rolling Circle RNA synthesis |
| Type Of Material | Database/Collection of data |
| Year Produced | 2021 |
| Provided To Others? | Yes |
| Impact | None |
| URL | https://pure.york.ac.uk/portal/en/datasets/b92977bd-f016-4740-8b4a-f86c68d5eb2c |
| Title | Simulations of supercoiled minicircles bound to IHF |
| Description | Trajectories for all-atom simulations pf DNA minicircles bound to the bacterial protein IHF |
| Type Of Material | Database/Collection of data |
| Year Produced | 2022 |
| Provided To Others? | Yes |
| Impact | None |
| Title | agnesnoy/SerraLINE: SerraLINE v1.0 |
| Description | Version utilized in the paper: ALB Pyne, A Noy, K Main, V Velasco-Berrelleza, MM Piperakis, LA Mitchenall, FM Cugliandolo, JG Beton, CEM Stevenson, BW Hoogenboom, AD Bates, A Maxwell, SA Harris (2020). "Base-pair resolution analysis of the effect of supercoiling on DNA flexibility and recognition", Accepted in Nat Commun |
| Type Of Material | Database/Collection of data |
| Year Produced | 2020 |
| Provided To Others? | Yes |
| Impact | None |
| URL | https://zenodo.org/record/4105979 |
| Title | agnesnoy/SerraNA: SerraNA v1.0 |
| Description | By Victor Velasco-Berrelleza and Agnes Noy Published at: V Velasco-Berrelleza, M Burmann, JW Shepherd, MC Leake, R Golestanian, A Noy (2020). "SerraNA: a program to determine nucleic acids elasticity from simulation data" Phys Chem Chem Phys, 22, 19254-19266 https://doi.org/10.1039/D0CP02713H |
| Type Of Material | Database/Collection of data |
| Year Produced | 2020 |
| Provided To Others? | Yes |
| Impact | None |
| URL | https://zenodo.org/record/4105986 |
| Title | agnesnoy/WrLINE: WrLINEv1.0 |
| Description | By Thana Sutthibutpong and Agnes Noy Published at: Sutthibutpong T, Harris SA, Noy A* (2015). "Comparison of molecular contours for measuring writhe in atomistic supercoiled DNA" J Chem Theor Comput, 11, 2768. DOI:10.1021/acs.jctc.5b00035 |
| Type Of Material | Database/Collection of data |
| Year Produced | 2020 |
| Provided To Others? | Yes |
| Impact | None |
| URL | https://zenodo.org/record/4106245 |
| Description | Collaboration 2 with University of Virginia |
| Organisation | University of Virginia (UVa) |
| Country | United States |
| Sector | Academic/University |
| PI Contribution | Combination of Molecular simulations with Electron Paramagnetic Resonance studies to show the impact of binding of antibiotics to bacterial enzymes. |
| Collaborator Contribution | Electron Paramagnetic Resonance studies to show the impact of binding of antibiotics to bacterial enzymes. |
| Impact | Paper currently in preparation. Data presented at the Annual Biophysical Society Meeting in San Diego by our collaborator Prof Linda Columbus. |
| Start Year | 2019 |
| Description | Collaboration with Arvind Ramanathan |
| Organisation | Argonne National Laboratory |
| Country | United States |
| Sector | Public |
| PI Contribution | We provided details of MD codes and also the essential biophysics and biochemistry that must be incorporated into the simulations. As well as details of how we would like to use ML to augment the MD simulations. |
| Collaborator Contribution | They provided training for my postdoc on how to use their software and we are working together at the moment to integrate their software into our workflows. |
| Impact | N/A |
| Start Year | 2024 |
| Description | Collaboration with Case Western |
| Organisation | Case Western Reserve University |
| Department | Department of Biochemistry |
| Country | United States |
| Sector | Academic/University |
| PI Contribution | Molecular simulations of the copper transporter CusA revealing how Copper stabilises the efflux state. We also show how the periplasmic domains change conformation from resting to extrusion state and also how protons may use a water wire across the membrane to power transport via the Proton Motive Force. |
| Collaborator Contribution | Ed Yu and his lab have provided novel structures of the CusA transporter in distinct states. |
| Impact | mBio publication now accepted. |
| Start Year | 2020 |
| Description | Collaboration with Dr. Alice Pyne, University of Sheffield |
| Organisation | Henry Royce Institute |
| Department | Henry Royce Institute – University of Sheffield Facilities |
| Country | United Kingdom |
| Sector | Academic/University |
| PI Contribution | Intellectual input and expertise, time of PhD student and access to data |
| Collaborator Contribution | Access to the Henry Royce center through a participation scheme with time dedicated from the Research Officer and in kind materials |
| Impact | -Article in Nature Communications, DOI: 10.1038/s41467-021-21243-y -Articles in Media: Daily Express, Daily Mail, Yahoo News and Deccan Herald -Interviews in Media: Alice Pyne at ITV |
| Start Year | 2016 |
| Description | Collaboration with Dr. Tung Le, JIC, Norwich |
| Organisation | John Innes Centre |
| Country | United Kingdom |
| Sector | Academic/University |
| PI Contribution | Expertise and PhD student time to perform MD simulations that support the experiments |
| Collaborator Contribution | They provide the experimental structures |
| Impact | A publication in Cell Reports (2020), DOI: 10.1016/j.celrep.2020.107928 |
| Start Year | 2019 |
| Description | Collaboration with Pedro Carvalho |
| Organisation | University of Oxford |
| Department | Sir William Dunn School of Pathology |
| Country | United Kingdom |
| Sector | Academic/University |
| PI Contribution | Simulations of a key lipid-interacting protein. |
| Collaborator Contribution | Structures and Functional data. |
| Impact | Paper in progress. |
| Start Year | 2019 |
| Description | Collaboration with Prof. Anthony Maxwell, JIC, Norwich and Inspiralis |
| Organisation | John Innes Centre |
| Country | United Kingdom |
| Sector | Academic/University |
| PI Contribution | Performed simulations and write a CASE studentship |
| Collaborator Contribution | He put us in contact with the company Inspiralis |
| Impact | CASE studentship: EP/W524657/1 |
| Start Year | 2021 |
| Description | Collaboration with Prof. Fred Antson (York) and Cyril Sanders (Sheffield) |
| Organisation | University of Sheffield |
| Department | Sheffield Medical School |
| Country | United Kingdom |
| Sector | Academic/University |
| PI Contribution | Secure a PhD studentship |
| Collaborator Contribution | Share data and expertise |
| Impact | Shared PhD student ship (EP/N509802/1) Manuscript in preparation |
| Start Year | 2017 |
| Description | Collaboration with Prof. Phil Holliger, MRC-LMB, Cambridge |
| Organisation | Medical Research Council (MRC) |
| Department | MRC Laboratory of Molecular Biology (LMB) |
| Country | United Kingdom |
| Sector | Academic/University |
| PI Contribution | Expertise and time to perform MD simulations that support the experiments |
| Collaborator Contribution | Intellectual contribution and expertise on evolutionary molecular biology |
| Impact | Publication in eLIFE, 2022 with more than 10 citations in a year, DOI: 10.7554/eLife.75186 |
| Start Year | 2021 |
| Description | Collaboration with Tanmay Bharat |
| Organisation | University of Oxford |
| Department | Sir William Dunn School of Pathology |
| Country | United Kingdom |
| Sector | Academic/University |
| PI Contribution | Simulations of bacterial Surface Layer proteins. |
| Collaborator Contribution | Structures and functional data for SLPs |
| Impact | Structural basis of lipopolysaccharide mediated surface layer anchoring on Caulobacter crescentus cells Andriko von Kügelgen, Haiping Tang, Gail G. Hardy, Danguole Kureisaite-Ciziene, Yves V. Brun, Phillip J. Stansfeld, Carol V. Robinson, and Tanmay A.M. Bharat Cell pii: S0092-8674(19)31332-7. doi: 10.1016/j.cell.2019.12.006. A further publication is being finalised for submission. |
| Start Year | 2018 |
| Description | Collaboration with Tanmay Bharat |
| Organisation | University of Oxford |
| Department | Sir William Dunn School of Pathology |
| Country | United Kingdom |
| Sector | Academic/University |
| PI Contribution | My research team is simulating the biofilm matrix protein TasA from Bacillus subtilis based to evaluate protein-protein interactions that stabilise the higher order structures of the protein. |
| Collaborator Contribution | X-ray, electron cryomicroscopy and electron cryotomography data as of the wildtype protein as well as mutants based on our simulations |
| Impact | Manuscript has been submitted |
| Start Year | 2021 |
| Description | Collaboration with Trinity College Dublin |
| Organisation | Trinity College Dublin |
| Department | School of Biochemistry and Immunology |
| Country | Ireland |
| Sector | Academic/University |
| PI Contribution | Collaboration to study lipoprotein maturation and transport. We provide the molecular simulation and computational data to add value to our collaborators' structures. |
| Collaborator Contribution | Our collaborators provide the molecular structures and experimental data. |
| Impact | Publications, seminars and contribution to an international website resource. |
| Start Year | 2013 |
| Description | Collaboration with UCB pharma on cryptic sites |
| Organisation | UCB SA |
| Department | UCB Pharma |
| Country | United Kingdom |
| Sector | Private |
| PI Contribution | We contributed our new computational methods to find cryptic binding sites. |
| Collaborator Contribution | Experimental validation, including crystal structures, surface plasmon resonance, new compounds. |
| Impact | Helped develop new drug candidates. |
| Start Year | 2018 |
| Description | Collaboration with University of Columbia |
| Organisation | Columbia University |
| Country | United States |
| Sector | Academic/University |
| PI Contribution | Computational simulations and modelling to understand key biosynthetic pathways within the bacterial cell envelope. |
| Collaborator Contribution | Molecular structures of key biosynthetic enzymes. |
| Impact | Papers in progress. 1 paper ready for submission in Mar 2021. |
| Start Year | 2020 |
| Description | Developing new computational approaches to inhibit "undruggable" targets. |
| Organisation | UCB SA |
| Department | UCB Pharma |
| Country | United Kingdom |
| Sector | Private |
| PI Contribution | We shared with UCB early versions of a computational platform we are developing to target allosteric sites on otherwise "undruggable" targets, i.e. pharmaceutical targets that are difficult to target with traditional drug design approaches based on substrate competitive ligands. |
| Collaborator Contribution | UCB contributed to the project with high quality structural and biological data and is hiring a dedicated PDRA to work on the collaboration for 3 years. |
| Impact | The new tools have been used to design novel drugs. |
| Start Year | 2016 |
| Description | ERNEST - European Research Network on Signal Transduction (CA18133) / Member of ERNEST Management Committee and the organizer of the first ERNEST Conference |
| Organisation | European Cooperation in Science and Technology (COST) |
| Department | COST Action |
| Country | Belgium |
| Sector | Public |
| PI Contribution | We were honoured to host the inaugural meeting of the European Research Network on Signal Transduction (ERNEST) funded by COST (Cooperation in Science and Technology) Action at Queen's University Belfast on October 28-30, 2019. PI was a main organizer of the GPCR meeting in Belfast. 150 delegates from 31 countries have attended the meeting, which made our conference a truly international event. This meeting was entitled "GPCR Pharmacology: Activation, Signalling and Drug Design" and focused on recent advances in knowledge and technology in the field. PI has designed the scientific program of the conference and has taken care of sponsor and venue arrangements. PDRA has gave a oral presentation on this BBSRC research project. |
| Collaborator Contribution | ERNEST COST Network has provided funding to organize the event. |
| Impact | Scientific outcomes of the meeting included recent advancements in cryo-electron microscopy, which is currently revolutionizing the field; how artificial intelligence and big data could drive pharmacology; the development of a multi-dimensional map of GPCR signal transduction and an overview of the IUPHAR / BPS Guide to PHARMACOLOGY database. The meeting had great networking opportunities and cultural experiences. On the first evening, a Welcome Reception with Deputy Lord Mayor Councillor Peter McReynolds, took place at the Belfast City Hall. The welcoming lectures were given by Michael Livingstone, a local historian, and Dr Aidan Seeley from the British Pharmacological Society. On the second evening, the Conference Dinner with traditional Irish dancing and music took place at the Great Hall. The conference has got additional sponsors: Belfast City Hall, Visit Belfast, British Pharmacological Society, ACS Pharmacology and Translational Science, Boehringer Ingelheim, Sosei Heptares, Nanotemper Technologies and Crelux, WuXi AppTec Company. As a result of this meeting, PI is a contributor to the ERNEST signalling map group. |
| Start Year | 2019 |
| Description | UCB Pharmaceuticals |
| Organisation | UCB SA |
| Department | UCB Pharma |
| Country | United Kingdom |
| Sector | Private |
| PI Contribution | Provision of molecular structures. |
| Collaborator Contribution | Assessment of the structures. |
| Impact | Continued direct impact between academia and pharmaceuticals. |
| Start Year | 2017 |
| Description | University of Virginia |
| Organisation | University of Virginia (UVa) |
| Department | School of Medicine |
| Country | United States |
| Sector | Academic/University |
| PI Contribution | Molecular simulations of molecular structures solved by the Zimmer research group. |
| Collaborator Contribution | Provision of novel molecular protein structures. |
| Impact | Understanding of the mechanisms of polysaccharide transport. |
| Start Year | 2018 |
| Description | collaboration with Dr.Chantal Valeriani |
| Organisation | Complutense University of Madrid |
| Country | Spain |
| Sector | Academic/University |
| PI Contribution | We have collaborated with Dr Valeriani and her team on the behaviour of different water models in simulations of aquaporins. We contributed expertise in water models and simulations of nanopores. This has resulted in a joint publication in J Chem Phys. |
| Collaborator Contribution | Dr. Valeriani's post doc Dr Miguel Angel Gonzalez visited Oxford for a few weeks and worked alongside Dr Lynch (the EPSRC postdoc on this project). We have continued the collaboration online and a paper has been published in J Chem Phys. |
| Impact | This has resulted in a joint publication in J Chem Phys. |
| Start Year | 2018 |
| Title | Confidential |
| Description | Confidential |
| IP Reference | Confidential |
| Protection | Patent / Patent application |
| Year Protection Granted | 2024 |
| Licensed | Commercial In Confidence |
| Impact | Confidential |
| Title | CG2AT2 |
| Description | An enhance version of our method to convert a coarse-grained system to atomic detail. https://github.com/pstansfeld/cg2at |
| Type Of Technology | Software |
| Year Produced | 2021 |
| Open Source License? | Yes |
| Impact | Becoming widely used within our research group and those that that are linked to our work. Aims to publish in the near future and induce world-wide usage. |
| Title | Computational tool to predict allosteric sites in membrane proteins |
| Description | The jupyter notebook scripts to set up the mapping of allosteric sites in GPCRs used in the paper: Probe Confined Dynamic Mapping for GPCR Allosteric Site Prediction Ciancetta A., Gill A.K., Ding T., Karlov D.S., Chalhoub G., McCormick P.J. and Tikhonova I.G. ACS Central Science, 2021 in press https://doi.org/10.1021/acscentsci.1c00802 The automatic procedure of probe confined dynamic mapping is provided as jupyter notebooks (MIDAS_EC.ipynb, MIDAS_IC.ipynb, and MIDAS_LI.ipynb) available. The required input files are (i) pdb of a receptor (preoriented with OPM), membrane, and one copy of cosolvent/fragment structures and (ii) an input file containing the following information: a receptor, an orthosteric ligand (if applicable), cosolvent and membrane file names (1); an orthosteric ligand (if applicable), cosolvent, and lipid residue names (2); cosolvent/fragment molecular weight and desired water/probe m/m % (in the closed box) (3); and height of the water buffer region between the protein and the water/probe mixture box (4). |
| Type Of Technology | New/Improved Technique/Technology |
| Year Produced | 2021 |
| Open Source License? | Yes |
| Impact | The product has received press coverage and new research collaborations. https://www.qub.ac.uk/News/Allnews/Newcomputermodellingcouldboostdrugdiscovery.html |
| URL | https://github.com/irinat12/Probe-Confined-Dynamic-Mapping-Protocols-GPCRs_membrane_proteins |
| Title | Containerised cloud training platform |
| Description | The cloud training platform was developed to offer both the HEC and the CCP consortia a new flexible approach to training in the latest and next generation scientific techniques, methodologies and tools. This platform consists of a catalogue of docker containers containing software compiled by our experts, a catalogue of code repositories containing training courses writen by our experts in the JupyterHub/Lab format. This is all brought together in the cloud via kubernetes and results in a platform that researchers and students in our community can simply navigate to our training section on our website and click the course they want to do and the platform will provision a virtual environment with all software and course material ready for them. |
| Type Of Technology | New/Improved Technique/Technology |
| Year Produced | 2019 |
| Impact | We have been using this platform to deliver consortia training events in the programme. This has allowed us to deliver thousands of training hours (3,051) across all our courses online and to deliver thousands more (2,510) in instructor led sessions. When the Covid-19 pandemic hit, this infrastructure allowed us to move immediately from events at physical locations to fully online backed by zoom lectures. We even scaled up to having over 400 delegates register with around 200 attending a given session where as we only used to allow around 50 to register and 30 to attend a given session previously with in person classes. |
| URL | http://www.ccpbiosim.ac.uk/training |
| Title | Free Energy calculation methods for Lipid-Protein Interactions |
| Description | We have developed new tools for studying lipid-protein interactions. https://github.com/owenvickery/metadynamics_analysis https://github.com/owenvickery/umbrella_sampling |
| Type Of Technology | New/Improved Technique/Technology |
| Year Produced | 2019 |
| Impact | For further details see: Insights into Membrane Protein-Lipid Interactions from Free Energy Calculations. Corey RA, Vickery ON, Sansom MSP, Stansfeld PJ. J Chem Theory Comput. 2019 Oct 8;15(10):5727-5736. doi: 10.1021/acs.jctc.9b00548. |
| URL | https://github.com/owenvickery/metadynamics_analysis |
| Title | HECBioSim Automated Benchmarking Suite |
| Description | The automated benchmarking suite is a set of performance benchmarks and a set of Python utilities that integrate with our Longbow tool to provide an automated benchmarking tool. The benchmarks in this suite have been updated and tested to work with the latest software versions of AMBER, Gromacs, LAMMPS and NAMD. The utilities are both CPU and GPU aware and can profile the architectures using different regimes. This allows us to rapidly profile new and existing equipment for performance and can be used to monitor performance of software over time. |
| Type Of Technology | Software |
| Year Produced | 2021 |
| Open Source License? | Yes |
| Impact | This tool has been used in profiling the ARCHER2 4 cabinet system and has been used to profile the UK Tier2 machines JADE, JADE2, Bede, ISAMBARD, THOMAS and DIRAC to collect performance information. This performance information has been used during the commissioning phase of JADE2 and Bede and is also used to inform the HECBioSim resource allocation panels. |
| URL | http://www.hecbiosim.ac.uk |
| Title | HECBioSim Web Portal Upgrade |
| Description | The HECBioSim web portal provides a clean and simple way for us to interact with our community. It serves as a single place in which members of the bio-simulation community can get access to all of the things we offer; whether this be finding out about/applying for time on ARCHER, downloading our software tools, getting support with using HPC or our software, this web portal provides users with a single and consistent point of entry to what we have to offer. |
| Type Of Technology | Webtool/Application |
| Year Produced | 2021 |
| Impact | The infrastructure that the HECBioSim portal was based was getting old and becoming a security risk, and was becoming unreliable and resulting in down-time. The Hardware had degraded to a point that the underlying operating system could no longer be updated and this brought with it challenges in maintaining a web facing resource. This refresh brought the web portal up to modern microservices style approach that is consistent with hosting in a cloud environment, thus allowing more flexibility going forward, this eliminates being tied to a particular generation of hardware and software and will allow us to deliver better more scalable and sustainable services going forward. |
| URL | http://www.hecbiosim.ac.uk |
| Title | HECBioSim Web Portal refresh |
| Description | The HECBioSim web portal provides a clean and simple way for us to interact with our community. It serves as a single place in which members of the bio-simulation community can get access to all of the things we offer; whether this be finding out about/applying for time on ARCHER, downloading our software tools, getting support with using HPC or our software, this web portal provides users with a single and consistent point of entry to what we have to offer. |
| Type Of Technology | Webtool/Application |
| Year Produced | 2020 |
| Impact | The website was given a new look and feel as part of the new grant and also as part of a drive to modernise the way it behaves on recent browsers. Old features were retired and removed and new content added to reflect the new HPC resources that the HECBioSim got access to. More information was added to assist PIs in applying for resources through the HEC and information was added to support users of the Tier2 facility JADE in how to run their simulations. |
| URL | http://www.hecbiosim.ac.uk |
| Title | Lipoprotein Modification Tool |
| Description | We have developed a method to automatically modify lipoprotein cysteine residues. |
| Type Of Technology | Software |
| Year Produced | 2020 |
| Impact | This forms part of this paper: Characterising Membrane Association and Periplasmic Transfer of Bacterial Lipoproteins through Molecular Dynamics Simulations Shanlin Rao, George Bates, Callum Matthews, Owen Vickery, Phillip J. Stansfeld Structure The tool is here: https://github.com/owenvickery/add_acyl_tails_martini_2.2 |
| URL | https://www.sciencedirect.com/science/article/pii/S0969212620300125 |
| Title | Longbow v1.5.0 |
| Description | Longbow is an automated simulation submission and monitoring tool. Longbow is designed to reproduce the look and feel of using software on the users local computer with the difference that the heavy lifting is done by a supercomputer. Longbow will automatically generate the necessary submit files and handle all initial file transfer, monitor jobs, transfer files at configurable intervals and perform final file transfer and cleanup. Longbow can be used to launch one-off jobs, generate ensembles of similar jobs or even run many different jobs over many different supercomputers. Out of the box, Longbow is currently supporting the PBS/Torque, LSF, SGE, Slurm, SoGE schedulers and ships with application plugins for commonly used bio-molecular simulation softwares AMBER, CHARMM, GROMACS, LAMMPS, NAMD. Longbow is however highly configurable and will function normally with generic software without plugins, however plugins can easily be made to extend Longbow to fully support applications and schedulers that do not ship out of the box. Longbow is also available to developers of applications which require support for automating job submission. Longbow is available as a convenient and light-weight python API that can be integrated in a number of different way. |
| Type Of Technology | Software |
| Year Produced | 2018 |
| Open Source License? | Yes |
| Impact | Longbow has been downloaded by 6,211 times world wide by researchers doing biomolecular simulation. Longbow simplifies the transfer, preparation and submission of simulations across a wide range of HPC architectures. Users at the beginner end of the spectrum can quickly be up and running on any HPC resource within minutes without having to learn much about the process of doing it. Power users can leverage the power of Longbow in generating huge arrays of simulations and have it fire and bring back results across multiple machines that they have access to. It is a tool mainly aimed at taking the pain out of HPC by automating the time consuming tedious tasks associated with doing it at scale. |
| URL | http://www.hecbiosim.ac.uk/longbow |
| Title | Longbow v1.5.1 |
| Description | Longbow is an automated simulation submission and monitoring tool. Longbow is designed to reproduce the look and feel of using software on the users local computer with the difference that the heavy lifting is done by a supercomputer. Longbow will automatically generate the necessary submit files and handle all initial file transfer, monitor jobs, transfer files at configurable intervals and perform final file transfer and cleanup. Longbow can be used to launch one-off jobs, generate ensembles of similar jobs or even run many different jobs over many different supercomputers. Out of the box, Longbow is currently supporting the PBS/Torque, LSF, SGE, Slurm, SoGE schedulers and ships with application plugins for commonly used bio-molecular simulation softwares AMBER, CHARMM, GROMACS, LAMMPS, NAMD. Longbow is however highly configurable and will function normally with generic software without plugins, however plugins can easily be made to extend Longbow to fully support applications and schedulers that do not ship out of the box. Longbow is also available to developers of applications which require support for automating job submission. Longbow is available as a convenient and light-weight python API that can be integrated in a number of different way. |
| Type Of Technology | Software |
| Year Produced | 2019 |
| Open Source License? | Yes |
| Impact | Longbow has been downloaded by 7,542 times world wide by researchers doing biomolecular simulation. Longbow simplifies the transfer, preparation and submission of simulations across a wide range of HPC architectures. Users at the beginner end of the spectrum can quickly be up and running on any HPC resource within minutes without having to learn much about the process of doing it. Power users can leverage the power of Longbow in generating huge arrays of simulations and have it fire and bring back results across multiple machines that they have access to. It is a tool mainly aimed at taking the pain out of HPC by automating the time consuming tedious tasks associated with doing it at scale. |
| URL | http://www.hecbiosim.ac.uk/longbow |
| Title | Longbow v1.5.2 |
| Description | Longbow is an automated simulation submission and monitoring tool. Longbow is designed to reproduce the look and feel of using software on the users local computer with the difference that the heavy lifting is done by a supercomputer. Longbow will automatically generate the necessary submit files and handle all initial file transfer, monitor jobs, transfer files at configurable intervals and perform final file transfer and cleanup. Longbow can be used to launch one-off jobs, generate ensembles of similar jobs or even run many different jobs over many different supercomputers. Out of the box, Longbow is currently supporting the PBS/Torque, LSF, SGE, Slurm, SoGE schedulers and ships with application plugins for commonly used bio-molecular simulation softwares AMBER, CHARMM, GROMACS, LAMMPS, NAMD. Longbow is however highly configurable and will function normally with generic software without plugins, however plugins can easily be made to extend Longbow to fully support applications and schedulers that do not ship out of the box. Longbow is also available to developers of applications which require support for automating job submission. Longbow is available as a convenient and light-weight python API that can be integrated in a number of different way. |
| Type Of Technology | Software |
| Year Produced | 2019 |
| Open Source License? | Yes |
| Impact | Longbow has been downloaded by 8,695 times world wide by researchers doing biomolecular simulation. Longbow simplifies the transfer, preparation and submission of simulations across a wide range of HPC architectures. Users at the beginner end of the spectrum can quickly be up and running on any HPC resource within minutes without having to learn much about the process of doing it. Power users can leverage the power of Longbow in generating huge arrays of simulations and have it fire and bring back results across multiple machines that they have access to. It is a tool mainly aimed at taking the pain out of HPC by automating the time consuming tedious tasks associated with doing it at scale. |
| URL | https://www.hecbiosim.ac.uk/downloads |
| Title | MemProtMD methods for membrane self-assembly and insertion |
| Description | A methodology for inserting a membrane protein into a lipid bilayer using Google Colab Notebooks. |
| Type Of Technology | Webtool/Application |
| Year Produced | 2022 |
| Open Source License? | Yes |
| Impact | Regular usage within the research group and university teaching. |
| Title | OpenMMSLICER |
| Description | scientific software for sequential monte carlo simulations in openmm |
| Type Of Technology | Software |
| Year Produced | 2022 |
| Impact | This software has been further developed for simulated tempering and is being used for new research projects |
| URL | https://github.com/openmmslicer/openmmslicer |
| Title | ProtoCaller |
| Description | Software to automate the setup of free energy calculations |
| Type Of Technology | Software |
| Year Produced | 2020 |
| Open Source License? | Yes |
| Impact | non yet |
| URL | https://protocaller.readthedocs.io/en/latest/index.html |
| Title | agnesnoy/SerraLINE: SerraLINE v1.0 |
| Description | Version utilized in the paper: ALB Pyne, A Noy, K Main, V Velasco-Berrelleza, MM Piperakis, LA Mitchenall, FM Cugliandolo, JG Beton, CEM Stevenson, BW Hoogenboom, AD Bates, A Maxwell, SA Harris (2020). "Base-pair resolution analysis of the effect of supercoiling on DNA flexibility and recognition", Accepted in Nat Commun |
| Type Of Technology | Software |
| Year Produced | 2020 |
| Open Source License? | Yes |
| Impact | Publication |
| URL | https://zenodo.org/record/4105979 |
| Title | agnesnoy/SerraNA: SerraNA v1.0 |
| Description | By Victor Velasco-Berrelleza and Agnes Noy Published at: V Velasco-Berrelleza, M Burmann, JW Shepherd, MC Leake, R Golestanian, A Noy (2020). "SerraNA: a program to determine nucleic acids elasticity from simulation data" Phys Chem Chem Phys, 22, 19254-19266 https://doi.org/10.1039/D0CP02713H |
| Type Of Technology | Software |
| Year Produced | 2020 |
| Impact | Publication and use on subsequent research |
| URL | https://zenodo.org/record/4105987 |
| Title | agnesnoy/WrLINE: WrLINEv1.0 |
| Description | By Thana Sutthibutpong and Agnes Noy Published at: Sutthibutpong T, Harris SA, Noy A* (2015). "Comparison of molecular contours for measuring writhe in atomistic supercoiled DNA" J Chem Theor Comput, 11, 2768. DOI:10.1021/acs.jctc.5b00035 |
| Type Of Technology | Software |
| Year Produced | 2020 |
| Impact | Used in several publications and in current research |
| URL | https://zenodo.org/record/4106245 |
| Description | 6th Molecular Microbiology Meeting |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Other audiences |
| Results and Impact | I attended this conference and I gave a talk with the purpose of engaging with microbiologists on multidisciplinary exchange of knowledge |
| Year(s) Of Engagement Activity | 2019 |
| URL | https://conferences.ncl.ac.uk/molmicro2019/ |
| Description | Advanced technology to support research, innovation and economic growth in the UK |
| Form Of Engagement Activity | A formal working group, expert panel or dialogue |
| Part Of Official Scheme? | No |
| Geographic Reach | National |
| Primary Audience | Policymakers/politicians |
| Results and Impact | Reform hosted a policy roundtable on the opportunities for advanced technology in the UK in May 2019, with the kind support of Hewlett Packard Enterprise. The session was introduced by Chris Skidmore MP, then Minister of State for Universities, Science, Research and Innovation, and Professor Mark Parsons, Director at the Edinburgh Parallel Computing Centre. The Minister stressed the importance of international collaboration in R&D, reaffirming the UK's ambition to strengthen and enrich existing partnerships, as well as to develop new global partnerships - as outlined in the then recently announced International Research & Innovation Strategy. The discussion also focused on the opportunities the upcoming Comprehensive Spending Review would offer to the science, research and innovation sectors, and on the need to build a strong case for. investment in emerging technologies - including quantum technology, performance computing (HPC) and robotics. During this meeting it was highlighted that High-performance computing is considered a game-changing technology, which will be fundamental to the UK's ability to maintain its global competitiveness in the science, research and innovation sectors. |
| Year(s) Of Engagement Activity | 2019 |
| URL | https://reform.uk/research/advanced-technology-support-research-innovation-and-economic-growth-uk |
| Description | An introduction to Longbow |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | National |
| Primary Audience | Postgraduate students |
| Results and Impact | The HECBioSim CoSeC developer James Gebbie-Rayet gave a workshop on how to use Longbow with ARCHER as part of the CCPBioSim training week. The workshop was attended by 22 people and they learnt the basics of submitting simulations to ARCHER using our simulation automation tool. |
| Year(s) Of Engagement Activity | 2019 |
| URL | https://www.ccpbiosim.ac.uk/events/workshop-course-material/eventdetail/120/-/ccpbiosim-training-wee... |
| Description | Article in IFLScience |
| Form Of Engagement Activity | A magazine, newsletter or online publication |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Public/other audiences |
| Results and Impact | Article at the online magazine IFLScience focused in dessmination of scientific news. After 24 hours it had more than 1000 shares |
| Year(s) Of Engagement Activity | 2021 |
| URL | https://www.iflscience.com/editors-blog/highestresolution-images-of-dna-ever-reveal-how-it-dances/ |
| Description | Article in Phys.org |
| Form Of Engagement Activity | A magazine, newsletter or online publication |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Other audiences |
| Results and Impact | Article at the Phys.org magazine |
| Year(s) Of Engagement Activity | 2021 |
| URL | https://phys.org/news/2021-02-visualization-dna.html |
| Description | Article in the Daily Express |
| Form Of Engagement Activity | A press release, press conference or response to a media enquiry/interview |
| Part Of Official Scheme? | No |
| Geographic Reach | National |
| Primary Audience | Public/other audiences |
| Results and Impact | Article in the Daily Express on the article in Nature Communications 2021 |
| Year(s) Of Engagement Activity | 2021 |
| URL | https://www.express.co.uk/news/science/1398473/dancing-dna-highest-resolution-pictures-dna-molecules... |
| Description | Article in the Daily Mail |
| Form Of Engagement Activity | A press release, press conference or response to a media enquiry/interview |
| Part Of Official Scheme? | No |
| Geographic Reach | National |
| Primary Audience | Public/other audiences |
| Results and Impact | Article in the press on the Nature Communications 2021 |
| Year(s) Of Engagement Activity | 2021 |
| URL | https://www.dailymail.co.uk/sciencetech/article-9265951/Scientists-capture-highest-resolution-images... |
| Description | Banff International Research Station for Mathematical Innovation and Discovery: "The Topology of Nucleic Acids" |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Other audiences |
| Results and Impact | Invited talk that has been made accessible online |
| Year(s) Of Engagement Activity | 2019 |
| URL | https://www.birs.ca/events/2019/5-day-workshops/19w5226 |
| Description | CCPBioSim Training Week - An introduction to software and hardware for biomolecular simulation |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Postgraduate students |
| Results and Impact | This was a hands-on workshop in which around 200 delegates attended virtually from around the world. The workshop was a basic introduction into setting up their own workstation for biomolecular simulation. This was a look at virtual machines and the installation of the most common tools and utilities used for the setup, running and analysis of biomolecular simulation. |
| Year(s) Of Engagement Activity | 2020 |
| URL | https://www.ccpbiosim.ac.uk/events/workshop-course-material/eventdetail/127/-/training-week-2020 |
| Description | CCPBioSim Training Week - An introduction to software and hardware for biomolecular simulation |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Postgraduate students |
| Results and Impact | This activity was a talk on the basics of biomolecular simulation software and the hardware landscape available to scientists. It was attended by around 200 delegates and was pitched at postgraduate researchers within the first 6 months of their studies. It discussed the basic hardware differences and the considerations for running software in various ways on these hardware types. |
| Year(s) Of Engagement Activity | 2020 |
| URL | https://www.ccpbiosim.ac.uk/events/workshop-course-material/eventdetail/127/-/training-week-2020 |
| Description | Cardiff Scientific Society Lecture - public lecture online |
| Form Of Engagement Activity | Engagement focused website, blog or social media channel |
| Part Of Official Scheme? | No |
| Geographic Reach | Regional |
| Primary Audience | Public/other audiences |
| Results and Impact | 9 February 2022 Cardiff Scientific Society Lecture - public lecture online Title: Folding a Protein: Nature's Origami |
| Year(s) Of Engagement Activity | 2022 |
| Description | EUTOPIA-2: 2 nd meeting of the European Topology interdisciplinary Initiative |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Other audiences |
| Results and Impact | Invited talk and enroll at this EU-funded COST network |
| Year(s) Of Engagement Activity | 2019 |
| URL | https://eutopiaam2019.wordpress.com |
| Description | GPUs for biomolecular researchers: an overview of accelerated computing, plus a deep dive into GROMACS on GPU |
| Form Of Engagement Activity | A formal working group, expert panel or dialogue |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Postgraduate students |
| Results and Impact | This event was an industrial talk from NVidia on getting the best out of running biomolecular simulations on the GPU and a general talk about GPUs and how they are different from other architectures. HECBioSim members also sat on a discussion panel where information was discussed about what machines are available around the UK and how to get access to them. |
| Year(s) Of Engagement Activity | 2021 |
| URL | https://www.hecbiosim.ac.uk/events/training-talks |
| Description | Hamied Foundation UK-India Antimicrobial Resistance Meeting 2019 |
| Form Of Engagement Activity | A formal working group, expert panel or dialogue |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Professional Practitioners |
| Results and Impact | I was part of an expert panel of scientists and medics from the UK and India assembled to discuss how we can (a) combat existing antimicrobial resistance and (b) prevent or slow down the development of resistance in future. The chief medical officer of the UK was also in attendance and a gave talk. Collaborations between scientists from both countries were established and existing ones were strengthened. An emerging theme was that all levels must be addressed, from atoms and molecules through to patients, communities and the environment. I helped establish this theme. |
| Year(s) Of Engagement Activity | 2019 |
| URL | https://acmedsci.ac.uk/more/events/hamied-foundation-uk-india-antimicrobial-resistance-meeting-2019 |
| Description | Hands on demonstrations of Longbow at CCPBioSim annual conference |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Postgraduate students |
| Results and Impact | At the CCPBioSim annual conference we had a stand at the poster sessions and throughout the programme where we show cased Longbow and offered advice to PIs about how to apply for simulation time and to users of HPC advice on performance. This proved quite popular with lots of requests for more of this kind of participation in the future. |
| Year(s) Of Engagement Activity | 2019 |
| URL | https://www.ccpbiosim.ac.uk/events/past-conferences/eventdetail/119/-/7th-annual-ccpbiosim-conferenc... |
| Description | Interview in That's York TV |
| Form Of Engagement Activity | A press release, press conference or response to a media enquiry/interview |
| Part Of Official Scheme? | No |
| Geographic Reach | Regional |
| Primary Audience | Public/other audiences |
| Results and Impact | Interview for the local TV |
| Year(s) Of Engagement Activity | 2021 |
| URL | https://www.facebook.com/ThatsTVYork/ |
| Description | Linacre Lecture, University of Cambridge, in person. Title: Folding a Protein: Nature's Origami, 15 February 2022 |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | National |
| Primary Audience | Other audiences |
| Results and Impact | Linacre Lecture, Cambridge, for undergraduates, postgraduates, postdocs, PIs and for outreach and general interest |
| Year(s) Of Engagement Activity | 2022 |
| Description | Pint of Science |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | Local |
| Primary Audience | Public/other audiences |
| Results and Impact | Outreach talk for general public about current research in genomes and how affects our lives. |
| Year(s) Of Engagement Activity | 2019 |
| URL | https://pintofscience.co.uk/ |
| Description | Press release by the University of York |
| Form Of Engagement Activity | A press release, press conference or response to a media enquiry/interview |
| Part Of Official Scheme? | No |
| Geographic Reach | National |
| Primary Audience | Media (as a channel to the public) |
| Results and Impact | Press Release on the article on Nature Communications 2021 |
| Year(s) Of Engagement Activity | 2021 |
| URL | https://www.york.ac.uk/news-and-events/news/2021/research/dna-dancing-video/ |
| Description | Queen's Computational Biology Club Symposium. Almudena Pino-Angeles |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | Local |
| Primary Audience | Postgraduate students |
| Results and Impact | Poster: Tracing the allosteric binding pathways to the cell membrane-GPCR interface. Discussions with bioinformaticians and appreciation of the application of AI in GPCR research. |
| Year(s) Of Engagement Activity | 2019 |
| Description | Seeing is Believing: Build a DIY microscope!, Discovery Zone at York Festival of Ideas, UK |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | Local |
| Primary Audience | Public/other audiences |
| Results and Impact | Family friendly stand at the Science Fair 'Discovery Zone' at York Festival of Ideas, UK |
| Year(s) Of Engagement Activity | 2023 |
| URL | https://yorkfestivalofideas.com/2023/calendar/discovery-zone/ |
| Description | Talk in the Physics of Life - Physics of Medicine Network Launch Event |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Other audiences |
| Results and Impact | Launch event of the third network of Physics of Life funded by the three UKRI councils EPSRC, MRC, BBSRC, Rosetrees Trust and the Universities of York, Durham and Leeds |
| Year(s) Of Engagement Activity | 2020 |
| URL | http://www.physicsoflife.org.uk |
| Description | Training workshop |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | National |
| Primary Audience | Postgraduate students |
| Results and Impact | 12 early career researchers (mainly a mixture of PhD students and PDRAs) attended a 1-day workshop in Leeds organised by CCPBioSim/HECBioSim to learn how to use the computational workflows package "Crossflow" that was developed during this grant. The objective was to raise awareness of the software and promote its usage on the national supercomputing service ARCHER2. |
| Year(s) Of Engagement Activity | 2023 |