A platform for future development and application of the ONETEP software

Lead Research Organisation: Imperial College London
Department Name: Materials

Abstract

Computer simulations play a growing role in our society e.g. flight simulators allow pilots to be trained more cheaply and safely than in the air. In science and technology, computer simulation is a powerful tool for understanding or even predicting complex processes in real materials. Simulations are often used alongside conventional experiments, but they can also be used in situations where experiments would be too expensive or even impossible to perform e.g. when studying materials in extreme conditions such as the high temperatures and pressures found in the Earth's core.

The turn of the last century saw the start of a scientific revolution with the discovery of quantum mechanics (QM), a theory that describes the world on the atomic scale with astonishing accuracy, and thus provides the foundation for all of low-energy physics, chemistry and biology. In principle at least, quantum mechanics underlies the microelectronics, chemical and pharamaceutical industries upon which our society relies today. The challenge is that the equations of QM are very complicated. Even on the fastest computers it is only possible to solve them exactly for small molecules, whereas the systems of interest to scientists today contain many thousands. Even the rapid and relentless progress of computer technology cannot overcome this because of the scaling of the problem.

The work needed to complete a task usually increases with its size e.g. the time taken to mow a lawn is proportional to its area: double the size of the garden and it takes twice as long. This is an example of linear scaling, but the effort to do many tasks increases more rapidly. Arranging a hand in a game of cards usually scales as the square of the number of objects involved: triple the number of cards and it takes nine (three squared) times as long. Some are even worse e.g. solving the travelling salesman problem to find the quickest route which visits a given set of locations. Adding one extra location doubles the amount of time to solve the problem. If three locations can be done in one minute, four will take two minutes, and five will take four minutes. Just 22 will take a whole year! Solving QM exactly scales like this when increasing the number of atoms being simulated.

However in the 1960s a leap forward was made with the discovery of density-functional theory (DFT), for which the Nobel Prize was awarded in 1998. The remarkable result of DFT is that the physical properties of the whole system (the answers to the questions we want to ask) can in principle be calculated in a time that scales linearly with the number of atoms. The research proposed here relates to one of the leading pieces of software for performing linear-scaling DFT calculations in the world. This ONETEP code has been demonstrated on systems containing up to 30,000 atoms so far. This method will expand the scope and scale of QM simulations across a wide range of fields. The problems we intend to address using this Platform grant include:
- giving insight into the design of new drugs by simulating the interactions between proteins
- designing new materials for solar energy conversion and storage
- studying the properties of nanoparticles as catalysts for chemical reactions
- understanding the microscopic processes associated with friction that cause machinery to wear out

Our vision is to create a virtual laboratory which uses ONETEP as one of a family of techniques to simulate the results of real experiments e.g. how a material absorbs light. The virtual lab gives total control: you can change the arrangement of atoms in a material or molecule and see the effect. The virtual lab will never replace the real one, but it promises to be a powerful tool alongside it.

Planned Impact

In 2004, Accelrys adopted the ONETEP software as the flagship for a new Nanotechnology Consortium aimed at industrial and government partners, that ultimately attracted over 30 members worldwide. Accelrys is a leading scientific research and development software and service company. It is listed on the NASDAQ Global Select market and based in San Diego with European headquarters in Cambridge. It has 1,300 customers including 29 of the top 30 pharma/biotech companies, 4 of the top 5 chemicals companies and the top 5 aerospace companies. In particular, it has an established position in the field of materials modelling and simulation currently based around the Materials Studio application.

Members of the ONETEP Developers' Group (ODG - at that time consisting of the four applicants) contributed to training events across Europe and the US and a new staff position at Accelrys was created to develop the interface of consortium software with Materials Studio. In 2008, after the end of the first phase of the consortium, ONETEP was launched as a new product within Materials Studio. Total revenue from ONETEP to date exceeds £1M from over 200 organisations worldwide.

The licensing of intellectual property (IP) was handled by Cambridge Enterprise. As a result of these IP transfer agreements, new functionality developed outside the ODG has been secured and is being made available to customers of Accelrys through the Materials Studio interface. This pipeline will continue to be used to an increasing extent as the Platform (i) enables the team to expand and (ii) expedites the delivery of new code by maintaining the support of key PDRAs. Thus the knowledge associated with new simulation techniques developed by the ONETEP team for their own academic purposes will be efficiently converted into new tools readily available to ONETEP users across the industrial and government sectors. As the importance of materials simulation for research and development grows within industry, so the development of new methodologies with expanded capabilities such as ONETEP will contribute to the UK economy through the creation of jobs and improved products.

The ODG has direct collaborations with the pharmaceutical company Boehringer Ingelheim and the chemicals company Johnson Matthey, both of which have sponsored CASE studentships at Southampton. At Imperial there are links with the energy company E.ON. We intend to keep working to increase the number of industrial collaborators.

Members of the ODG will continue to work with Accelrys to provide training for customers. The Platform will benefit all users of the code by providing for the maintenance and enhancement by the key PDRAs of online support at the ONETEP Wiki (www.onetep.org). During the Platform we plan to extend a new paradigm for training to industrial and government users of the code through Accelrys, based on the format of the ONETEP Master Class held in Cambridge in July 2011.

Finally, the training of individuals working within the ONETEP team will make a significant impact. PhD students and PDRAs at the three institutions receive a rigorous training in computational science, numerical methods and parallel programming, all within a collaborative environment characterised by an ethos that embraces best practice in software development. Although most of the researchers leaving the team to date have continued in academia, at least in the short term, two recently-graduated PhD students have gone on to use their skills in employment at MathWorks , the developers of the MATLAB software, and another is now working for Arqiva developing software for simulations of wireless audio and video broadcasting. As the Platform fuels the growth of the team, so the number of highly-trained individuals with combined skills in computational science and software development will increase, helping to fill the skills gap identified by the numerous international reports cited in the case for support.

Publications

10 25 50
 
Description The ONETEP code aims to enable the simulation of materials at the level of the behaviour of electrons so that insight into processes at this scale can be gained and the properties of new materials predicted. The task of solving the quantum-mechanical equations is formidable, and this project has enabled high performance computers with thousands of processors to be used efficiently to simulate tens of thousands of atoms. This enables much more realistic simulations to be performed.
This Platform Grant underpins the development of new functionality within the code and its application to a wide variety of systems including nanomaterials and biological macromolecules.
Exploitation Route The ONETEP code has been licensed to Accelrys (now BIOVIA, part of Dassault Systèmes) for commercial distribution as part of the Materials Studio suite of materials modelling software. Over 200 organisations have currently purchased licenses worldwide to use the code in their own research and development. Academic licenses are also available from Cambridge Enterprise Ltd.
Sectors Aerospace

Defence and Marine

Chemicals

Digital/Communication/Information Technologies (including Software)

Electronics

Energy

Transport

URL http://www.onetep.org/
 
Description BIOVIA, part of Dassault Systèmes (formerly Accelrys) is a leading scientific research and development software and service company. It has 1,300 customers including 29 of the top 30 pharma/biotech companies, 4 of the top 5 chemicals companies and the top 5 aerospace companies. In particular, it has an established position in the field of materials modelling and simulation currently based around the Materials Studio application. This provides a graphical interface to a wide variety of simulation
First Year Of Impact 2004
Sector Aerospace, Defence and Marine,Chemicals,Digital/Communication/Information Technologies (including Software),Electronics,Energy,Transport
Impact Types Economic

 
Description Computational Science and Engineering: Software Flagship Project Call
Amount £609,469 (GBP)
Funding ID EP/P02209X/1 
Organisation Engineering and Physical Sciences Research Council (EPSRC) 
Sector Public
Country United Kingdom
Start 04/2017 
End 10/2019
 
Title Input data for ONETEP calculations in "Linear-scaling time-dependent density-functional theory beyond the Tamm-Dancoff approximation: obtaining efficiency and accuracy with in situ optimised local orbitals" 
Description The file contains all input files used to generate the data in the publication "Linear-scaling time-dependent density-functional theory beyond the Tamm-Dancoff approximation: obtaining efficiency and accuracy with in situ optimised local orbitals" submitted to the Journal of Chemical Physics (JCP). The folder azobenzene/ contains input data necessary to reproduce Fig. 1 and Table 1. The folder bacteriochlorophyll/ contains data necessary to reproduce Fig. 2, Fig. 3, Fig 4 and Fig. 5. The folder linear_scaling_test contains data necessary to reproduce Fig. 6 and Fig 7. Furthermore, the folder bacteriochlorophyll/ also contains the raw excitation energy and oscillator strength data necessary to generate Fig. 4 and Fig 5. All calculations were performed using ONETEP Version 4.1.12.7 
Type Of Material Database/Collection of data 
Year Produced 2015 
Provided To Others? Yes  
URL https://www.repository.cam.ac.uk/handle/1810/251293
 
Title Input files and raw data for publication publication "First-principles treatment of solvent effects on electronic excitations in alizarin" 
Description This file was created by Tim J. Zuehlsdorff on the 15th of October 2015. It contains all input files necessary to reproduce the results of the publication "First-principles treatment of solvent effects on electronic excitations in alizarin". The data in this file is organised as follows. The folders alizarin_frame1 to alizarin_frame5 contain the input files associated with the MD snapshots that are referred to as frame 1 to frame 5 in the publication. The subfolders labelled NWChem contain the NWChem input files for the vacuum, the implicit solvent and the 4 angstrom calculation. All ONETEP input files are labelled with a .dat file label. Note that the ONETEP calculations making use of implicit solvation were all performed in several stages. First a calculation in vacuum and open BC was performed. Then the resulting density was used to define a cavitiy for the dielectric medium felt by the system and a ground state, conduction optimisation and LR_TDDFT calculation was perfomed sequentially. Please refer to the user manuals on the ONETEP website (www.onetep.org) for further details regarding how to perform these calculations. For frame 1, frame 3 and frame 5, the atoms in the 8ang, 10ang and 12ang .dat files are labelled in such a way that waters to within 7 ang are denoted with a H1 and O1 label, while in the 6 ang .dat file, waters within the 4 ang region are denoted in the same way. This labelling enables the performance of TDDFT calculaions using a truncated density matrix in order to reproduce Fig. 5 (please again refer to the manuals on the ONETEP website for further details). The frame5 folder contains 2 additional files where atoms beyond the 7ang and 4ang region are replaced by cassical charges. These files can be used to obtain the black data points in the lower part of figure 5. The folder raw_data/ contains all the raw data used to generate the plots in this work. The NWChem calculations performed in this work were carried out unsing version 6.3, while all ONETEP calculations were performed using version 4.1.12.7 
Type Of Material Database/Collection of data 
Year Produced 2015 
Provided To Others? Yes  
URL https://www.repository.cam.ac.uk/handle/1810/252337
 
Title Research data supporting "Predicting solvatochromic shifts and colours of a solvated organic dye: The example of Nile Red" 
Description This file was created by Tim J. Zuehlsdorff on the 17th of October 2016. It contains all input files necessary to reproduce the results of the publication "Predicting solvatochromic shifts and colours of a solvated organic dye: The example of Nile Red". The data in this file is organised as follows. The folder NWChem contains input files to perform the scan over the S1 potential energy surface under rotation around the C-N bond of nile red in vacuum, both with PBE and with CAM-B3LYP. The PBE results also generate the ground state potential energy surface that is used for the parameterisation of the AMBER forcefield. The folder experimental_results contains both the raw data plotted in Figure 2 of the publication, as well as the raw data collected from 20 different solvents used to parameterise Eqn. 1 of the publication. The folders toluene, acetone, ethanol, hexane, respectively contain all ONETEP input files for all snapshots considered for the appropriate solvents. The ones labelled "implicit" are the ones that have the solvent environment stripped away and are calculated in the implicit solvent only. The folder raw_spectra contains the raw data that is used to produce all spectra in the publication and is created from the converged results from all ONETEP input files. All ONETEP input files are labelled with a .dat file label. Note that the ONETEP calculations making use of implicit solvation were all performed in several stages. First a calculation in vacuum and open BC was performed. Then the resulting density was used to define a cavitiy for the dielectric medium felt by the system and a ground state, conduction optimisation and LR_TDDFT calculation was perfomed sequentially. Please refer to the user manuals on the ONETEP website (www.onetep.org) for further details regarding how to perform these calculations. All NWChem input files were run using version 6.3. All ONETEP calculations were run using the ONETEP developers version 4.5.0.3. 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://www.repository.cam.ac.uk/handle/1810/260801
 
Title Research data supporting 'Simulation of electron energy loss spectra of nanomaterials with linear-scaling density functional theory'. 
Description Input and output files for the simulations performed for the publication "Simulation of electron energy loss spectra of nanomaterials with linear-scaling density functional theory" Several software packages were used: onetep (linear Scaling DFT) http://www.onetep.org/ castep (conventional plane wave DFT) http://www.castep.org/ ELK (all electron DFT) http://elk.sourceforge.net/ OptaDoS (computing spectra) http://www.optados.org/ Input files for all simulations are included, as are relevant textual and binary outputs. The data to reproduce the experimental lines is not included, but may be extracted from the publications cited in the paper. Viewers may find the .elnes_bin and .odi files of particular interest as these may be used (together with OptaDoS) to compute new EEL spectra for the atoms studied using different broadening parameters or in polarised geometries (i.e. not isotropically averaged). 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
URL https://www.repository.cam.ac.uk/handle/1810/253717
 
Description Accelrys (now Dassault Systèmes BIOVIA) 
Organisation Accelrys
Country United Kingdom 
Sector Private 
PI Contribution The ONETEP Developers Group works closely with our partners at Dassault Systèmes BIOVIA (formerly Accelrys) to make sure the latest research developments in the software are exposed to the industrial, government lab and academic users of the commercial code, in order to maximise the impact of these developments.
Collaborator Contribution BIOVIA provides the resources required to ensure a continued support of the new developments into commercial quality software environments. In particular, BIOVIA has committed to devoting the equivalent of roughly one full time person to the integration, bug fixing, quality control, scientific support and marketing of ONETEP over the duration of this award at a value in the region of £100,000 per year. BIOVIA also provides each member of the ONETEP Developers Group with the whole of the Materials Studio(TM) software suite with an academic price in the region of £70,000.
Impact Total commercial revenue from ONETEP to date exceeds £4.5M from over 200 organisations worldwide.
Start Year 2012
 
Title ONETEP software 
Description ONETEP is one of the leading codes of its kind for large-scale first-principles quantum mechanical simulations of materials. It is described in further detail on the ONETEP website at www.onetep.org 
IP Reference  
Protection Copyrighted (e.g. software)
Year Protection Granted
Licensed Yes
Impact ONETEP is unique in being sold commercially: in 2004 it was adopted by Accelrys (now Dassault Systemes BIOVIA), a leading scientific software company, as the flagship for a new international Nanotechnology Consortium of mainly industrial and government partners, leading to its launch as a new product within the Materials Studio suite of software in 2008. An inexpensive academic license is also available worldwide direct from Cambridge Enterprise Ltd. Total revenue from ONETEP to date exceeds $4.5M from over 200 organisations worldwide.
 
Title ONETEP linear-scaling DFT code 
Description Linear-scaling density-functional theory code for understanding and predicting the properties of materials from first-principles quantum mechanics. 
Type Of Technology Software 
Year Produced 2016 
Impact ONETEP is continually developed and new, updated versions are released on an annual basis. The developments associated with this grant were released during the period of the grant, between 2009 and 2013. It is one of the leading codes of its kind in the world and unique in being sold commercially: in 2004 it was adopted by Accelrys (now Dassault Systèmes BIOVIA), a leading scientific software company, as the flagship for a new international Nanotechnology Consortium of mainly industrial and government partners, leading to its launch as a new product within the Materials Studio suite of software in 2008. An inexpensive academic license is also available worldwide direct from Cambridge Enterprise Ltd. Total revenue from ONETEP to date exceeds $4.5M from over 200 organisations worldwide. 
URL http://www.onetep.org/