York: Transforming Research-Oriented Software Engineering

Lead Research Organisation: University of York
Department Name: Physics

Abstract

Research Software Engineering (RSE) is the creation of well-designed, reliable, efficient computer programs to solve research problems. In my Fellowship I will focus on: RSE in materials modelling, a large research field with important industrial applications which affect our everyday lives; plasma and fluid modelling, which has many applications to clean energy-generation as well as industrial & medical processes; and on promoting and developing RSE skills in the research community.

The core software development is focused on CASTEP, a state-of-the-art implementation of density functional theory (DFT) for materials modelling, and BOUT++, a plasma fluid code. These programs are world-leading exemplars for UK RSE. Both codes were designed from the ground up using sound RSE principles, and are free to all UK academics. CASTEP's ease of use has drawn users from across the STEM disciplines; it is used by over 900 academic & industrial groups worldwide and cited over 9200 times in the scientific literature. I will transform CASTEP's ease of use by non-computational scientists, ensuring quick, accurate and reliable simulations, and reduce the time-to-science for all users. I will further enable CASTEP to become the software foundation for new, higher-level computational methods, including multiscale modelling, rare-event sampling & high-throughput materials discovery. This will strengthen UK science right across materials research, and ultimately lead to better materials for everyone.

The developments in BOUT++ will expand its field of applicability to allow both advanced new plasma physics and geometries, and to enable it to solve equations from other fields of science. This will both empower plasma scientists to model sophisticated new designs for fusion reactors, and open BOUT++ up to whole new scientific communities; as an example, UoY will shortly start a pilot project with Nestle to use BOUT++ to model bubbles in chocolate.

This Fellowship also includes development of new software to tackle current research problems, not only in York but also in industry and at the UK's world-leading experimental facilities Diamond Light Source, ISIS Neutron Facility and SuperSTEM. Initial projects include RSE to aid modelling chemical synthesis, predicting core-loss spectra and crystal & magnetic structure prediction, with further projects to be sought and delivered within the Fellowship period. These software services will promote RSE across a diverse range of STEM, increase the effectiveness and impact of a wide variety of research initiatives, and address directly many of EPSRC's Grand Challenges in Physics, Engineering and Chemical Science.

The final component of this Fellowship is to train, support and inspire the next generation of research software engineers. I will develop new training material, to be delivered in York but disseminated online to the wider community; create support groups within York, and link up with neighbouring institutions; and work with national scientific consortia (including HECs and CCPs) to promote and support RSE nationally. The people this supports are the future of this vital field, and will be invaluable to research in the UK, as well as the wider world. It is they who will ensure that the skills and experiences gained by researchers on the core development projects are transferred into the wider community.

In addition to these specific RSE components, I will also raise the profile and recognition of RSE workers and skills in the UK and abroad. I will champion RSE particularly in the materials modelling domain, promoting RSE to academia and industry through high-profile showcases, conferences, workshops and targeted, ongoing collaborations. The strong RSE Group I will build at the University of York will extend RSE provision and skills training to all researchers at York, promoting Research Software Excellence in all disciplines.

Planned Impact

Economic & societal impacts will come from the development and wider uptake of state-of-the-art software that this Fellowship facilitates, the science that its use enables, and the highly-skilled RSEs who will be trained.

The first strand of the Fellowship is core development of CASTEP and BOUT++. CASTEP is an ab initio materials modelling code and a powerful enabler of materials design, whose capabilities and potential for impact is demonstrated by high-profile programmes like the Materials Genome Initiative in the US [1] or by the European Commission [2], and their widespread use in electronic & manufacturing industries [2-3]. A general problem with ab initio codes is the specialist knowledge required for their efficient and accurate use, but this will be removed by PJH in this project, opening CASTEP up to non-computational scientists, enabling them to make predictions to guide experimental design and interpret observations. This is particularly useful for partners at Diamond Light Source, SuperSTEM and ISIS Neutron facility. It will also enable far larger simulations, bringing new capabilities such as "whole-device modelling".

BOUT++ is a world-leading plasma fluid code, but will be transformed in this Fellowship into a general partial differential equation solver, capable of handling multiple physics models. This simultaneously allows simulations of complex geometries, such as advanced diverters, and opens BOUT++ up to applications in new areas of science such as bacterial motion or wound healing.

The potential economic & societal impact is difficult to overstate. There are over 3.5 billion mobile phones and 1 billion PCs worldwide, with total energy consumption projected to triple by 2030, exceeding the present combined residential electricity use of the US and Japan [4]. "Materials design" and whole-device modelling can revolutionise the capabilities and energy efficiency of these devices, and new fusion reactor geometries could provide a breakthrough in clean, renewable energy production.

These developments will be distributed to UK academics free-of-charge. In the case of CASTEP, BIOVIA market it worldwide and provide an easy-to-use CASTEP GUI, plus training & support. Customers include many UK and global companies who use materials modelling in R&D. The new developments will enable many more industrial users, outside the traditional confines of expert "DFT modellers". The direct economic impact of CASTEP's industrial use is evident from the Return on Investment of 300-1000% in the physical and life sciences sectors[5][6], and the 260 patents citing Castep simulations[7].

The software services which form the second strand of this Fellowship will empower researchers to tackle high-impact EPSRC "Grand Challenges" in science. The additional emphasis on national experimental facilities will lead to new modelling capabilities, new user communities, and will maximise the impact of both this work and that of the facilities. The new software will be available freely to all UK academics, along with training material.

The third strand of this project is to train the next generation of RSEs and teach RSE skills to researchers. This will lead to a step-change in the quality of research software produced at York and beyond, and help tackle the "reproducibility crisis" in science. This also addresses directly the "skills gap" in UK industry and enables the impact of this work to continue to grow long after the Fellowship period.

1. G. Ceder & K. Persson, Materials Science, The stuff of dreams, Scientific American, 36-40 (2013).
2. European Commission, "ICT Infrastructures for e-science" Brussels, 2009.
3. http://www.enterprise.cam.ac.uk/news/2013/1/castep-achieves-30-million-sales/
4. Gadgets and Gigawatts - Policies for Energy Efficient Electronics, 2009.
5. M. Swenson et al, IDC white paper, July 2004.
6. A.S. Louie et al, Health Industry Insights, Jan. 2007
7. Google Patent Search (21/8/2017)

Publications

10 25 50
 
Description The work has improved the speed and memory usage of key UK materials simulation software, and also led to greater understanding of the uncertainties and errors in the simulation results, and how they depend on the users' choice of input parameters. This work has led to significant improvements materials modelling software, both directly in the CASTEP materials modelling software (www.castep.org) and through collaborations with other software development teams (e.g. CP2K, CRYSTAL, ONETEP). Over the course of this project, CASTEP has also been released at no cost to the worldwide academic community.

In addition to the core software development work in CASTEP, important contributions have been made to other software (e.g. ShengBTE) and new bespoke software has been created, including for "outreach" to schools, undergraduates and the general public.
Exploitation Route The main software advances have been implemented in CASTEP, and released worldwide via a cost-free academic source code licence, and a paid-for commercial licence (via BIOVIA, a subsidiary of Dassault Systemes). This means that the work is already being taken forward by well over 1000 academic and industrial research groups worldwide, who have licenced CASTEP.
Sectors Aerospace, Defence and Marine,Chemicals,Electronics,Energy,Manufacturing, including Industrial Biotechology,Pharmaceuticals and Medical Biotechnology

URL http://www.castep.org
 
Description The work developed in the CASTEP computer program has been merged frequently with the main codebase, continuing with the latest release (23.1). CASTEP is available at no cost for non-commercial use worldwide, and marketed commercially worldwide by Dassault Systemes. CASTEP is used by around 1000 companies and academic groups worldwide, and is cited in support of over 250 patents.
First Year Of Impact 2018
Sector Aerospace, Defence and Marine,Agriculture, Food and Drink,Chemicals,Digital/Communication/Information Technologies (including Software),Education,Electronics,Energy,Environment,Manufacturing, including Industrial Biotechology,Culture, Heritage, Museums and Collections,Pharmaceuticals and Medical Biotechnology,Security and Diplomacy,Transport
Impact Types Economic

 
Description CASTEP-USER: Predictive Materials Modelling For Experimental Scientists
Amount £541,320 (GBP)
Funding ID EP/W030438/1 
Organisation Engineering and Physical Sciences Research Council (EPSRC) 
Sector Public
Country United Kingdom
Start 04/2022 
End 03/2025
 
Description Materials and Molecular Modelling (MMM) Exascale Design and Development Working Group (DDWG)
Amount £53,357 (GBP)
Funding ID EP/V001256/1 
Organisation Engineering and Physical Sciences Research Council (EPSRC) 
Sector Public
Country United Kingdom
Start 05/2020 
End 07/2021
 
Description The UK Car-Parrinello HEC Consortium
Amount £563,229 (GBP)
Funding ID EP/X035891/1 
Organisation Engineering and Physical Sciences Research Council (EPSRC) 
Sector Public
Country United Kingdom
Start 01/2023 
End 12/2026
 
Description NVIDIA 
Organisation NVIDIA
Country Global 
Sector Private 
PI Contribution Developing a GPU port of the CASTEP code
Collaborator Contribution 2 GPU-based compute cards, compilers, access to NVIDIA's development cluster and expert engineering advice
Impact On-going work
Start Year 2018
 
Title Atomistic simulation Python API 
Description A generic API to provide an abstract interface to atomistic simulation programs, without needing to know the implementation details of the simulation programs themselves. This enables high-level software frameworks and workflows to switch easily between different simulation packages (for example, to compute forces); similarly, developers of simulation software implement this API in their programs, and immediately have their software be usable in the high-level tools. 
Type Of Technology Software 
Year Produced 2021 
Open Source License? Yes  
Impact An initial implementation of this API has been developed in CASTEP, which has already been used to develop advanced transition-state search methods. 
URL https://bitbucket.org/byornski/dft-python-api/
 
Title CASTEP 19 
Description CASTEP is a software package for predictive, quantum-mechanical simulations of materials and chemicals. It is based on density functional theory, and can simulate a wide range of materials proprieties including energetics, the structure at the atomic level, vibrational properties and electronic response properties. In particular it has a wide range of spectroscopic features that link directly to experiment, such as infra-red and Raman spectroscopies, NMR, and core level spectra. CASTEP version 19 included many enhancements and optimisations, including improved performance and the ability to compute electron-phonon coupling (important for electronic transport modelling). 
Type Of Technology Software 
Year Produced 2018 
Impact CASTEP is used by around 1000 companies and research groups around the world. The key papers describing CASTEP are cited over 1000/year and CASTEP is cited in support of over 250 patents. CASTEP is available under a free-of-charge licence to UK academia, and marketed commercially worldwide by Dassault Systemes. The enhancements to CASTEP 19 will be particularly useful for industries in the Semiconductor and Energy sectors. 
URL http://www.castep.org
 
Title CASTEP 20 
Description CASTEP is a software package for predictive, quantum-mechanical simulations of materials and chemicals. It is based on density functional theory, and can simulate a wide range of materials proprieties including energetics, the structure at the atomic level, vibrational properties and electronic response properties. In particular it has a wide range of spectroscopic features that link directly to experiment, such as infra-red and Raman spectroscopies, NMR, and core level spectra. CASTEP version 20 included many enhancements and optimisations, including improvements to spin-orbit coupling simulations. A range of interface codes are also included, the most recent of which allow calculations of electronic and thermal transport. 
Type Of Technology Software 
Year Produced 2019 
Impact CASTEP is used by around 1000 companies and research groups around the world. The key papers describing CASTEP are cited over 1000/year and CASTEP is cited in support of over 250 patents. CASTEP is available under a free-of-charge licence to UK academia, and marketed commercially worldwide by Dassault Systemes. The enhancements to CASTEP 20 will be particularly useful for industries in the Semiconductor sectors, with the new interface codes especially relevant to thermoelectric applications. 
URL http://www.castep.org
 
Title CASTEP 21 
Description CASTEP is a software package for predictive, quantum-mechanical simulations of materials and chemicals. It is based on density functional theory, and can simulate a wide range of materials proprieties including energetics, the structure at the atomic level, vibrational properties and electronic response properties. In particular it has a wide range of spectroscopic features that link directly to experiment, such as infra-red and Raman spectroscopies, NMR, and core level spectra. The enhancements to CASTEP 21 are particularly useful for modelling heavy elements, especially where spin-orbit coupling or non-collinear magnetism are important. 
Type Of Technology Software 
Year Produced 2020 
Impact CASTEP is used by around 1000 companies and research groups around the world. The key papers describing CASTEP are cited over 1000/year and CASTEP is cited in support of over 250 patents. CASTEP is available under a free-of-charge licence to academia worldwide, and marketed commercially worldwide by Dassault Systemes. 
URL http://www.castep.org
 
Title CASTEP 22 
Description CASTEP is a software package for predictive, quantum-mechanical simulations of materials and chemicals. It is based on density functional theory, and can simulate a wide range of materials proprieties including energetics, the structure at the atomic level, vibrational properties and electronic response properties. In particular, it has a wide range of spectroscopic features that link directly to experiment, such as infra-red and Raman spectroscopies, NMR, and core level spectra. CASTEP version 22 included a top-level Python layer to enable CASTEP to be embedded within other computational workflows, for example transition-state searches or multiscale modelling. 
Type Of Technology Software 
Year Produced 2021 
Impact CASTEP is used by around 1000 companies and research groups around the world. The key papers describing CASTEP are cited over 1000/year and CASTEP is cited in support of over 250 patents. CASTEP is available under a free-of-charge licence to academia worldwide, and marketed commercially worldwide by Dassault Systemes. 
URL http://www.castep.org
 
Title CASTEP 23 
Description CASTEP is a software package for predictive, quantum-mechanical simulations of materials and chemicals. It is based on density functional theory, and can simulate a wide range of materials proprieties including energetics, the structure at the atomic level, vibrational properties and electronic response properties. In particular, it has a wide range of spectroscopic features that link directly to experiment, such as infra-red and Raman spectroscopies, NMR, and core level spectra. CASTEP version 22 included a top-level Python layer to enable CASTEP to be embedded within other computational workflows, for example transition-state searches or multiscale modelling. 
Type Of Technology Software 
Year Produced 2022 
Impact CASTEP is used by around 1000 companies and research groups around the world. The key papers describing CASTEP are cited over 1100/year and CASTEP is cited in support of over 250 patents. CASTEP is available under a free-of-charge source-code licence to academia worldwide, and marketed commercially worldwide by Dassault Systemes. 
URL http://www.castep.org
 
Description BIOVIA e-seminar 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Industry/Business
Results and Impact Invited seminar series for BIOVIA (part of Dassault Systemes), for expert knowledge exchange, training, dissemination of best practice and to raise the profile of current work. There was an interesting Q&A after each seminar, and several emails following-up discussion points.
Year(s) Of Engagement Activity 2022
URL https://events.3ds.com/accelerating-castep-modern-computers
 
Description CASTEP Advanced Developer Training Workshop 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Advanced training in software development "best practice" in the context of the CASTEP computer program. There were twelve attendees comprising five different nationalities, and by the end of the five-day course they had all worked on at least one development project within CASTEP.
Year(s) Of Engagement Activity 2019
URL http://www.castep.org/CASTEP/DeveloperWorkshop2019
 
Description CASTEP Community Conference 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact The CASTEP Community Conference (formerly the CASTEP User Workshop) is an event specifically to support the international community of CASTEP users and developers. Typically over 50 people come from around the world, with a high-profile plenary address from an internationally-recognised scientist and several invited talks, plus a range of talks contributed directly by members of the community. Everyone is encouraged to bring a poster on a recent CASTEP-related project, and the friendly, collaborative atmosphere there promotes lively, informal debate.
Year(s) Of Engagement Activity 2019
URL http://www.castep.org/Main/CASTEPCommunityConference2020
 
Description CASTEP User Training Workshop 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Around 60 people attend this week-long workshop, learning how to use the CASTEP computer program to understand, explain and predict the behaviour of materials. By the end of the workshop all attendees have performed advanced simulations such as NMR chemical shifts, EELS or Raman spectra.
Year(s) Of Engagement Activity 2018,2019,2022
URL http://www.castep.org/CASTEP/Workshop2022
 
Description CASTEP User Training for ScotCHEM 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Postgraduate students
Results and Impact Around 20 postgraduate students, postdoctoral researchers and academics from the ScotCHEM (Scottish chemistry collaboration) network attended this one-day training workshop. By the end of the day all attendees had run CASTEP successfully to model a simple chemical system, and investigated at least one of the more sophisticated modelling methods.
Year(s) Of Engagement Activity 2019
URL https://sites.google.com/view/scotchem2019/home
 
Description CASTEP training for QM-CDT (St Andrews) 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Postgraduate students
Results and Impact An online two-day workshop organised for the Quantum Matter CDT in St Andrews. I was one of two speakers, and together we presented the theory and practical aspects of first-principles materials modelling (using CASTEP) to an audience of postgraduate students, postdoctoral researchers and academics (all from St Andrews). There were many excellent questions and we had a good discussion about materials modelling and the challenges in interpreting experimental results (especially solid-state NMR).
Year(s) Of Engagement Activity 2020
 
Description Faith and Science Discussion 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Public/other audiences
Results and Impact Two ~2 hour discussions, organised by a local ecumenical group (Easingwold churches), about the nature and role of "Faith" in science and the scientific method. In both cases my initial presentation piqued people's interest and sparked a lively, wide-ranging debate. Initially only a single event was planned, but there was such an enthusiastic response at that event that the organisers arranged a second follow-on event.
Year(s) Of Engagement Activity 2022,2023
 
Description HPC Autumn Academy 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact The HPC Autumn Academy takes place in Cambridge, and typically draws 20-30 attendees (primarily from the UK and Greece). This fortnight-long course trains attendees in the advanced software and numerical methods required to solve real-world problems with high-performance computers (HPC).
Year(s) Of Engagement Activity 2019,2020,2021,2022
URL https://www.csc.cam.ac.uk/academic/cpd/hpcacademy
 
Description Headstart and Work Experience Research "Taster" Projects 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Schools
Results and Impact Groups of secondary school students come to the University of York's Department of Physics for a week to experience a little of what life is like as a working physicist, and to have a taste of what can be achieved with research-level physics. The aspect I am most involved with is the research mini-project, where I introduce the students to an area of current research interest which they can then research independently over the week. The students present their work in poster-form at a mini-conference at the end of the week. The projects always spark a great deal of discussion and really capture the students' imaginations, and the overall quality of the final posters is outstanding.
Year(s) Of Engagement Activity 2018,2019,2021,2022
URL https://www.york.ac.uk/physics/public-and-schools/events/headstart-2019/
 
Description Hermes Summer School 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Postgraduate students
Results and Impact Over 30 students from around the world attended this summer school to learn cutting-edge simulation methods and approaches. The atmosphere was relaxed and collaborative, and there was a lively discussion following the talks which continued throughout the summer school.
Year(s) Of Engagement Activity 2018
URL http://hermessummerschool.org/about-1/
 
Description OMG Science 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach Local
Primary Audience Schools
Results and Impact An Outreach event in a school in Leeds, demonstrating quantum physics to secondary school children. This was organised by the Archbishop of York Youth Trust and concluded with a talk and a lively panel debate on "Faith and Science" (I was a member of the panel).
Year(s) Of Engagement Activity 2018
URL https://www.archbishopofyork.org/news/news-2018/omg-science-abbey-grange-academy
 
Description OpenACC Summit 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Invited talk at the OpenACC Summit, presenting our work to port the CASTEP software to GPUs using OpenACC to an international audience of scientists, RSEs and researchers.
Year(s) Of Engagement Activity 2022
URL https://www.openacc.org/events/summit-2022
 
Description The Uneasy Sleeps of Max Maxwell 
Form Of Engagement Activity A formal working group, expert panel or dialogue
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Public/other audiences
Results and Impact Following a thought-provoking performance of "The Uneasy Sleeps of Max Maxwell" in Malvern, I joined a 5-person expert panel for a discussion of AI, Machine Learning and Ethics with the audience. There were many insightful questions, and it was clear that the audience had engaged well with the topic.
Year(s) Of Engagement Activity 2023
URL https://ridinglights.org/max-maxwell/
 
Description University Open Days 
Form Of Engagement Activity Participation in an open day or visit at my research institution
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Schools
Results and Impact Thousands of pupils and parents came to each University of York Open Day, and typically over 100 were specifically interested in Physics. I devised a computer demonstration of materials modelling, running on an "odroid" low-power computer, which was on the Physics stall, delivered a talk on "theoretical and computational physics" and showcased some recent research. I also developed a practical demonstration of thermoelectric conversion from a temperature gradient to electrical energy, demonstrating the correctness of the computational predictions.
Year(s) Of Engagement Activity 2018,2019,2022