Software and Skills for Large-Scale Computing: collecting evidence to develop a National Research Software Strategy

Lead Research Organisation: University of Edinburgh
Department Name: Edinburgh Parallel Computing Centre

Abstract

Software has revolutionised the way that we conduct research, pervading all aspects of the research lifecycle. From machine-learning and big data to modelling and simulation, from management of experiments to record keeping and reproducibility, research is increasingly powered by software and digital technologies have become vital tools and techniques for the modern researcher. These tools and techniques require new skills and, often, new forms of research collaboration.

Recently, two reports laid the foundations for understanding the requirements for software and skills to support modern research. The Government Office for Science published a report titled "Large-scale computing: the case for greater UK coordination" which reviewed the UK's large-scale computing ecosystem and the interdependency of hardware, software and skills. This report highlighted the "strong case for continued public investment in large-scale computing" while noting that "investment in computer hardware alone will not be sufficient". It made strong recommendations on investment in software and skills, noting that "high-quality software is fundamental to realising the benefits of investments in computing. Software must be fit for purpose and be regularly tested, updated, and archived" and "there is an acute shortage of large-scale computing professionals, including system architects, data engineers, system operations professionals and software engineers. Exploitation of advanced computing requires skilled cross-disciplinary teams". Separately, the "ExCALIBUR Research Software Engineer Knowledge Integration Landscape Review" provided a comprehensive review of the skills required by Research Software Engineers (RSEs) in High Performance Computing (HPC) and their future training needs.

Building on these reports and the work of the Software Sustainability Institute, the goal of this project is to deliver a better understanding of the software and skills required for large-scale research computing (including HPC, HTC/cloud, AI/ML and data science) and recommendations for how policies and support for these should be structured.

This work will involve a two stage, mixed methods approach. The first stage, encompassing the first four project months, will comprise data gathering through a series of approaches including interviews, focus groups, workshops and surveys, as well as a literature review. The second stage, encompassing the last three project months, will comprise data analysis and the distillation of the data into a report on (and recommendations for) the support required for software and skills to support large-scale research computing. This study will contribute to the evidence base that can be used to develop a National Research Software Strategy for the UK and directly addresses Objective 2 of EPSRC's phase one DRI Objectives: Building the UK's skills base and software development community required for the continued operation of high quality HPC services.

Publications

10 25 50
 
Description This project undertook a review of the UK software and skills landscape, setting it in an international context.

It has published a report providing insight into the perceptions and attitudes to research computing, along with recommendations for how policies and support could be structured. It is based upon a mixed methods study comprising an online survey with 405 responses, eight online group interviews (23 participants) and four online stakeholder interviews (6 participants) alongside a literature review. Data and analysis code from the study will be published after cleaning and curation is complete.
Exploitation Route The results provide insight to policy and decision makers in the Digital Research Infrastructure space, and may be used to help identify ways of funding DRI.
Sectors Digital/Communication/Information Technologies (including Software)

 
Description Early results from the project were used in a response to the DCMS Future of Compute review. The findings from the study have also been used to support discussion at the ReSA Research Software Funders Forum events.
First Year Of Impact 2023
Sector Digital/Communication/Information Technologies (including Software),Government, Democracy and Justice
Impact Types Policy & public services

 
Description Response to Future of Compute Review
Geographic Reach National 
Policy Influence Type Contribution to a national consultation/review
URL https://www.gov.uk/government/publications/future-of-compute-review
 
Description Poster at UKRI Digital Research Infrastructure Congress 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact Presented a poster on initial results of UKRI software and skills report to policy and decision makers. A number of new collaborators expressed interest in finding out more about the findings of this project.
Year(s) Of Engagement Activity 2023
 
Description RSECon23 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact Attendance with exhibition stand and keynote talk at RSECon23 in Newcastle, UK. Engaged with many attendees.
Year(s) Of Engagement Activity 2023