Blueprinting AI for Science at Exascale - Phase II (BASE-II)

Lead Research Organisation: Science and Technology Facilities Council

Department Name: Scientific Computing Department

Abstract

Advances in Artificial Intelligence (AI) are transforming the world we live in today. The innovations are driving two, interconnected aspects: They augment our knowledge, for example, we understand the behaviour of a virus better and faster than we did a decade ago. This improved understanding fuels innovations, improving the quality of our life, such as better vaccines, or better batteries for our mobile phones or our electric vehicles. The role AI and thus of computing is rather crucial for such advancements.

The desire to improve our knowledge on fundamentals, and thus to improve the quality of our life, has become central to our existence. Better and faster understanding leads to better and faster innovations being developed. This essential desire, in turn, demands computations to be performed at a faster rate than ever before - not only to understand very large datasets better, but also to perform very complex simulations, at least at a rate 50 times faster than most powerful computers we have on the planet today --- era of exascale computing. Exascale computers will be able to perform billion billion calculations per second.

The general challenge is to have relevant software technologies ready when such exascale computing becomes a reality, and it is a significant challenge to the international community.

This proposal aims to develop a software suite and relevant software designs to serve as blueprints for using AI for scientific discoveries at exascale --- Blueprinting AI for Science at Exascale (BASE-II). This project is a continuation of our previous work, carried out as part of Phase I, namely, Benchmarking for AI for Science at Exascale (BASE-I). In Phase I, we gathered an essential set of requirements from various scientific communities, which underpins our work in this phase,

The resulting software and designs will cover the following:

a) Facilitate better understanding of the interplay between different AI algorithms, and AI hardware systems across a range of scientific problems. We will be achieving this through a set of AI benchmarks, against which different AI software can be verified,
b) Facilitating incredibly complex simulations using AI: Although exascale systems will facilitate complex simulations (which are essential for mimicking realistic cases), we will accelerate them using AI. This can result in remarkable speedups (e.g., from days to seconds). Such a transformation can provide a massive leap in scientific discoveries.
c) Harmonising the efforts of scientific communities and of vendors through better partnerships: Exascale systems will have complex hardware capabilities, which may be difficult for scientists to understand. Equally, hardware system manufacturers working on the design of exascale systems, do not always understand the underpinning science. This unharmonised effort or non-synchronised advancements, hitherto has been sub-optimal. We intend to build better software / hardware through better partnerships, which we refer to as hardware-software co-design.
d) The success of AI is primarily due to a technology called, deep learning, which inherently relies on very large volumes of data. With technological advances, we can foresee that in the exascale era, the data volumes will not only be huge but also will be multi-modal. Understanding these extremely large-scale datasets will remain key to ensuring that AI can be conducted at exascale.
e) Finally, the community, whether scientific, or academic or industry, will need additional software technologies, or more specifically, an ecosystem of software tools to help with exascale computing. To this end, we will be producing a software toolbox.

We will also be conducting various knowledge exchange activities, such as, workshops, training events and in-field placements to ensure multi-directional flow of information and knowledge across relevant stakeholders and communities.

Funded Value:

£750,713

Funded Period:

Dec 22 - Nov 24

Funder:

SPF

Project Status:

Active

Project Category:

Research Grant

Project Reference:

EP/X019918/1

Principal Investigator:

Jeyan Thiyagalingam

Research Subject:

Info. & commun. Technol. (50%)

Tools, technologies & methods (50%)

Research Topic:

Artificial Intelligence (50%)

High Performance Computing (50%)

Organisations

People	ORCID iD
Jeyan Thiyagalingam (Principal Investigator)
Yuriy Chaban (Co-Investigator)
Vignesh Gopakumar (Co-Investigator)
Paul Calleja (Co-Investigator)
Rajeev Pattathil (Co-Investigator)
Jeremy Yates (Co-Investigator)	http://orcid.org/0000-0003-1954-8749
Jonathan Rowe (Co-Investigator)
Marion Samler (Co-Investigator)
Lamar Alex Moore (Co-Investigator)
Mark Wilkinson (Co-Investigator)
Anders Markvardsen (Co-Investigator)
Satheesh Maheswaran (Co-Investigator)
Tim Snow (Co-Investigator)

Publications

Author Name

Title Publication Date Published

10 25 50

Abstract

Organisations

People

ORCID iD

Publications