📣 Help Shape the Future of UKRI's Gateway to Research (GtR)

We're improving UKRI's Gateway to Research and are seeking your input! If you would be interested in being interviewed about the improvements we're making and to have your say about how we can make GtR more user-friendly, impactful, and effective for the Research and Innovation community, please email gateway@ukri.org.

AI-SAFE - Artificial Intelligence Shield Against Fraudulent Exploits

Abstract

The emergence of tools like ChatGPT has significantly reshaped public perception of Artificial Intelligence. Since its launch in 2022, ChatGPT has gained approximately 300 million users. This growth has been further complimented by platforms such as Copilot and DeepSeek, with around 1.8 million and 12 million users, respectively. The rapid advancement of these tools, along with the expanding range of capabilities they offer, has opened unprecedented opportunities. As a result, many organisations have begun integrating them into their workplace applications and users have started resorting to these platforms for the simplest of tasks. This has thus transpired into use of such tools for even the smallest tasks such as drafting out an email.

With the rise in interest in these tools, developers have expanded their use with the support of extensions that cover various domains, including personalised chatbots, healthcare assistants, development frameworks, and specialised applications like data analysis and code generation tools. ChatGPT alone is estimated to have around 1,000 plugins catering to diverse use cases.

However, this rapid expansion and widespread integration has also introduced new cybersecurity vulnerabilities. These include prompt injection attacks, data poisoning, model inversion, adversarial attacks, and direct exploitation by cybercriminals. A primary motivation behind most such attack vectors is the creation of malicious code or files intended to exploit users. The success of these attacks is further complemented by the high level of confidence users have on these platforms with studies indicating that 60%-70% users trust outputs generated by Large Language Models (LLMs) without further verification. Despite safeguards against generating harmful code, vulnerabilities persist, particularly in API extensions, which serve as critical entry points for attackers. Malicious actors exploit these endpoints to inject harmful code, compromising users. This mirrors conventional security breaches, where 72% of web application vulnerabilities stemmed from flaws in coding practices.

AI-SAFE addresses these challenges by leveraging advanced AI to assess the suitability of codebases and detect potential risks. Its real-time evaluation, facilitated through a browser extension and a screen reader, helps prevent the execution of malicious code, ensuring a more secure development environment.

Lead Participant

Project Cost

Grant Offer

THE UNIVERSITY OF WEST LONDON £13,436 £ 13,436
 

Participant

UNIVERSITY OF CENTRAL LANCASHIRE £15,579 £ 15,579

Publications

10 25 50