📣 Help Shape the Future of UKRI's Gateway to Research (GtR)

We're improving UKRI's Gateway to Research and are seeking your input! If you would be interested in being interviewed about the improvements we're making and to have your say about how we can make GtR more user-friendly, impactful, and effective for the Research and Innovation community, please email gateway@ukri.org.

Legal Systems and Artificial Intelligence

Lead Research Organisation: University of Cambridge
Department Name: Centre For Business Research

Abstract

A World Economic Forum meeting at Davos 2019 heralded the dawn of 'Society 5.0' in Japan. Its goal: creating a 'human-centred society that balances economic advancement with the resolution of social problems by a system that highly integrates cyberspace and physical space.' Using Artificial Intelligence (AI), robotics and data, 'Society 5.0' proposes to '...enable the provision of only those products and services that are needed to the people that need them at the time they are needed, thereby optimizing the entire social and organizational system.' The Japanese government accepts that realising this vision 'will not be without its difficulties,' but intends 'to face them head-on with the aim of being the first in the world as a country facing challenging issues to present a model future society.' The UK government is similarly committed to investing in AI and likewise views the AI as central to engineering a more profitable economy and prosperous society.

This vision is, however, starting to crystallise in the rhetoric of LegalTech developers who have the data-intensive-and thus target-rich-environment of law in their sights. Buoyed by investment and claims of superior decision-making capabilities over human lawyers and judges, LegalTech is now being deputised to usher in a new era of 'smart' law built on AI and Big Data. While there are a number of bold claims made about the capabilities of these technologies, comparatively little attention has been directed to more fundamental questions about how we might assess the feasibility of using them to replicate core aspects of legal process, and ensuring the public has a meaningful say in the development and implementation.

This innovative and timely research project intends to approach these questions from a number of vectors. At a theoretical level, we consider the likely consequences of this step using a Horizon Scanning methodology developed in collaboration with our Japanese partners and an innovative systemic-evolutionary model of law. Many aspects of legal reasoning have algorithmic features which could lend themselves to automation. However, an evolutionary perspective also points to features of legal reasoning which are inconsistent with ML: including the reflexivity of legal knowledge and the incompleteness of legal rules at the point where they encounter the 'chaotic' and unstructured data generated by other social sub-systems. We will test our theory by developing a hierarchical model (or ontology), derived from our legal expertise and public available datasets, for classifying employment relationships under UK law. This will let us probe the extent to which legal reasoning can be modelled using less computational-intensive methods such as Markov Models and Monte Carlo Trees.

Building upon these theoretical innovations, we will then turn our attention from modelling a legal domain using historical data to exploring whether the outcome of legal cases can be reliably predicted using various technique for optimising datasets. For this we will use a data set comprised of 24,179 cases from the High Court of England and Wales. This will allow us to harness Natural Language Processing (NLP) techniques such as named entity recognition (to identify relevant parties) and sentiment analysis (to analyse opinions and determine the disposition of a party) in addition to identifying the main legal and factual points of the dispute, remedies, costs, and trial durations. By trailing various predictive heuristics and ML techniques against this dataset we hope to develop a more granular understanding as to the feasibility of predicting dispute outcomes and insight to what factors are relevant for legal decision-making. This will allow us to then undertake a comparative analysis with the results of existing studies and shed light on the legal contexts and questions where AI can and cannot be used to produce accurate and repeatable results.

Planned Impact

Artificial Intelligence research encompasses a broad-and ever-expanding-array of disciplines including, but not limited to: the computer sciences, mathematics, linguistics, electrical engineering, psychology, neuroscience, economics, and operations research. While we do not profess to make discrete contributions to each of these fields, we believe this project cuts to the core of questions whose answers have implications for the use of AI in law, but for technical research into AI and social-scientific examinations of its societal impact. Specifically, our proposed research examines the central question of how we might identify and define limits or 'red lines' for using AI to replicate core aspects of the legal system, specifically legal adjudication.

With this in mind, our research is likely to have an impact beyond those contexts that we can foresee at the outset. Most immediately, however, we believe that our research will have the most proximate impact on legal scholarship, public policy around the use of AI in law, government innovation and investment strategies, and ongoing regulatory compliance debates. By involving stakeholders from the LegalTech community, intergovernmental organisations, and government ministries we will receive input from relevant stakeholders driving the development of AI, but our research will not be limited to their input. Here we will build on our existing contacts with international organisations including the OECD and ILO, government departments (BEIS and the MoJ in the UK, METI and the MoJ in Japan), and civil society groups.

The transformative potential and promise of AI is a matter of great public interest and concern. As such, we will ensure that our research project includes input and evidence from the public and civil society organisations. Here we will build on a series of public engagement forums hosted by CoI Christopher Markou as part of his Leverhulme Trust postdoctoral fellowship, and supported by the Law Society of England and Wales and Royal Society for the Arts. These events, scheduled for September 2019, will be hosted at law schools across the UK to educate the public on the implementation of AI and Big Data into legal administration and law enforcement. Public sentiment and concerns will then be fed back into a jointly authored report presented to the UK Ministry of Justice, and capped off by a public lecture by Dr Markou for the Cambridge Festival of Ideas in October 2018. This work will be accompanied by a series of op-eds in major newspapers, blogs, and media spots in print, video, and radio that will help raise the public profile of the project and dissemination of its findings.

We will also integrate our dissemination plan with the activities of the Cambridge Trust and Technology Initiative which is run by Co-Is Jat Singh and Jennifer Cobbe. The Trust and Technology Initiative is a 'big tent,' bringing people together, facilitating collaboration, and engaging industry, civil society, government, and the public, across: (i) relationships and interplays between technology and society; the legal, ethical and political frameworks impacting both trust and technology, and innovative governance, in areas such as transport, critical infrastructure, identity, manufacturing, healthcare, financial systems and networks, communications systems, internet of things; (ii) the nature of trust and distrust; trust in technology, and trust through technology; the many dimensions of trust at individual, organisational and societal levels; and (iii) rigorous technical foundations, for resilient, secure and safe computer systems, including data and communications platforms, artificial intelligence, and robotics
 
Description The horizon scanning workshop we held in Cambridge in March 2023 discussed scenarios in three key areas of potential impact of AI in the workplace: HR performance assessments; protections for freelance workers in the gig economy; and the resolution of workplace disputes. There was general agreement on the risks arising from AI, which included over-reliance on biased and error-prone systems, which had to be weighed in the balance against potential cost improvements. Increases in freelance work driven by AI would empower employers at the cost of worker autonomy if new rights were not enacted, and made effective, with respect to surveillance and the maintenance of a living wage. Automation of dispute resolution in and beyond the workplace, advanced as a means of improving access to justice, could also lead to a loss of worker voice through information asymmetries and an absence of representation.
Our work on the use of ML for case prediction suggests that while there is huge potential for this form of law-related AI, the techniques involved are still at an early stage. So far, published studies have been confined to demonstrating correlations between different parts of the same judgment text. Since judges write their opinions knowing the outcome, there is a high risk of cross-contamination between the test and training data used in these studies. Our experience of building large scale corpora of legal cases suggests that some of the challenges involved in curating datasets and annotating cases for analysis can be overcome using large language models such as GPT-4, which can be used to facilitate automated annotation. Our historical research suggests that NLP techniques such as sentiment analysis can be used to identify trends in judicial decision making and to address the issue of how shifts in legal language are linked to wider changes in the political, economic and technological context of the law.
Exploitation Route There is considerable scope for our work to be used by lawtech firms seeking to understand the potential of AI in the legal sector, and by regulators seeking to strike a balance between innovation and risk management.
Sectors Communities and Social Services/Policy

Creative Economy

Digital/Communication/Information Technologies (including Software)

Education

Electronics

Environment

Financial Services

and Management Consultancy

Leisure Activities

including Sports

Recreation and Tourism

Government

Democracy and Justice

URL https://www.jbs.cam.ac.uk/centres/business-research-cbr/research/research-projects/project-legal-systems-and-artificial-intelligence/
 
Description Our work has influenced the debate over the adoption of AI in the legal sector and in government in the UK and Japan, the two countries with which our project was mostly concerned, and also more widely in Europe and in the USA, as part of the ongoing debate over AI regulation, and how to balance risk management with innovation.
First Year Of Impact 2023
Sector Communities and Social Services/Policy,Creative Economy,Digital/Communication/Information Technologies (including Software),Education,Government, Democracy and Justice,Culture, Heritage, Museums and Collections
Impact Types Cultural

Societal

Economic

Policy & public services

 
Title Cambridge Law Corpus 
Description We introduce the Cambridge Law Corpus (CLC), a corpus for legal AI research. It consists of over 250,000 court cases from the UK. Most cases are from the 21st century, but the corpus includes cases as old as the 16th century. Together with the corpus, we provide annotations on case outcomes for 638 cases, done by legal experts. This dataset consists of 15 selected cases from the CLC. The selected cases are publicly available under the Open Justice License. 
Type Of Material Database/Collection of data 
Year Produced 2023 
Provided To Others? Yes  
URL https://www.repository.cam.ac.uk/handle/1810/357329
 
Title Deakin, S., Shuku, L., Cheok, V. (2024). English Poor Law Cases, 1691-1834. [data collection]. UK Data Service. SN: 857470, DOI: 10.5255/UKDA-SN-857470 
Description The dataset consists of annotated cases concerning the right to a settlement by hiring under poor law legislation which was in force between 1691 and 1834. During this period, parishes could contest their liabilities to pay poor relief by taking appeals from decisions of the quarter sessions to the Court of King's Bench. The decisions of the Court of King's Bench were recorded in a number of nominate reports, and analysed in several treatises and textbooks. The dataset contains the text of relevant cases with annotations designed to facilitate the application of natural language processing techniques. For further detail on potential uses of the dataset, see Simon Deakin and Linda Shuku, 'Exploring computational approaches to law: the evolution of judicial language in the Anglo-Welsh poor law, 1691-1834', Journal of Law and Society (2024) and CBR working paper series, https://www.jbs.cam.ac.uk/centres/business-research-cbr/publications/working-papers/. This dataset was constructed as part of a project ('Legal Systems and Artificial Intelligence') assessing the implications of the introduction of Artificial Intelligence (AI) into legal systems in Japan and the United Kingdom. The project was jointly funded by the UK's Economic and Social Research Council, part of UKRI, and the Japanese Science and Technology Agency (JST), and involved collaboration between Cambridge University (the Centre for Business Research, Department of Computer Science, and Faculty of Law) and Hitotsubashi University, Tokyo (the Graduate Schools of Law and Business Administration). Additional funding was provided by the Keynes Fund of the University of Cambridge through the project 'Legal Evolution and Industrialisation: Computational Analysis of Historical Poor Law and Workmen's Compensation Cases'. In November 2024 an updated version of the dataset was added to the one initially deposited. 
Type Of Material Database/Collection of data 
Year Produced 2024 
Provided To Others? Yes  
Impact The dataset is the basis for a published paper in the Journal of Law and Society (2025). 
URL https://reshare.ukdataservice.ac.uk/857470/