EthicalML: Injecting Ethical and Legal Constraints into Machine Learning Models

Lead Research Organisation: University of Sussex

Department Name: Sch of Engineering and Informatics

Abstract

Our choice as to which movies to watch or novels to read can be influenced by suggestions made by machine learning (ML)-based recommender systems. However, there are some important scenarios where ML systems are deficient. Each of the following scenarios involves a situation where we wish to train an ML system so that it delivers a service. In each case, however, there is an important constraint that must be imposed on the operation of the ML system.
Scenario 1: We want a system that will match submitted job applications to our list of academic vacancies. The system has to be non-discriminatory to minority groups.
Scenario 2: We need an automated cancer diagnosis system based on biopsy images. We also have HIV test results, which can be used at training time but should not be collected from our new patients.
Scenario 3: We wish to have a system that can aid us in deciding whether or not to approve a mortgage application. We need to understand the decision process and relate it to our checklist such as whether or not the applicant has an overdraft in the last three months and is on electoral roll.

Scenario 1 asks an ML system to be fair in its decisions by being non-discriminatory with regards to, e.g., race, gender, and disability; scenario 2 requires an ML system to protect confidentiality of personal sensitive data; and scenario 3 demands transparency from an ML system by providing human-understandable decisions.

Equipping ML models with ethical and legal constraints, scenarios 1-3, is a serious issue; without this, the future of ML is at risk. In the UK, this is recognized by the House of Commons Science and Technology Committee, which recommended an urgent formation of a Council of Data Ethics ("The Big Data Dilemma" report, 2016). Furthermore, since 2015, the Royal Society has started a policy project that looks at the social, legal, and ethical challenges associated with advancement in ML models and their use cases.

Building ML models with fairness, confidentiality, and transparency constraints is an active research area, and disjoint frameworks are available for addressing each constraint. However, how to put them all together is not obvious. My long-term goal is to develop an ML framework with plug-and-play constraints that is able to handle any of the mentioned constraints, their combinations, and also new constraints that might be stipulated in the future.

The proposed ML framework relies on instantiating ethical and legal constraints as privileged information. This privileged information is available at training time to better train a decision model and to make a decision model non-discriminatory, but it will not be accessible for future data at deployment time. For confidentiality constraints, personal confidential data such as HIV test results are the privileged information. For fairness constraints, protected characteristics such as race and gender are the privileged information. For transparency constraints, complex un-interpretable but highly discriminative features such as deep learning features are the privileged information.

This project aims to develop an ML framework that produces accurate predictions and uncertainty estimates about its predictions while also complying with ethical and legal constraints. The key contributions of this proposal are: 1) a new privileged learning algorithm that overcomes limitations of existing methods by allowing to plug-and-play various constraints at deployment time, by being kernelized, by optimizing its hyperparameters, and by producing estimates of prediction uncertainty, 2) a scalable and automated inference that makes the new privileged learning algorithm easily applicable for any large scale learning problem such as binary classification, multi-class classification, and regression, and 3) an instantiation of the new algorithm for incorporating fairness, confidentiality, and transparency restrictions into ML models.

Planned Impact

Advancement in ethically and legally aware machine learning (ML) models has broad implications. In this project, I will focus on engaging with users in the following domains:
1. Predictive Policing
Predictive policing refers to "computer systems that use data to forecast where crime will happen or who will be involved" (Upturn's report, 2016). In the UK, the effectiveness of predictive policing is widely reported, for example, Strathclyde Police cited a reduction in domestic violence reoffenders (Joe Newbold's report in 2015). The fact that predictive policing technologies rely on historical and inherently biased crime data to build ML models raises several ethical and legal concerns such as fairness and transparency. Upturn's 2016 report on "Early Evidence on Predictive Policing and Civil Rights" concluded that the predictive policing tools, which are currently designed and implemented, reinforce discriminatory policing practices. This is a serious issue that puts the future of predictive policing at risk despite its success in reducing crime. This project will develop an ML model that corrects past biases via non-discriminatory fairness constraints.

2. Healthcare Analytics
For health workers, hospitals and governmental decision makers, it is extremely useful to have tools to predict healthcare problems such as maternal mortality rates and cancer. The AI and Life in 2030 Report stressed that ML-driven applications need to "gain the trust of doctors, nurses, and patients". The proposed ML framework acknowledges the need to use patient confidential data only in a strict need-to-know basis and to not use them in the deployed system. Furthermore, the transparency constraints will aid health professionals in their decisions and will steer clear of the statement "because the computer said so".

3. Improving the skills base
"The Big Data Dilemma" report has urged immediate action to tackle the crisis of data analytics skills. The appointed PDRA will develop skills and experience in data analytics, collaboration, scientific and public presentations, and organization of workshop and stakeholder meetings.

4. General public
This project will have societal impact by informing both ML enthusiasts and sceptics about the reality that ML technologies have permeated our everyday life, and that there is an active push within the ML community to develop models that respect ethical and legal constraints.

Although not a direct focus, I recognize the long-term implications of the study in the following areas, and will be alert to any opportunity for establishing links for future actions:

5. Human Resource Analytics
Companies and universities use ML models on candidates' background information (e.g. application form data including disability) and employee data to predict whether this candidate should be hired.

6. Mortgage Approval
Similarly, lenders use ML models on borrowers' background information, including licensed data such as credit score information, to predict whether it is risky to extend a mortgage offer.

7. Insurance Premium Setting
Also, insurance companies use ML models on applicants' driving history and biographical data to predict the driver type of an applicant, and subsequently to set an insurance premium accordingly.

An algorithmic assessment method, which is used for predicting human outcomes such as recruitment (5), loan approval (6), and insurance premium (7), contributes to a world with decreasing human biases. To achieve this, however, we need advanced ML models that are free of algorithmic biases (fairness), despite the fact that they are trained based on historical and biased data. Additionally, the deployed models should not collect personal sensitive data (confidentiality). Furthermore, in an interactive mode, where humans can check the computer's judgment, understanding the reasons behind predictions made by ML models (transparency) is a prescription for improved collaborative decisions.

Funded Value:

£100,675

Funded Period:

Sep 17 - Mar 19

Funder:

EPSRC

Project Status:

Closed

Project Category:

Research Grant

Project Reference:

EP/P03442X/1

Principal Investigator:

Novi Quadrianto

Research Subject:

Info. & commun. Technol. (100%)

Research Topic:

Artificial Intelligence (100%)

Organisations

University of Sussex (Lead Research Organisation)

People	ORCID iD
Novi Quadrianto (Principal Investigator)

Publications

Author Name

Title Publication Date Published

|< < 1 2 > >|

10 25 50

Butcher B (2021) Causal datasheet for datasets: An evaluation guide for real-world data analysis and data collection design using Bayesian Networks in Frontiers in Artificial Intelligence

Butcher B (2021) Causal Datasheet for Datasets: An Evaluation Guide for Real-World Data Analysis and Data Collection Design Using Bayesian Networks. in Frontiers in artificial intelligence

Gadetsky A (2020) Low-Variance Black-Box Gradient Estimates for the Plackett-Luce Distribution in Proceedings of the AAAI Conference on Artificial Intelligence

Gadetsky A (2020) Low-Variance Black-Box Gradient Estimates for the Plackett-Luce Distribution in AAAI Conference on Artificial Intelligence

Gadetsky A (2019) Low-variance Black-box Gradient Estimates for the Plackett-Luce Distribution

Kehrenberg T (2020) Tuning Fairness by Balancing Target Labels. in Frontiers in artificial intelligence

Kehrenberg T (2018) Tuning Fairness by Balancing Target Labels

Quadrianto N (2019) Discovering Fair Representations in the Data Domain in IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Quadrianto N (2019) Discovering Fair Representations in the Data Domain

Quadrianto N (2017) Recycling Privileged Learning and Distribution Matching for Fairness in Conference on Neural Information Processing Systems (NeurIPS -formerly NIPS)

Key Findings
Impact Summary
Further Funding
Engagement Activities


Description	The key goal of this project is to develop new machine learning models that embed fairness, accountability, transparency, and trustworthiness into them for ensuring ethical outcomes and long-term public confidence in the deployment of automated systems. We have developed a set of machine learning models and algorithms that: a) is able to handle any definitions of fairness, e.g. equality of acceptance rates, equality of true positive/negative rates, or equality of positive/negative predicted values between sub-groups; and b) delivers transparency in how fairness is met. Having models that allow for plug-and-play with different fairness definitions is desirable as it is widely accepted that relevant fairness definitions depend on the specific context and application. Transparency in fairness is an integral, yet so far overlooked, ingredient for facilitating public conversations and productive public debates regarding fair machine learning systems. We have developed a machine learning model that returns the Pareto frontier of multi-objective maximization of classification accuracy (system's utility) and fairness in predictions. This will facilitate decision makers to select an operating point of the deployed system and to be accountable for it. In another important step in re-introducing human accountability into algorithmic decision making, we have also developed a machine learning model that has an important capability for setting a target acceptance/positive rate. For example, we can set an acceptance rate of 0.7 for both sub-groups. Orthogonal to developing novel models that embed fairness into their learning process, we have also looked at the data-generation pipeline. We have proposed a method that transforms input data to their fair and interpretable representations, in which the semantic of the input domain is retained at the transformed space. We have also devised methods to augment the training data with imagined/contrastive examples, creating a more balanced training set with respect to sub-groups. Those contrastive examples can be used to provide an individual-level explanation of fair systems. These contributions have been published, among others, in the Neural Information Processing Systems Conference, Computer Vision and Pattern Recognition Conference, and Frontiers in Artificial Intelligence Journal. We have also released several software packages ("EthicML" a package for evaluating the performance of methods which aim to increase fairness, accountability and/or transparency, and "ethicml-models") at https://github.com/predictive-analytics-lab.
Exploitation Route	We are pursuing the following opportunities: -- exploring issues of equitable interventions in the global development sector based on survey data that segment individuals based on psychographic and behavioral variables (together with an international non-profit organisation); and -- applying methods to ensuring ethical outcomes of Open Banking apps (together with an FCA authorized start-up).
Sectors	Communities and Social Services/Policy Digital/Communication/Information Technologies (including Software) Financial Services and Management Consultancy Healthcare
URL	https://wearepal.ai/


Description	The results from this grant have been used in the following ways: a) formed the basis for an ERC Starting Grant; b) formed the basis for a partnership with BCAM on setting up a BCAM Severo Ochoa Strategic Lab on Trustworthy Machine Learning in Spain (the Lab will play a crucial role in the third Severo Ochoa Centre of Excellence accreditation for BCAM; 1st January 2023 - 31st December 2026); c) formed the foundation for Astute AI Ltd.
First Year Of Impact	2019
Sector	Digital/Communication/Information Technologies (including Software),Education
Impact Types	Societal Economic


Description	Amazon AWS Cloud Credits Award
Amount	$20,000 (USD)
Organisation	Amazon.com
Sector	Private
Country	United States
Start	05/2018
End	05/2019


Description	Developing Bias Auditing and Mitigation Tools for Self-Assessment of AI Conformity with the EU AI Act through Statistical Matching
Amount	€ 150,000 (EUR)
Funding ID	101123000
Organisation	European Research Council (ERC)
Sector	Public
Country	Belgium
Start	05/2024
End	11/2025


Description	ERC Starting Grant
Amount	€ 1,443,697 (EUR)
Organisation	European Research Council (ERC)
Sector	Public
Country	Belgium
Start	03/2020
End	03/2025


Description	GPU Grant Program
Amount	£2,000 (GBP)
Organisation	NVIDIA
Sector	Private
Country	Global
Start	11/2017


Description	It takes two to tango: a synergistic approach to human-machine decision making
Amount	€ 7,008,798 (EUR)
Funding ID	101120763
Organisation	European Commission
Sector	Public
Country	Belgium
Start	09/2023
End	09/2027


Description	IET Sussex Network Seminar (Falmer)
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	Regional
Primary Audience	Professional Practitioners
Results and Impact	The Institution of Engineering and Technology (IET) Lecture on Ethical Machine Learning. My 2-hour lecture can be used to contribute towards Continuing Professional Development (CPD) of IET members. The event is organized by IET Sussex Local Network.
Year(s) Of Engagement Activity	2018
URL	https://communities.theiet.org/files/14922


Description	Presentation at Brighton and Hove Council's Corporate Management Team Meeting (Brighton Town Hall)
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	Local
Primary Audience	Policymakers/politicians
Results and Impact	My talk on ethical machine learning was attended by the chief executive of Brighton & Hove City Council, several executive directors, and the Corporate Management Team (senior officers across all departments).
Year(s) Of Engagement Activity	2018


Description	Presentation at Home Office Analysis and Insight (Croydon)
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	Local
Primary Audience	Professional Practitioners
Results and Impact	The talk on ethical machine learning was attended by 20+ data scientists from the Data Analytics Competence Centre.
Year(s) Of Engagement Activity	2018


Description	Presentation at Stochastic Processes and Probabilistic Models in Machine Learning Workshop (Moscow)
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	Regional
Primary Audience	Postgraduate students
Results and Impact	A two-day workshop organised by Centre of Deep Learning and Bayesian Methods, National Research University Higher School of Economics. Attended by mostly postgraduate students from several universities in Moscow.
Year(s) Of Engagement Activity	2018
URL	https://www.youtube.com/watch?v=1-OYitMQNDQ


Description	Presentation for the Research Training Group of German National Merit Foundation (Heidelberg)
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	Local
Primary Audience	Undergraduate students
Results and Impact	A half-day presentation to 20+ scholars from the "Artificial Intelligence" Research Training Group of the German National Academic Scholarship Foundation.
Year(s) Of Engagement Activity	2018


Description	Public Lecture (Lviv)
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	Regional
Primary Audience	Public/other audiences
Results and Impact	General public including the local data science community in Lviv, Ukraine attended the talk which was held on Thursday, 26 July 2018 from 18:00-19:00. The talk was organised by Applied Sciences Faculty, UCU.
Year(s) Of Engagement Activity	2018


Description	Sussex Data Science Meetup (Brighton)
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	Regional
Primary Audience	Industry/Business
Results and Impact	I gave a talk on ethical machine learning to the general public who are interested in data science, big data, machine learning or AI.
Year(s) Of Engagement Activity	2018
URL	https://www.meetup.com/SussexDataScience/events/255388643/

Abstract

Planned Impact

Organisations

People

ORCID iD

Publications