Automating Representation Choice for AI Tools

Lead Research Organisation: University of Cambridge

Department Name: Computer Science and Technology

Abstract

AI engines are ubiquitous in our lives: we talk to our mobile phones, ask directions from our sat-navs, and learn new facts and skills with our digital personal assistants. But most of the time, we need to learn how these systems work first: we have to adapt to them as they are not aware of our level of experience, expertise and preferences. AI engines are fast and can deal with a deluge of data much better than us, but they do so in machine-oriented ways, which are often inaccessible and unintelligible to humans. How can we build machines that will adapt to us?

The aim of this project is to identify and study how humans represent information that they want to work with and from which they will obtain new knowledge. Humans have the capability to choose the representation that works for them to enable them to solve a new problem, and moreover, if the representation needs to be changed, they can spot this and change it. Unlike humans, machines in general have fixed representations and do not have the understanding of the user. For example, sat-nav systems will only give directions with elementary spatial commands or route planning functions, whereas humans give directions in many forms, for instance in terms of landmarks or other geographic features that are based on shared knowledge.

We want to model in computational systems this inherently human ability to choose or change appropriate representations, and make machines do the same. We want to find out what are the cognitive processes that humans use to select representations, what criteria they use to choose them, how we can model this ability on machines, and thus how to engineer systems that will select effective representation to enhance people's problem solving and learning.

This is a grand challenge, because it must marry human with machine capabilities. Our previous EPSRC funded feasibility project brought together an interdisciplinary team to combine expertise in computer science on automated reasoning with diagrammatic representations (Jamnik, Cambridge) with expertise in cognitive science on human problem solving and learning with representations (Cheng, Sussex). This interdisciplinary approach has been critical to the success of the feasibility project.

We previously showed that when humans choose a representation of a problem, they use cognitive and formal properties of the problem and its representation to make their choice. In this project, we build on and generalise this hypothesis and demonstrate its utility by building a mathematics tutor that intelligently picks good representations according to the skill level of different learners. We have the following goals:

1. Develop representation selection theory based on the formalisation of formal and cognitive properties.

2. Develop a cognitive theory to assess the efficacy of alternative representations and methods for selecting representations suited to the competencies of individual users.

3. Devise computational algorithms (software) that mechanise the right choice of representation based on the theoretical foundations.

4. Develop and test the algorithms on a range of domains to demonstrate the scalability and generality of the approach.

5. Build an AI tutoring system that implements automated and personalised representation choice based on the user's level of expertise and experience.

6. Empirically evaluate the capability of the tutoring system to select beneficial representations for supporting problem solving.

Our work is novel in that it will address the problem of appropriate representation choice. Moreover, we will build novel cognitive theories and computational models that will allow AI systems to operate in more human-like ways and adapt to the requirements of the problem and the needs of the user. Thus, the potential impact will span numerous domains where systems interact with humans to represent information and use it for extracting new knowledge.

Planned Impact

"Automating Representation Choice for AI Tools" is an ambitious project where we want to find out how humans choose and also change representation during problem solving. We want to develop engines that will give machines the same capability. Such machines will accrue many of the benefits that humans obtain from changing representation, including: more effective communication with a human by selecting a representation that is well-suited to their level of familiarity with the topic; better understanding of a human by identifying and adopting the representation that the human favours; and greater flexibility in adapting to the needs of the user. As such, the project will have economic, societal and knowledge impact.

The impact of this work will be wide as AI systems are becoming ubiquitous. When interacting and exchanging information with humans, developers of such products need to represent information in human understandable ways. We will devise techniques that will help developers of AI systems to build products that choose representations appropriate for their users -- and thus have potential economic benefit to them.

Making machines more human accessible will also benefit the society at large as it will potentially bring technical tools to those that are less technically versed.

We have already outlined how our work will contribute to the expansion of knowledge in Academic Beneficiaries. The techniques we will develop are novel and will provide a scientific advance in all areas (in academia and in industry) that must represent information for solving problems, reasoning, etc.

Whilst we focus on education as our initial concrete application target, our contributions will be general and could apply across different domains. They will lead to better understanding of representations and their relation to human expertise and preferences. As intelligent tools make part of our everyday life, we will all benefit from having more human-like systems that adapt to us, rather than us adapting to them.

Artificial Intelligence is a research area for growth by the EPSRC. The capability to select and adapt representations to suit human needs will provide a novel foundational capability for AI. The EPSRC funded Human-Like Computing (HLC) network recognises the importance of representation selection; the first of the cognitive science challenges in the network's strategy roadmap (https://epsrc.ukri.org/newsevents/pubs/human-like-computing-strategy-roadmap/) asks: "How do we create systems that create and revise their mental representations to fit the problem being addressed?" As a recognition of the importance and the potential impact of our work, the HLC network of interdisciplinary experts fully endorse this project (see Other Attachments).

To achieve maximum impact, we will follow a comprehensive dissemination strategy. We will publish our results in all relevant communities (e.g., artificial intelligence, cognitive science, automated reasoning, diagrams, knowledge representation, human-computer interaction, information visualisation, education) to achieve wide dissemination. We will demonstrate our work by organising a workshop and preparing tutoring material aimed at academic as well as more specific industrial communities. This will also provide a community building opportunity for people interested in representations and more generally, human-like computing. We will continue to participate at outreach activities to raise the awareness of the importance of human-like computing. We will create a web repository to enable free and public access to our papers, corpus of problems and their solutions, software and tutorials. Finally, we have enlisted an Advisory Board of experts spanning all areas relevant to this project, and coming from academia, education and industry - their advice will help us stay focused on relevant problems, influence diverse communities that they lead, and transfer our technology widely.

Funded Value:

£760,892

Funded Period:

Mar 20 - Dec 23

Funder:

EPSRC

Project Status:

Closed

Project Category:

Research Grant

Project Reference:

EP/T019603/1

Principal Investigator:

Mateja Jamnik

Research Subject:

Info. & commun. Technol. (100%)

Research Topic:

Artificial Intelligence (70%)

Human-Computer Interactions (30%)

Organisations

People	ORCID iD
Mateja Jamnik (Principal Investigator)
Daniel Raggi (Researcher)	http://orcid.org/0000-0002-9207-6621
Gem Stapleton (Researcher Co-Investigator)

Publications

Author Name

Title Publication Date Published

|< < 1 2 3 > >|

10 25 50

Baimagambetov A (2020) Diagrammatic Representation and Inference - 11th International Conference, Diagrams 2020, Tallinn, Estonia, August 24-28, 2020, Proceedings

Cheng P (2021) Diagrammatic Representation and Inference - 12th International Conference, Diagrams 2021, Virtual, September 28-30, 2021, Proceedings

Cheng P (2024) A Human Information Processing Theory of the Interpretation of Visualizations: Demonstrating Its Utility

Cheng P (2022) Diagrammatic Representation and Inference - 13th International Conference, Diagrams 2022, Rome, Italy, September 14-16, 2022, Proceedings

Colarusso F (2023) A novel interaction for competence assessment using micro-behaviors:

Colarusso F (2021) Diagrammatic Representation and Inference - 12th International Conference, Diagrams 2021, Virtual, September 28-30, 2021, Proceedings

Collins K (2024) Evaluating language models for mathematics through interactions.

Collins K (2023) Human Uncertainty in Concept-Based AI Systems

Collins K (2023) Evaluating Language Models for Mathematics through Interactions

Collins KM (2024) Evaluating language models for mathematics through interactions. in Proceedings of the National Academy of Sciences of the United States of America

Key Findings
Impact Summary
Policy Influence
Further Funding
Research Databases and Models
Collaboration
Software and Technical Products
Engagement Activities


Description	Outcome: * Representational Systems Theory (RST): a general theory about the structure of representations and the relations between representations across representational systems. * Oruga is an implementation of the main concepts of RST * Cognitive Properties of Representations: a theoretical framework designed to organise and identify key processes (properties) that affect cognitive load when individuals use representations. * RIST, RISN and RISE: tools for building models of Representation Interpretive Structure Theory (RIST).
Exploitation Route	Research into the nature of representations can be furthered from the informational and cognitive perspectives. Educators, designers and anybody creating AI systems can use our technology to personalised the way they communicate with their audience.
Sectors	Education
URL	http://www.cl.cam.ac.uk/research/rep2rep


Description	Below is the description of the outcomes of this project and how they can be used. Representational Systems Theory (RST) is a general theory about the structure of representations and the relations between representations across representational systems. It allows us to reason about the structure of representations at a level of abstraction that permits reasoning about diagrams, tables, geometric figures, and all sorts of representations within a single rigorous framework. In particular, the development of RST gave rise to a way of thinking about how the relations and invariants between representational systems can be used to transform representations across representational systems. The concepts and methods of RST were developed at a level of mathematical formality that enabled their implementation into a computational system called Oruga. Oruga is an implementation of the main concepts of RST. It includes type systems, constructor specifications, a graph-theoretic way of encoding the structure of representations, and schemas: a way of encoding invariants across systems. The main functionality of Oruga is the automatic transformation of representations across systems. The method for doing this is structure transfer. This is an inference-based method that exploits schemas to generate the structure of a target representation in relation to a given representation. Oruga contains some methods to estimate the cognitive costs of a given representation, based on a model justified by findings in Cognitive Science. Oruga is the back-end of MaRE (Mathematical Representation Engine), a front-end tool that specifically uses the functionalities of Oruga for generating representations of probability (formal algebraic notation, area diagrams, probability trees and contingency tables). Cognitive Properties of Representations is a theoretical framework designed to organise and identify key processes (properties) that affect cognitive load when individuals use representations. The framework is designed based on theoretical knowledge of human cognition and initial empirical tests conducted during this project. It considers processes that happen at different time levels (from milliseconds to days) and at different representation granularities (from symbols to full representations). The framework has been used in Oruga to support the selection of representations based on users' knowledge of the representations. Further, relevant properties' cognitive costs are also displayed in MaRE to support end-users (e.g., educators) make informed decisions on what and in what order to re-represent given a problem and given a specific user profile. RIST, RISN and RISE. The Representation Interpretive Structure Theory (RIST) is a theory that allows modelling of memory structures (schemas) that humans build when interpreting representations and representational systems. It proposes that schemas not only hold information about the concept that is being represented, but also about how it is represented. RISNotation (RISN) operationalises RIST by presenting a graphical modelling notation for RIST. To support end-users construction of models of interpretations an 'intelligent' web-based Editor, RISE, was produced. Models created with RISN/E can be used to understand users' knowledge of a representation and topic. Further, RISN/E can support the design of new representations for designers. TIPS. Transcription with Incremental Presentation of the Stimulus (TIPS) is a method for assessing users familiarity with graphical representations, such as graphs and charts. In TIPS users make verbatim copies of representations in a tightly controlled task in order to capture micro-behaviour signals at scales from 100 ms to 2 seconds. The signals are analysed to infer properties of users' memory chunk structures which underpin their understanding of representations.
First Year Of Impact	2020
Sector	Digital/Communication/Information Technologies (including Software),Education
Impact Types	Societal Economic


Description	House of Lords Specialist Adviser to Select Committee on AI
Geographic Reach	National
Policy Influence Type	Contribution to a national consultation/review
Impact	ATI was established, industry considered ethical guidelines, profile of UK work in AI increased
URL	https://publications.parliament.uk/pa/ld201719/ldselect/ldai/100/100.pdf


Description	How to (re)represent it?
Amount	£151,391 (GBP)
Funding ID	EP/R030650/1
Organisation	Engineering and Physical Sciences Research Council (EPSRC)
Sector	Public
Country	United Kingdom
Start	03/2018
End	11/2019


Title	A novel interaction for competence assessment using micro-behaviors: Extending CACHET to graphs and charts
Description	Data for paper published in: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI '23) These files contain: The data from the Graph Familiarity questionnaire used in our study (competence assessment using micro-behaviours_Demographic data and questionnaire) The interactions produced by our participants on each stimulus (competence assessment using micro-behaviours _interactions on all stimuli) All the pauses produced by our participants on each stimulus (competence assessment using micro-behaviours _Pauses) Paper abstract Competence Assessment by Chunk Hierarchy Evaluation with Transcription-tasks (CACHET) was proposed by Cheng [14]. It analyses micro-behaviors captured during cycles of stimulus view- ing and copying in order to probe chunk structures in memory. This study extends CACHET by applying it to the domain of graphs and charts. Since drawing strategies are diverse, a new interactive stimulus presentation method is introduced: Transcription with In- cremental Presentation of the Stimulus (TIPS). TIPS aims to reduce strategy variations that mask the chunking signal by giving users manual element-by-element control over the display of the stimulus. The potential of TIPS, is shown by the analysis of six participants transcriptions of stimuli of different levels of familiarity and com- plexity that reveal clear signals of chunking. To understand how the chunk size and individual differences drive TIPS measurements, a CPM-GOMS model was constructed to formalize the cognitive process involved in stimulus comprehension and chunk creation.
Type Of Material	Database/Collection of data
Year Produced	2023
Provided To Others?	Yes
Impact	Paper published in: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI '23)
URL	https://sussex.figshare.com/articles/dataset/A_novel_interaction_for_competence_assessment_using_mic...


Description	AI4ED
Organisation	University of Cambridge
Department	Faculty of Education
Country	United Kingdom
Sector	Academic/University
PI Contribution	I partnered wth Prof Andreas Stylianides as expert in maths education. We supervised 2 PhD students together.
Collaborator Contribution	Supervised 2 PhD students together. Secured PhD studentships.
Impact	Publications together
Start Year	2020


Title	Oruga
Description	Oruga
Type Of Technology	New/Improved Technique/Technology
Year Produced	2020
Impact	Transformation of representations can be done automatically.
URL	http://www.cl.cam.ac.uk/research/rep2rep


Title	RISE: Editor for Representational Interpretive Structure Notation (RISN)
Description	web app for building cognitive models
Type Of Technology	Webtool/Application
Year Produced	2020
Open Source License?	Yes
Impact	Enables building cognitive models
URL	https://aaron.stockdill.nz/repnotation/


Title	RST editor
Description	Editor for representation system theory
Type Of Technology	Webtool/Application
Year Produced	2020
Open Source License?	Yes
Impact	We can construct different representations in the language of Oruga.
URL	http://rep2rep.cl.cam.ac.uk


Description	Talks and tutorials
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Keynote and invited talks: Mateja presented her talk How Can We Make Trustworthy AI? at the following conferences: * Invited talk at CADE-29: 29th International Conference on Automated Deduction (Rome, July 1 - 4, 2023). * Keynote talk at ICCM 23: International Conference on Cognitive Modeling (Amsterdam, July 18 - 21, 2023). * Keynote talk at CogSci 2023 (Sydney, July 26 - 29, 2023). * Keynote talk at CICM 2023 (Cambridge, September 2023). * Departmental seminar (Cambridge, October 2023). Gem presented the keynote talk The Power of Diagrams: Observation, Inference and Overspecificity at Diagrams joint with VL/HCC (September 14, 2022 in Rome, Italy). Mateja presented the invited talk Towards a Human-Like Reasoning System at the Machine Intelligence Workshop at HLC 2022. Daniel presented the invited talk The Structure and Transformations of Representations at LEISYS 2021. Mateja presented the invited talk How to (Re)Represent it? at KR 2020. Tutorials: Peter presented the tutorial Graphical Modelling of Users' Interpretations of Visual Displays at VL/HCC 2023 Aaron presented the tutorial Representational Systems Theory: What, Why, and How at Diagrams 2022
Year(s) Of Engagement Activity	2019,2020,2021,2022,2023
URL	http://www.cl.cam.ac.uk/research/rep2rep/events

Abstract

Planned Impact

Organisations

People

ORCID iD

Publications