Inference mechanisms using pragmatics and detailed knowledge for Commonsense reasoning

Lead Research Organisation: University of Leeds

Department Name: Sch of Computing

Abstract

Commonsense reasoning (CSR) is a key component of any truly artificially intelligent agent; moreover the ability to interpret questions posed in natural language (NL) is important for many applications of AI. This project aims to investigate and make progress in this important but challenging area.

A key aspect of CSR involved in question answering tasks is the ability to draw on a wealth of knowledge as well as appeal to common assumptions about the world and interpret terms in a defeasible manner. The task of structuring such knowledge is a huge challenge and is an active area of research. Well-developed cognitive theories exist involving pragmatics, dealing with common assumptions and interpretations of terms; however little is said on how to extract rules from these theories. The question then is, how can we leverage existing knowledge bases while maintaining ideas from pragmatics in order to solve question answering benchmarks?

NL can be very informationally dense, allowing speakers to convey a lot of information in only a few words. There is a wealth of research relating to theories of discourse and how humans manage to deal with this semantic under specification in communication. Only with pragmatic inferences can a sentence be transformed into a formal representation of the intended proposition of the speaker. Therefore any system attempting to interpret NL should incorporate mechanisms for inferring this hidden information. Further, being able to account for these sorts of inferences can be very important in CSR for AI, in particular in the Winograd Schema Challenge (WSC). However, the difficulty of the task means that little work exists on formalising this pragmatic knowledge or integrating it into reasoning systems.

Though there exist attempts to formalize some aspects of pragmatics, as pointed out by Bunt and Black some parts of pragmatics "do not enjoy a wealth of representational formalisms". One interesting avenue from the perspective of pragmatics is how to deal with prototypes. As I discuss in my paper, appealing to prototypes is an important aspect of CSR. This raises two particular questions, firstly how do we identify prototypes? Secondly, how do we identify when using prototypes is inappropriate and in these cases how do we change/loosen the definitions we are using? For instance, when one reasons about the sentence "The trophy doesn't fit into the suitcase because it is too large", what is too large? It is not necessary to worry about a precise semantic commitment for the notion of 'large', but instead to evaluate the sentence considering an interpretation of large which satisfies most notions of large. There is also the interesting case of spatial prepositions. A spatial preposition like 'on' is heavily underspecified; it can be used to denote a variety of spatial configurations - the glass on the table, the picture on the wall etc. Though it appears to have a prototypical definition - being above and in contact with - it often just denotes a salient spatial relationship between two objects and then from knowledge of the two objects and context we infer a more precise configuration. Understanding when to use a prototypical 'on' is a similar problem to 'large' - interpreting the intention of the speaker and world knowledge is required.

The aim of this research is to develop systems for interpreting NL while incorporating detailed knowledge and using the principles of pragmatics to guide this. The WSC helps motivates this discussion, however formalizing the necessary aspects of reasoning to tackle the WSC and integrating them into one system is notoriously hard. Therefore, the WSC is not necessarily a suitable problem to tackle. As we will be trying to extract complex pragmatic information it makes sense to work in an area where semantic representations are well developed and understood, therefore I intend to restrict to work on problems in the spatial domain.

Student:

Adam Richard-Bollans

Period of Study:

May 17 - Oct 21

Funder:

EPSRC

Project Status:

Closed

Project Category:

Studentship

Project Reference:

1961029

Research Topic:

Unclassified

Organisations

University of Leeds (Lead Research Organisation)

People	ORCID iD
Anthony Cohn (Primary Supervisor)
Adam Richard-Bollans (Student)

Publications

Author Name Title Publication Date Published

10 25 50

Richard-Bollans, A L (2017) The Role of Pragmatics in Solving the Winograd Schema Challenge

Richard-Bollans, A L (2018) Towards a Cognitive Model of the Semantics of Spatial Prepositions

Richard-Bollans, A L (2019) Investigating the Dimensions of Spatial Language

Richard-Bollans A (2020) Modelling the Polysemy of Spatial Prepositions in Referring Expressions

Richard-Bollans, A L (2020) Automatic Generation of Typicality Measures for Spatial Language in Grounded Settings

Richard-Bollans A (2020) Categorisation, Typicality & Object-Specific Features in Spatial Referring Expressions

Richard-Bollans A (2023) Identifying and modelling polysemous senses of spatial prepositions in referring expressions in Cognitive Systems Research

Studentship Projects

Project Reference	Relationship	Related To	Start	End	Student Name
EP/N509681/1			01/10/2016	30/09/2021
1961029	Studentship	EP/N509681/1	01/05/2017	31/10/2021	Adam Richard-Bollans

Key Findings
Research Databases and Models
Engagement Activities


Description	There has been much work in linguistics and cognitive science regarding the semantics of spatial prepositions ('in', 'on', 'above' etc..), highlighting the complexity of representing their meaning. However, much of this insight has not yet been translated into a satisfactory computational model. Most existing models of spatial prepositions take a very simple approach and ignore many features that may be salient as well as the various different ways the terms may be understood. So far we have provided methods for formalising some of these often ignored features and shown that models can be generated from data which account for a wide variety of features. We have also provided methods which allow various different senses a spatial preposition may express to be accounted for.
Exploitation Route	Modelling spatial prepositions is crucial for many applications involving human-robot interaction. It is possible that the outcomes of this project are implemented to help robot agents better manage spatial language.
Sectors	Digital/Communication/Information Technologies (including Software)


Title	Comparing Category and Typicality Judgements for Spatial Prepositions
Description	Various accounts of cognition and semantic representations have highlighted that, for some concepts, different factors may influence category and typicality judgements. In particular, some features may be more salient in categorisation tasks while other features are more salient when assessing typicality. In experiment we explore the extent to which this is the case for the English spatial prepositions (`in', `inside', `on', `on top of', `over', `above', `under', `below' and `against'). We hypothesise that object-specific features --- related to object properties and affordances --- are more salient in categorisation, while geometric and physical relationships between objects are more salient in typicality judgements. In order to test this hypothesis we conducted a study using virtual environments to collect both category and typicality judgements in 3D scenes. The data collection framework is built on the Unity3D game development software, which provides ample functionality for the kind of tasks we implement. Two tasks were created for our study --- a Categorisation Task and a Typicality Task. In the Categorisation Task participants are shown a figure-ground pair (highlighted and with text description) and asked to select all prepositions in the list which fit the configuration. Participants may select `None of the above' if they deem none of the prepositions to be appropriate. In the Typicality Task participants are given a description and shown two configurations. Participants are asked to select the configuration which best fits the description. Again, participants can select none if they deem none of the configurations to be appropriate. The current dataset is from an online study where participants were recruited via internal mailing lists along with recruitment of friends and family. We have created 18 virtual 3D scenes each containing a single highlighted figure-ground pair. Four scenes each were created for `in', `on', `over' and `under' and these scenes were also shared with their respective geometric counterparts: `inside', `on top of', `above' and `below'. Two scenes were created for `against'. In the Typicality Task, participants compare scenes/configurations associated with the preposition given in the description. The study was conducted online and participants from the university were recruited via internal mailing lists along with recruitment of friends and family. Each participant performed first the Categorisation Task on 6 randomly selected scenes and then the Typicality Task on 15 randomly selected scenes, which took participants roughly 5 minutes. 30 native English speakers participated providing 180 annotations in the Categorisation Task and 447 annotations in the Typicality Task. As the study was hosted online, we first asked participants to show basic competence. This was assessed by showing participants two simple scenes with an unambiguous description of an object. Participants are asked to select the object which best fits the description. If the participant makes an incorrect guess in either scene they are taken back to the start menu.
Type Of Material	Database/Collection of data
Year Produced	2020
Provided To Others?	Yes
Impact	Various accounts of cognition and semantic representations have highlighted that, for some concepts, different factors may influence category and typicality judgements. In particular, some features may be more salient in categorisation tasks while other features are more salient when assessing typicality. In this paper we explore the extent to which this is the case for English spatial prepositions and discuss the implications for pragmatic strategies and semantic models. We hypothesise that object-specific features --- related to object properties and affordances --- are more salient in categorisation, while geometric and physical relationships between objects are more salient in typicality judgements. In order to test this hypothesis we conducted a study using virtual environments to collect both category and typicality judgements in 3D scenes. Based on the collected data we cannot verify the hypothesis and conclude that object-specific features appear to be salient in both category and typicality judgements, further evidencing the need to include these types of features in semantic models.
URL	http://archive.researchdata.leeds.ac.uk/755/


Title	Investigating the Dimensions of Spatial Language: A Preliminary Study
Description	In order to collect rich data on spatial prepositions, we have created a framework for gathering user annotations in 3D virtual environments. The framework is built using the Blender 3D modelling software and game engine. The framework initially provides two distinct tasks with which our preliminary study was conducted - a Selection Task and a Description Task. In both tasks, participants are given a first person view of a scene which they can navigate using the mouse and keyboard. In the Selection Task participants are given a preposition on screen and asked to select all figure-ground pairs in the scene which fit the preposition. Once they have selected all pairs they believe to be admissible they are shown another preposition and asked to repeat the process. In our preliminary study, we limited this task to 'in', 'inside', 'on', 'on top of', 'against', 'over', 'under', 'above' and 'below'. In the Description Task objects are highlighted and participants are able to type in a spatial description of the object. In our preliminary study we asked participants to give descriptions of the object locations using a definite description, in the format figure + preposition + ground e.g. 'the guitar by the bookshelf'. We also allowed the use of multiple prepositions if the participant deemed it necessary e.g. 'the cup on the table near the lamp'. To make the annotations more relevant to our research we asked participants to only use the prepositions in the Selection Task plus 'to the right of', 'to the left of', 'in front of', 'behind', 'near', 'next to', 'at'. The current dataset is from a session we hosted in one of our computing labs where we invited participants from across the university campus to take part. We asked participants to complete either the Selection Task or Description Task in one of our scenes. Participants were given a brief introduction to the study and instructions on how to complete the given task along with explanations of key terminology. All participants were asked to create annotations for ten minutes. 23 native English speakers participated, of which 13 performed the Selection Task and 10 performed the Description Task
Type Of Material	Database/Collection of data
Year Produced	2019
Provided To Others?	Yes
Impact	Spatial prepositions in the English language can be used to denote a vast array of configurations which greatly diverge from any typical meaning and there is much discussion regarding how their semantics are shaped and understood. Though there is general agreement that non-geometric aspects play a significant role in spatial preposition usage, there is a lack of available data providing insight into how these extra semantic aspects should be modelled. This paper is aimed at facilitating the acquisition of data that supports theoretical analysis and helps understand the extent to which different kinds of features play a role in the semantics of spatial prepositions. We first consider key features of spatial prepositions given in the literature. We then introduce a framework intended to facilitate the collection of rich data; including geometric, functional and conventional features. Finally, we describe a preliminary study, concluding with some insights into the difficulties of modelling spatial prepositions and gathering meaningful data about them.
URL	http://archive.researchdata.leeds.ac.uk/558/


Title	Study on the Semantics of Spatial Language
Description	Building on a previous study to collect rich data on spatial prepositions, we have conducted a study to gather annotations related to spatial language in 3D virtual environments. The data collection environment is built using the Unity3D modelling software and game engine. The study comprised two tasks --- a Preposition Selection Task and a Comparative Task. The Preposition Selection Task allows for the collection of categorical data while the Comparative Task provides typicality judgements. In the Preposition Selection Task participants are shown a figure-ground pair (highlighted and with text description) and asked to select all prepositions in the list which fit the configuration. Users may select `None of the above' if they deem none of the prepositions to be appropriate. In the Comparative Task a description is given with a single preposition and ground object where the figure is left ambiguous. Participants are asked to select an object in the scene which best fits the description. Again, participants can select none if they deem none of the objects appropriate. In both tasks, participants are given a first person view of an indoor scene which they can navigate using the mouse and keyboard. To allow for easy selection, objects in the scene are indivisible entities e.g. a table in the scene can be selected but not a particular table leg. We limited both tasks to prepositions: `in', `inside', `on', `on top of', `against', `over', `above', `under' and `below'. The current dataset is from an online study where participants were recruited via internal mailing lists along with recruitment of friends and family. For the study 67 separate scenes were created in order to capture a variety of tabletop configurations. Each participant performed first the Preposition Selection Task on 10 randomly selected scenes and then the Comparative Task on 10 randomly selected scenes, which took participants roughly 10 minutes. Some scenes were removed towards the end of the study to make sure each scene was completed at least 3 times for each task. 32 native English speakers participated in the Preposition Selection Task providing 635 annotations, and 29 participated in the Comparative Task providing 1379 annotations. As the study was hosted online we first asked participants to show basic competence. This was assessed by showing participants two simple scenes with an unambiguous description of an object. Participants are asked to select the object which best fits the description in a similar way to the Comparative Task. If the participant makes an incorrect guess in either scene they are taken back to the start menu.
Type Of Material	Database/Collection of data
Year Produced	2020
Provided To Others?	Yes
Impact	In cognitive accounts of concept learning and representation three modelling approaches provide methods for assessing typicality: rule-based, prototype and exemplar models. The prototype and exemplar models both rely on calculating a weighted semantic distance to some central instance or instances. However, it is not often discussed how the central instance(s) or weights should be determined in practice. The current dataset allowed us to explore how to automatically generate prototypes and typicality measures of concepts from data, introducing a prototype model and discussing and testing against various cognitive models. We conclude that our model provides significant improvement over the other model and also discuss the improvements given by a novel inclusion of functional features in our model.
URL	https://doi.org/10.5518/764


Description	COSIT Doctoral Colloquium Presentation
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	Presented current research project and approaches to the challenge. Promoted interesting discussions afterwards
Year(s) Of Engagement Activity	2019
URL	https://cosit2019.ur.de/index.php


Description	Commonsense Presentation 2017
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	Presented our paper 'The Role of Pragmatics in Solving the Winograd Schema Challenge' to researchers at the conference, leading to discussion of related topics and potential future collaborations.
Year(s) Of Engagement Activity	2017
URL	http://eprints.whiterose.ac.uk/122937/


Description	ECAI presentation
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Presented our paper "Automatic generation of typicality measures for spatial language in grounded settings"
Year(s) Of Engagement Activity	2020
URL	https://www.youtube.com/watch?v=psFOqOmuK44


Description	ESSLLI Student Session 2018
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Gave a talk to summer school attendees outlining my research project, this led to lots of discussions of related topics.
Year(s) Of Engagement Activity	2018
URL	http://esslli2018.folli.info/student-session/


Description	KRR Presentation
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Presented our paper "Modelling the polysemy of spatial prepositions in referring expressions"
Year(s) Of Engagement Activity	2020
URL	https://www.youtube.com/watch?v=QLF3xFOnNZs&t=10s


Description	Machine Intelligence 21 workshop Presentation
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	Presented current research project and approaches to the challenge. Promoted interesting discussions afterwards
Year(s) Of Engagement Activity	2019
URL	http://mi21-hlc.doc.ic.ac.uk/


Description	SPLU Presentation
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Presented our paper "Categorisation, typicality & object-specific features in spatial referring expressions"
Year(s) Of Engagement Activity	2020
URL	https://www.youtube.com/watch?v=KRCNfjHAwMA&t=233s


Description	Speaking of Location 2019: Communicating about Space Presentation
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	Presented our paper 'Investigating the Dimensions of Spatial Language'
Year(s) Of Engagement Activity	2019
URL	http://eprints.whiterose.ac.uk/148335/8/Investigating_the_Dimensions_of_Spatial_Language.pdf

Abstract

Organisations

People

ORCID iD

Publications

Studentship Projects