Turing AI Fellowship:Neural Conversational Information Seeking Assistant

Lead Research Organisation: University of Glasgow
Department Name: School of Computing Science

Abstract

There have been significant recent advances in Virtual Personal Assistants (VPAs) such as Google Assistant, Siri, and Alexa. However, development of these assistant systems is expensive and difficult (often requiring multiple skilled PhDs). And further, current systems are capable of limited "conversations", with most actions consisting of a single interaction in limited domains to perform simple tasks ("set a timer", "play music", etc...). The goal of this research is to develop research to enable a future conversational search systems that can help solve complex information tasks. Examples of these types of information tasks could be "Teach me about the causes of climate change." or "Help me write the literature survey for this paper." These require complex discussion and long-running modelling of the user and their information task. We propose building on recent advances in machine learning to adapt a general purpose information agent for specialized domains (like health, law, finance) by "machine reading" of text (such documents from a website) to learn a domain model and to discover information tasks automatically from existing interaction data such as search logs, existing conversations, or help tickets. The result of this work will be information agents that can effectively work with the user (including asking questions back and forth) and explain their reasoning more effectively than current information assistants.

Related Projects

Project Reference Relationship Related To Start End Award Value
EP/V025708/1 01/01/2021 29/09/2023 £1,623,810
EP/V025708/2 Transfer EP/V025708/1 30/09/2023 31/12/2025 £1,033,083
 
Description The result of this award includes the development of new algorithms and a new open-source system for virtual personal assistants, called the Open Assistant Toolkit (OAT). It was deployed on Alexa in the US in the Alexa Prize Taskbot challenge and won first and second prizes (2022, 2023) helping users accomplish complex real-world tasks like cooking and home repair. The award also enabled the creation of multiple new benchmark datasets and evaluation methods to measure progress on conversational search systems in partnership with the U.S. National Institute for Standards and Technology (NIST) in the Text Retrieval Conference (TREC).
Exploitation Route Companies and researchers can use and extend the open-source tools and frameworks developed to build new virtual assistants for their target domains. They can leverage the benchmarks created to measure the progress of their search components and extend them to specialised sectors, such as financial research and medicine.
Sectors Digital/Communication/Information Technologies (including Software)

Financial Services

and Management Consultancy

Healthcare

 
Description ChatGPT training
Geographic Reach Local/Municipal/Regional 
Policy Influence Type Influenced training of practitioners or researchers
 
Description Amazon Alexa Prize Challenge
Amount $250,000 (USD)
Organisation Amazon.com 
Sector Private
Country United States
Start 05/2021 
End 06/2025
 
Description Amazon Alexa Prize Challenge
Amount $250,000 (USD)
Organisation Amazon.com 
Sector Private
Country United States
Start 01/2022 
End 09/2022
 
Title Open Assistant Toolkit 
Description OAT is a task-oriented open-source conversational agent platform for supporting multimodal complex tasks. OAT is modular and extensible, enabling complex stateful behavior for human-like long-form interaction. It supports multiple conversational voice platforms as well as multimodal output. 
Type Of Material Improvements to research infrastructure 
Year Produced 2021 
Provided To Others? Yes  
Impact It's been tested and evaluated by Amazon during multiple Taskbot challenges; winning first and second prize in the data challenges. 
URL https://github.com/grill-lab/OAT
 
Title TaskMAD: Task Multimodal Agent Dialogue 
Description Task-oriented Multimodal Agent Dialogue (TaskMAD) is a new platform that supports the creation of interactive multimodal and task-centric datasets in a Wizard-of-Oz experimental setup. TaskMAD includes support for text and voice, federated retrieval from text and knowledge bases, and structured logging of interactions for offline labeling. 
Type Of Material Improvements to research infrastructure 
Year Produced 2022 
Provided To Others? Yes  
Impact Multiple publications and demos at leading IR conferences. Used by other research labs (USC, Radboud) for conversational search research and data gathering. 
URL https://github.com/grill-lab/TaskMAD
 
Title CODEC - COmplex Document and Entity Collection 
Description This is a new dataset for benchmarking systems for complex entity-based retrieval, such as financial analysts or social science researchers. It provides research topics, queries, judgments, etc... to use as training and evaluation data for information retrieval algorithms. 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
Impact None significant, yet. It is currently under submission and was just released last month. 
URL https://github.com/grill-lab/CODEC
 
Title Cooking With Conversation 
Description A dataset of wizard-of-oz conversations on the topic of cooking. It supports the paper: "Cooking with Conversation: Enhancing User Engagement and Learning with a Knowledge-Enhancing Assistant" 
Type Of Material Database/Collection of data 
Year Produced 2024 
Provided To Others? Yes  
Impact N/A - Just released. 
URL https://github.com/AlexFrummet/cooking-with-conversations
 
Title Deep Learning HARD 
Description This is a new dataset focused on evaluation of deep learning-based retrieval algorithms. It extends previous benchmark datasets and introduces new methodology for identifying and incorporating challenging cases. It includes a rich set of data used to benchmark and evaluate deep-learning based web search algorithms. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
Impact It has been used and cited by multiple research groups. It influenced the direction of the TREC Deep Learning track for future years, towards different types of problems. 
URL https://github.com/grill-lab/DL-Hard
 
Title TREC Conversational Assiastance Track 2021 & 2022 
Description This is a new benchmark dataset for conversational search developed in partnership with the US National Institute for Technology and Standards (NIST). It is used to measure progress and evaluate algorithms for conversational search. It was used by fifteen worldwide teams competing in the 2021 and 2022 TREC Conversational Assistance Tracks (CAsT). 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
Impact It was the first to benchmark the use of new dense retrieval approaches for conversational search. 
URL https://trec.nist.gov/data/cast2021.html
 
Title TREC Interactive Knowledge Assistant Track (iKAT) 
Description A benchmark for interactive conversational assistants with contextual personalised knowledge statements. 
Type Of Material Database/Collection of data 
Year Produced 2023 
Provided To Others? Yes  
Impact Used by leading research groups (dozens of participants) from leading research groups around the world. 
URL https://www.trecikat.com/
 
Description Amazon Alexa Prize 
Organisation Amazon.com
Country United States 
Sector Private 
PI Contribution My group creates the GRILLBot conversational agent skill that is deployed to US Amazon Alexa users. This includes research, software development, and deployment.
Collaborator Contribution The partners provide software support (Cobot), weekly demo and feedback sessions with software engineers and UI experts, AWS cloud compute resources, and more.
Impact VILT: Video Instructions Linking for Complex Tasks GRILLBot: An Assistant for Real-World Tasks with Neural Semantic Parsing and Graph-Based Representations GRILLBot: A flexible conversational agent for solving complex real-world tasks Open Assistant Toolkit (software) Open Assistant Toolkit -- version 2 (software) GRILLBot In Practice: Lessons and Tradeoffs Deploying Large Language Models for Adaptable Conversational Task Assistants GRILLBot-v2: Generative models for multi-modal task-oriented assistance
Start Year 2021
 
Description BBC Scotland Beeb Virtual Assistant 
Organisation British Broadcasting Corporation (BBC)
Department BBC Scotland
Country United Kingdom 
Sector Private 
PI Contribution Myself and a research assistant worked closely with the BBC Scotland Voice and AI on a prototype system, UoG Bot. This was a full time position for an RA and approximately 20% of my time. We contributed and developed machine learning models for conversational exploration and conversational search. The work and partnership and impact from this project is ongoing as part of the Fellowship.
Collaborator Contribution The partners provided substantial support including a repository of news and sound content, four engineers part time working on the project, as well as project managers and experience developers. They assisted by contributing user study. They are currently assisting with releasing data and software.
Impact COMEX: A Multi-task Benchmark for Knowledge-grounded COnversational Media EXploration https://dl.acm.org/doi/abs/10.1145/3543829.3543830
Start Year 2020
 
Description Radboud-Glasgow Collaboration 
Organisation Radboud University Nijmegen
Country Netherlands 
Sector Academic/University 
PI Contribution My team collaborations on creation of personal knowledge graphs from conversational data. This involves my postdoc Shubham Chaterjee and PhD Students Paul Owoicho working closely with Radboud students and staff to collect data and run experiments. We provide our open assistant toolkit (OAT) as platform for testing.
Collaborator Contribution The partners provide students, staff, and data in the collaboration for running experiments. They also provide their CREL entity linking software for analyzing conversations and performing information extraction.
Impact Doing Personal LAPS: LLM-Augmented Dialogue Construction for Personalized Multi-Session Conversational Search
Start Year 2022
 
Title CAsT Searcher 
Description CAsT Searcher is a simple tool developed to help with creating the evaluation topics for the 2021 edition of the Conversational Assistance Track. The tool allows topic developers to visually assess the behaviour of a retrieval system, ultimately making it easier to develop challenging, but interesting, topics for the Track. 
Type Of Technology Webtool/Application 
Year Produced 2021 
Open Source License? Yes  
Impact Used in the development of the TREC CAsT evaluation benchmark dataset that the group released. 
URL https://github.com/grill-lab/CAsTSearcher
 
Title DiffIR 
Description DiffIR is a tool for visually 'diffing' the difference between two sets of rankings. Given a pair of TREC runs containing rankings for multiple queries, DiffIR identifies contrasting queries that have "substantially" different results between the two systems and generates a visual side-by-side comparison illustrating how the key rankings differ. This was developed and appeared as a SIGIR 2021 demo. 
Type Of Technology Software 
Year Produced 2021 
Open Source License? Yes  
Impact It's been used my multiple research groups, including being starred and watched by tens of groups. 
URL https://github.com/capreolus-ir/diffir
 
Title Open Assistant Toolkit - OAT 
Description A conversational task agent, this is the open source version of the 2022 winning Alexa Prize Taskbot system, GRILLBot developed with the ability to allow conversational interactions with Alexa voice and screen devices. It is currently deployed and serving US customers in the Amazon Alexa Taskbot challenge 2022-2023. 
Type Of Technology Software 
Year Produced 2022 
Open Source License? Yes  
Impact It's been used as the basis for a SIGIR 2022 tutorial, WWW 2023 tutorial, and workshops with leading research and industry groups, including Stanford University and Square (a US payments company). 
 
Title TaskMAD 
Description Task MAD is a new Wizard-of-Oz data collection platform for conversational systems. It supports the collection of conversational data for interactive tasks (like cooking or researching a product). It's novel in that it supports rich task context, structured system actions, integrates knowledge-based search, and include rich support for images and videos in a conversational interface. 
Type Of Technology Software 
Year Produced 2022 
Open Source License? Yes  
Impact This software was just released. 
URL https://github.com/grill-lab/taskmad
 
Company Name Malted AI Ltd 
Description  
Year Established 2023 
Impact There are approximately 2 full-time scientific posts within the company.
 
Description BBC about ChatGPT 
Form Of Engagement Activity A broadcast e.g. TV/radio/film/podcast (other than news/press)
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact Appeared on BBC Tech Tent on December 9th
https://www.bbc.co.uk/programmes/w3ct4khv
This was talking about ChatGPT and the future of conversational models to the general public.

This also aired on Click short on the BBC
https://www.bbc.co.uk/iplayer/episode/m001gf3r/click-short-edition-17122022
This was again about the implications of ChatGPT for the general public and developments in conversational language models.
Year(s) Of Engagement Activity 2022
URL https://www.bbc.co.uk/programmes/w3ct4khv
 
Description CSaP Policy Leaders Fellowship Roundtable 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Policymakers/politicians
Results and Impact Spoke at a roundtable discussion to approximately a dozen leading civil servant and government workers on AI policy. It sparked significant discussion during the roundtable, the continued dinner discussion on related subject matters on the topic of: How should government respond to advances in AI technologies?.
Year(s) Of Engagement Activity 2023
URL https://www.csap.cam.ac.uk/network/policy-fellowship/1344-ae0932d4b50d01d8a619058ddb59e5b2a394e71f/
 
Description European Chatbot Summit Keynote 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Industry/Business
Results and Impact A keynote talk at an industry and professional conference to primarily business leaders and practitioners in industry, given to approximately 60 people in person and 500-1000 virtually online. This sparked discussion on the future of voice technology and policy on privacy, etc... in related subject areas.
Year(s) Of Engagement Activity 2022
URL https://theeuropeanchatbot.com/
 
Description Rework Conversational AI Summit 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Professional Practitioners
Results and Impact The goal of this was to communicate research advances in conversational AI to group of industry practitioners. There were approximately 100 people in the room as well as 2-300 attending online.
Year(s) Of Engagement Activity 2022
URL https://london-conv-ai.re-work.co/
 
Description Sky News Podcast - Search Engine Wars 
Form Of Engagement Activity A broadcast e.g. TV/radio/film/podcast (other than news/press)
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Public/other audiences
Results and Impact Interview on Sky News Podcast about BingGPT and the future of search
https://news.sky.com/story/search-engine-wars-battle-of-the-chatbots-12807081
Year(s) Of Engagement Activity 2023
URL https://news.sky.com/story/search-engine-wars-battle-of-the-chatbots-12807081