Argument Mining

Lead Research Organisation: University of Dundee
Department Name: Computing

Abstract

Argument and debate form cornerstones of civilised society and of intellectual life. Processes of argumentation run our governments, structure scientific endeavour and frame religious belief. Recognising and understanding argument are central to decision-making and professional activity in all walks of life, which is why we place them at the centre of academic pedagogy and practice; it's why such a premium is placed upon these skills; and it's why rationality is one of the very defining notions of what it is to be human.

Our theories of how argument is structured go back to Ancient Greece. In the past thirty years or so, the computational sciences have started to build models and engineer software based on these theories: this is the field of argument technology, and the recent surge in activity is testament to the vitality and broad applicability of the field. Though argument technology, in which the UK is a world leader, has had applications in domains as diverse as healthcare, public policy, government and the media, the focus has been squarely upon Artificial Intelligence technologies for supporting human argumentation and subsequent automated reasoning with the results. Arguments made outside such software walled gardens have been off the agenda simply because automatic machine understanding of unfettered naturally occurring reasoning has been too hard to tackle.

Before 2014, that task -- argument mining -- had been tackled only speculatively and only in specific domains by a very small number of groups such as those at Toronto, Leuven and Dundee. By the end of 2014, more than twenty research labs across the US and EU were gearing up to tackle the problem, there were several international meetings including a regular workshop series at the largest computational linguistics conference, and dozens of results being reported. The reason for this huge upswing in activity lies in maturing technology and the returns available. Opinion mining has transformed the way that market research and PR is carried out, deploying big data analysis techniques to understand the attitudes people hold towards products and brands. Sentiment analysis has had an even greater impact in predicting financial markets by analysing broad moods and perspectives that are expressed in the press. Argument mining is the natural evolution of these technologies, providing a step change in the level of detail available -- moving from not just analysing what opinions people hold, but why they hold the opinions they do. This is why major organisations such as IBM, with whom we are partnering in this project, are so interested in the technology.

The Centre for Argument Technology now curates the largest publicly accessible corpus of analysed argument in the world, and has a well known and widely used tool stack for managing datasets, conducting analyses, and visualising the results. This provides a unique platform from which we can both extend existing techniques for argument mining, but also, much more ambitiously, use insights from the philosophy of argumentation and from rhetoric to transform the reliability and applicability of argument mining technology. In particular, we will use the theory of argumentation schemes that characterises stereotypical patterns of reasoning to guide the process of searching for argument components, and the theory of rhetorical figures and tropes as the basis for developing a new class of algorithms for argument recognition. We will thus be transforming bare statistically-driven approaches with detailed theories of structure which can act to define expectations in a way that constrains the machine learning task thereby improving accuracy and applicability. By partnering with IBM and J&L Techology (a domain-specific SME), the project aims not just to radically improve performance of these techniques, establishing the UK's position at the cutting edge, but also to deliver those performance gains to end users.

Planned Impact

The Centre for Argument Technology has a sustained, fifteen-year track record of delivering transformative research results that have enabled both engineered infrastructure (see aifdb.org) and practical applications (arg-tech.org). Argument Mining represents an area in which the UK has the potential to be the world leader if it capitalises upon its early successes in this high potential global market (estimated to be worth $10bn by 2020). This project aims to deliver foundational results in its first three years which are then scaled up and deployed in the fourth year.

One of the challenges in developing a transformative programme of research is that, on the one hand, working with too many partners quickly exhausts the effort available to manage the relationships effectively. On the other hand, working with too few risks over-specialising techniques to a particular user and, in the worst case, introducing a single point of failure. Our approach to this dilemma is to invite a number of companies (such as Amazon and the BBC with both of whom we already have a relationship) to engage in a series of meetings in year four, but more importantly to work closely throughout the project with two commercial partners -- companies that have substantially different profiles.

IBM is re-aligning much of its service provision around Watson, a cloud-based tool-suite to which Argument Mining is a natural companion. IBM thus represents a 'channel partner': a way for the results of the project to delivered to users of IBM's services. We will also work with J&L Technology, an SME working on a single product in a single domain, but a product that is likely to be highly disruptive in a domain that is traditionally woefully under-served by technology. For J&L's offering in the market, Argument Mining introduces functionality that represents an important unique selling point. J&L is currently piloting its software in one of the top ten organisations in its domain in the world (an organisation with an annual IT spend in excess of $25m). J&L thus represents an 'end user' of Argument Mining technology.

Despite their differences, both IBM and J&L also act as 'thought leaders' in their respective markets: they are seen by their customers to be establishing the trends that others will follow. If the techniques developed in this project were to be advocated by our two commercial partners, there is potential for very widespread impact as a result of this thought-leadership. Our programme of collaboration involves both formal structures (including quarterly meetings of the project's Commercial Board with representatives from both companies), and practical work practices, resting heavily on a series of bilateral exchanges of staff including over six months of residencies for University project staff with the partner organisations. The Commercial Board ensures that ad hoc, responsive communication is complemented by regular, scheduled interaction, whilst the residencies are a key way of enabling staff in the University to gain deep insight into the challenges and opportunities presented by the domains of the commercial partners, and at the same time for staff at the partners to better understand the problems and solutions investigated in the project. To ensure that the project's work is disseminated more broadly in the commercial partners (which is particularly important for such a large organisation as IBM), we will also run Demo Days on site at the partners, with open invitations to staff from across the organisation to see project results and explore opportunities.

For both IBM and J&L Technology, the project represents an area of strategic importance, which is why the two companies are together committing contribution in kind estimated to be worth over £272k, thereby significantly increasing the value for money for the public purse.

Publications

10 25 50
publication icon
Atkinson K (2017) Toward Artificial Argumentation in AI Magazine

publication icon
Budzynska K (2019) Advances in Argument Mining

publication icon
Jacky Visser (2018) Revisiting computational models of argument schemes in Computational Models of Argument - Proceedings of COMMA 2018

publication icon
Jo Y (2021) Classifying Argumentative Relations Using Logical Mechanisms and Argumentation Schemes in Transactions of the Association for Computational Linguistics

publication icon
John Lawrence (2018) BBC moral maze in Computational Models of Argument - Proceedings of COMMA 2018

 
Description Early results have demonstrated (a) that automatic mining of inferential relationships from unconstrained natural text is feasible; and (b) that rhetorical figures have an important role to play in this task. We have developed new measures for evaluating performance of techniques in the domain, and have built and released datasets that help to train algorithms both in our lab and in the wider community. These algorithms are now being used by teams in computational linguistics at IRIT Toulouse, in theoretical linguistics at the University of Konstanz in Germany and in political science at Griffith University in Australia.
Exploitation Route We have had recent interest from government in technology support for public participation, and particularly for participatory budgeting. This is the premise for a new £0.5m project secured from VW Foundation. We are applying now to extend this work with an application from the same funder for £1.5m

In addition, interest in these techniques as a way of identifying individuals in an approach iknown as Dialogical Fingerprinting has now won support from DSTL under their Defence Accelerator programme. This in turn has led to our securing funding of £576k under Dstl's Serapis framework involving application of our techniques for suppoorting intelligence analysis.
Sectors Communities and Social Services/Policy,Creative Economy,Digital/Communication/Information Technologies (including Software),Government, Democracy and Justice,Security and Diplomacy

URL http://arg.tech
 
Description Early results led to an article in New Scientist magazine (http://arg.tech/newscientist) in 2016 and then another in BBC News (http://www.bbc.co.uk/news/technology-41010848) in 2017, plus an article in the Conversation syndicated in The Independent, Newsweek and others (http://www.newsweek.com/artificial-intelligence-argument-debate-752199). The technical results in argument mining are being wrapped up for external use. The first live deployment of argument mining techniques anywhere in the world were lauched in conjunction with BBC News Day in March 2018 (https://www.bbc.co.uk/news/school-report-43391188). The BBC made two commissions for this event. For 11-16yo they commissioned Aardman Animation to create a game helping children understand fake news. For 16-18yo, they commissioned us to develop software that helps pupils identify fake news using critical thinking. The software, called The Evidence Toolkit, was released with full teacher notes into every secondary school in the UK. It received very positive feedback, with three quarters of users saying it helped them think more deeply about the issues. In 2019, the results are also starting to be used in the intelligence analysis sector, with Dstl-funded applied R&D work in the Centre for Argument Technology looking to deply TRL4 prototypes, initially with £94k DASA funding, then with a further £0.9m of Dstl funding in 2020-22.
First Year Of Impact 2016
Sector Aerospace, Defence and Marine,Creative Economy,Digital/Communication/Information Technologies (including Software),Education,Government, Democracy and Justice,Security and Diplomacy
Impact Types Cultural,Societal,Economic,Policy & public services

 
Description ACL 2016
Geographic Reach Multiple continents/international 
Policy Influence Type Influenced training of practitioners or researchers
Impact Graduate level training at the foremost computational linguistics meeting.
URL http://acl2016tutorial.arg.tech/
 
Description ACL 2019
Geographic Reach Multiple continents/international 
Policy Influence Type Influenced training of practitioners or researchers
URL https://aclanthology.org/P19-4008/
 
Description All Party Parliamentary Group on AI
Geographic Reach National 
Policy Influence Type Contribution to a national consultation/review
URL https://www.appg-ai.org/event/evidence-meeting-7-citizen-participation-ai-me/
 
Description ESSLLI 2017
Geographic Reach Multiple continents/international 
Policy Influence Type Influenced training of practitioners or researchers
Impact Graduate level training at a preeminent event in the area
URL https://www.irit.fr/esslli2017/courses/20.html
 
Description Submission to Parliamentary & Scientific Committee
Geographic Reach National 
Policy Influence Type Contribution to a national consultation/review
URL https://www.scienceinparliament.org.uk/science-in-parliament-member/
 
Description Dialogical Fingerprinting (Defence Accelator - Behavioural Analytics)
Amount £94,498 (GBP)
Funding ID ACC6005123 
Organisation Defence Science & Technology Laboratory (DSTL) 
Sector Public
Country United Kingdom
Start 03/2019 
End 11/2019
 
Description Postdoctoral Research in the Computational Social Sciences
Amount € 750,000 (EUR)
Organisation Volkswagen Foundation 
Sector Charity/Non Profit
Country Germany
Start 05/2017 
End 05/2020
 
Description Serapis Framework Lot 6
Amount £576,075 (GBP)
Funding ID ITT/ R1000124743 
Organisation Defence Science & Technology Laboratory (DSTL) 
Sector Public
Country United Kingdom
Start 01/2020 
End 03/2021
 
Description Serapis Lot 6 U33
Amount £234,668 (GBP)
Funding ID 012217 - 122858V 
Organisation Defence Science & Technology Laboratory (DSTL) 
Sector Public
Country United Kingdom
Start 06/2021 
End 03/2022
 
Title AIFdb corpora 
Description A suite of tools for creating, managing and manipulating corpora of argumentation. 
Type Of Material Database/Collection of data 
Year Produced 2016 
Provided To Others? Yes  
Impact Datasets from Cornell, Rutgers, Potsdam and UC Santa Cruz are currently available on our infrastructure, in addition to many more created here. 
URL http://corpora.aifdb.org
 
Description Dundee-BBC 
Organisation British Broadcasting Corporation (BBC)
Country United Kingdom 
Sector Public 
PI Contribution Development of prototype debate systems
Collaborator Contribution Feedback at various levels (technical, mid-level managament, controller-level management)
Impact Multidisciplinary between computational science and journalism
Start Year 2007
 
Description Dundee-CMU 
Organisation Carnegie Mellon University
Country United States 
Sector Academic/University 
PI Contribution Co-supervision of CMU PhD student, Yohan Jo by Reed. Jo Visited Dundee for 6 months in 2018-19.
Collaborator Contribution Co-supervision of CMU PhD student, Yohan Jo by Prof E Hovy.
Impact https://doi.org/10.1162/tacl_a_00394 https://doi.org/10.18653/v1/2020.emnlp-main.1 https://aclanthology.org/2020.lrec-1.127/ https://doi.org/10.18653/v1/2020.emnlp-main.2 https://doi.org/10.18653/v1/W19-4502 Artificial Intelligence, Linguistics, Philosophy
Start Year 2018
 
Description Dundee-IBM 
Organisation IBM
Country United States 
Sector Private 
PI Contribution Foundation research in argument mining.
Collaborator Contribution Feedback on the direction, goals and techniques of the research, and provision of access to IBM software tools.
Impact None yet.
Start Year 2016
 
Description Dundee-IRIT 
Organisation Toulouse Institute of Computer Science Research
Country France 
Sector Private 
PI Contribution Collaboration on issues in dialogical argument mining: we supply data and theory.
Collaborator Contribution Collaboration on issues in dialogical argument mining: IRIT supplies computational expertise.
Impact Budzynska, K., Janier, M., Kang, J., Reed, C.A., Saint-Dizier, P., Stede, M. & Yaskorska, O. (2015) "Automatically identifying transitions between locutions in dialogue" in Proceedings of the 1st European Conference on Argumentation (ECA 2015), College Publications, Lisbon. Budzynksa, K., Janier, M., Kang, J., Reed, C., Saint Dizier, P., Stede, M. & Yaskorska, O. (2014) "Towards Argument Mining from Dialogue" in Parsons, S., Oren, N., Reed, C. & Cerutti, F. (eds) Proceedings of the Fifth International Conference on Computational Models of Argument (COMMA 2014), IOS Press, Pitlochry, pp185-196. Budzynska, K., Janier, M., Reed, C. Saint-Dizier, P., Stede, M. & Yaskorska, O. (2014) "A Model for Processing Illocutionary Structures and Argumentation in Debates" in Calzolari, N. et al. (eds) Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), ELRA, Rejkavik. Budzynska, K., Janier, M., Reed, C.A. & Saint-Dizier, P. (2013) "Towards extraction of dialogical arguments" in Working Notes of the 13th Workshop on Computational Models of Natural Argument (CMNA 2013), Rome. Budzynska, K., Janier, M., Reed, C.A. & Saint-Dizier, P. (2013) "Theoretical foundations for illocutionary structure parsing" in Working Notes of the 13th Workshop on Computational Models of Natural Argument (CMNA 2013), Rome.
Start Year 2010
 
Description Dundee-Konstanz 
Organisation University of Konstanz
Department Department of Biology
Country Germany 
Sector Academic/University 
PI Contribution Collaborating on a grant proposal to VW Foundation in the area of argument mining and linguistic processing for managing government-initiated public discussions such as in the area of participatory budgeting.
Collaborator Contribution As above.
Impact This new collaboration has, as yet, only delivered a new funding award to start in May 2017.
Start Year 2016
 
Title BBC Moral Maze Argument Analytics 
Description Infographic visualisation of analysis of argument and debate from BBC Radio 4's Moral Maze. Updates were live-tweeted during broadcast. The initiative was trailled on the Today programme and PM, the two most-listened to radio programmes in the UK. 
Type Of Technology Webtool/Application 
Year Produced 2017 
Impact James Purnell (formerly the Culture Secretary and at the time of the broadcast, BBC Head of Radio & Education) funded the initiative directly and was sufficiently impressed to release further funding. There was so much positive feedback that it was included on the slate for Radio 4's Feedback -- we were interviewed by Roger Bolton, but editorial priorities meant that the interviews were not broadcast. 
URL http://www.bbc.co.uk/programmes/p05jp46h/p05jp46x
 
Title BBC Taster / Test Your Argument 
Description Test Your Argument -- an application deployed on the BBC Taster platform. 
Type Of Technology Webtool/Application 
Year Produced 2017 
Impact This was the most tried and the most highly rated application on the BBC for two weeks. It was tied in to BBC programming around the 50th anniversary of the Abortion Act. From the BBC's perspective they were very excited that the demographics emphasised engagement from a younger audience with primarily Radio 4 content, and we were all thrilled that in feedback on the app, over 65% of users indicated that using the app had made them think about the issue differently. Confirmed audience views in the first six weeks: 28,000. 
URL http://www.bbc.co.uk/taster/projects/moral-maze
 
Title BBC Taster / The Evidence Toolkit 
Description A tool for training schoolchildren to recognise fake news. 
Type Of Technology Webtool/Application 
Year Produced 2018 
Impact Released as a part of BBC School Report into every secondary school in the UK. 
URL https://www.bbc.co.uk/taster/pilots/evidence-toolkit-moral-maze
 
Description BBC 8 Ways 
Form Of Engagement Activity A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact An article authored by Reed for BBC Radio 4 that made the BBC homepage on 12 Oct 2017 and had 46,000 confirmed hits in its first 24 hours.
Year(s) Of Engagement Activity 2017
URL http://www.bbc.co.uk/programmes/articles/251N2YBLLwmPJnVvDn94GQR/moral-maze-eight-ways-to-win-an-arg...
 
Description BBC News Article 
Form Of Engagement Activity A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact Commissioned by BBC News, Reed authored an article on Argument Mining which appeared on the BBC News website. It received 260,000 hits in the first month.
Year(s) Of Engagement Activity 2017
URL http://www.bbc.co.uk/news/technology-41010848
 
Description BBC School Report 
Form Of Engagement Activity A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Schools
Results and Impact We were commissioned by the BBC to develop an application for 16-18 year olds called The Evidence Toolkit, focused on helping students identify fake news. It was deployed as a part of BBC School Report (now Young Reporter) into every secondary school in the UK. Feedback was outstanding. The work was launched with reporting on the BBC News website, reaching an audience in excess of 100,000. https://www.bbc.co.uk/news/school-report-43391188
Year(s) Of Engagement Activity 2018
URL https://www.bbc.co.uk/taster/pilots/evidence-toolkit-moral-maze
 
Description Futureproof 
Form Of Engagement Activity A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Public/other audiences
Results and Impact A live interview on 'Futureproof', national Irish radio Newstalk. Confirmed audience 60,000.
Year(s) Of Engagement Activity 2018
 
Description New Scientist 
Form Of Engagement Activity A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact Working with a features editor with New Scientist magazine to produce an article on argument mining in which the work of this project featured very prominently. Estimated print audience: 800,000; digital audience: 3,500,000.
Year(s) Of Engagement Activity 2016
URL http://arg.tech/newscientist
 
Description Pindex video 
Form Of Engagement Activity A broadcast e.g. TV/radio/film/podcast (other than news/press)
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact A youtube video produced by Pindex and narrated by Stephen Fry discussing the need for argument technology and the work of the Centre, 02 October 2021 (250,000 views)
Year(s) Of Engagement Activity 2021
URL https://www.youtube.com/watch?v=7cN3rZ5h3LE&ab_channel=Pindex
 
Description Reaction to IBM Debater 
Form Of Engagement Activity A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Media (as a channel to the public)
Results and Impact IBM famously has a Grand Challenge every decade -- twenty years ago it was Deep Blue beating Kasparov; ten years ago it was Jeopardy; last year it was Project Debater. They invited 50 of the world's press to the launch in San Francisco. They also invited a single academic to offer comment -- Reed. Quotes from Reed appeared in The Guardian, The Financial Times, BBC News, New York Times, Wired magazine, CNET, and a brief television interview on BBC Breakfast, 19 Jun 2018
Year(s) Of Engagement Activity 2018
URL https://www.theguardian.com/technology/2018/jun/18/artificial-intelligence-ibm-debate-project-debate...
 
Description Resonance FM 
Form Of Engagement Activity A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Public/other audiences
Results and Impact Pre-recorded interview included in Future Intelligence's PassWord 9 June 2021 broadcast on Resonance FM (national UK radio, programme audience 750,0000) with accompanying online copy
Year(s) Of Engagement Activity 2021
URL https://www.futureintelligence.co.uk/2021/06/09/big-tech-data-scraping-to-discover-our-emotions/
 
Description Stephen Jardine show 2017 
Form Of Engagement Activity A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Public/other audiences
Results and Impact Reed gave an interview on BBC Radio Scotland's Stephen Jardine show.
Year(s) Of Engagement Activity 2017
URL https://soundcloud.com/arg_tech/profesor-chris-reed-talks-with-stephen-jardine-on-radio-scotland
 
Description TalkRADIO 
Form Of Engagement Activity A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Public/other audiences
Results and Impact live interview on Darryl Morris on talkRADIO 28 March 2021
Year(s) Of Engagement Activity 2021
 
Description The Conv / Indy / Newsweek 2017 
Form Of Engagement Activity A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact Reed authored an article that originally appeared in The Conversation but was then picked up by Newsweek in the US, The Independent in the UK and The Indian Economist in India, in addition to various other smaller publications. Confirmed audience reached 10,000 in the first ten days.
Year(s) Of Engagement Activity 2017,2018
URL http://www.newsweek.com/artificial-intelligence-argument-debate-752199
 
Description The Conv 2018 
Form Of Engagement Activity A magazine, newsletter or online publication
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact Article written for The Conversation
Year(s) Of Engagement Activity 2018
URL https://theconversation.com/ibms-debating-computer-an-ai-experts-verdict-98783
 
Description Tommy Boyd Interview 
Form Of Engagement Activity A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Public/other audiences
Results and Impact Reed was invited to give an interview on Talk Radio's Tommy Boyd show. The interview lasted 25 minutes. Confirmed audience figures are not available but estimate from production staff was 250,000.
Year(s) Of Engagement Activity 2017
URL https://soundcloud.com/arg_tech/professor-chris-reed-interview-on-arg-tech-talk-radio