SPeech Across Dialects of English (SPADE): large-scale digital analysis of a spoken language across space and time

Lead Research Organisation: University of Glasgow

Department Name: School of Critical Studies

Abstract

Obtaining a data visualization of a text search within seconds via generic, large-scale search algorithms, such as Google n-gram viewer, is available to anyone. By contrast, speech research is only now entering its own 'big data' revolution. Historically, linguistic research has tended to carry out fine-grained analysis of a few aspects of speech from one or a few languages or dialects. The current scale of speech research studies has shaped our understanding of spoken language and the kinds of questions that we ask. Today, massive digital collections of transcribed speech are available from many different languages, gathered for many different purposes: from oral histories, to large datasets for training speech recognition systems, to legal and political interactions. Sophisticated speech processing tools exist to analyze these data, but require substantial technical skill. Given this confluence of data and tools, linguists have a new opportunity to answer fundamental questions about the nature and development of spoken language. Our project seeks to establish the key tools to enable large-scale speech research to become as powerful and pervasive as large-scale text mining. It is based on a partnership of three teams based in Scotland, Canada and the US. Together we will exploit methods from computing science and put them to work with tools and methods from speech science, linguistics and digital humanities, to discover how much the sounds of English across the Atlantic vary over space and time.

We will develop an innovative and user-friendly software which exploits the availability of existing speech data and speech processing tools to facilitate large-scale integrated speech corpus analysis across many datasets together. The gains of such an approach are substantial: linguists will be able to scale up answers to existing research questions from one to many varieties of a language, and ask new and different questions about spoken language within and across social, regional, and cultural, contexts. Computational linguistics, speech technology, forensic and clinical linguistics researchers, who engage with variability in spoken language, will also benefit directly from our software. This project will also open up vast potential for those who already use digital scholarship for spoken language collections in the humanities and social sciences more broadly, e.g. literary scholars, sociologists, anthropologists, historians, political scientists. The possibility of ethically non-invasive inspection of speech and texts will allow analysts to uncover far more than is possible through textual analysis alone.

Our project will develop and apply our new software to a global language, English, using 43 existing public and private spoken datasets of Old World (British Isles) and New World (North American) English, across an effective time span of more than 100 years, spanning the entire 20th century. Much of what we know about spoken English comes from influential studies on a few specific aspects of speech from one or two dialects. This vast literature has established important research questions which can be investigated for the first time on a much larger scale, through standardized data across many different varieties of English. Our large-scale study will complement current-scale studies, by enabling us to consider stability and change in English across the 20th century on an unparalleled scale. The global nature of English means that our findings will be interesting and relevant to a large international non-academic audience; they will be made accessible through an innovative and dynamic visualization of linguistic variation via an interactive sound mapping website. In addition to new insights into spoken English, this project will also lay the crucial groundwork for large-scale speech studies across many datasets from different languages, of different formats and structures.

Planned Impact

Who will benefit from this research?
The global nature of English means that our findings about Cross-Atlantic English speech across the 20th century will be interesting to a huge, international, non-academic audience, within and well beyond the sponsor countries. Our research tools and findings will be interesting to those working professionally with language and English at different levels, e.g. teachers and students, in industry those working with speech synthesis and recognition, in forensic speech practice, and clinical practice. More broadly, our spoken analysis tool has substantial potential for those working with spoken language in museums, informatics, libraries and schools, as well as the interested general public.

How will they benefit from this research?
SPADE make it possible to search spoken language in the same way as written texts, but without the need to listen to any speech whilst doing so. We aim to develop our spoken language analysis software and apply it to Old and New World (North American Englishes). our immediate findings will be useful and interesting to non-academic users of many kinds, but especially those who work with English. We will also make our source code publically accessible as we develop it, enabling all, including non-academic users, to test and use it for themselves. We will also develop an open access web resource, to make our results accessible during and towards the end of the project. In the longer term, SPADE will enable the ethically non-invasive inspection of speech and texts, and hence will allow analysts of all kinds, not just academics, to uncover far more about their spoken materials than is possible through textual analysis alone.

What will be done to ensure that they have the opportunity to benefit from this activity?
Our main specific outreach activity for SPADE is the creation of a digital media product, through which we will visualise both the spread of sounds and sound changes across the span of the Cross-Atlantic English varieties, and also the results themselves. With present technology we envisage creating a website with interactive (synthesized) sound examples, Mapping the dynamics of English. The large-scale, comparative nature of SPADE means that we can represent places not just with single examples, but with a continuous range of speech variants observed across time and social space at a particular location on the map, offering an innovative, dynamic, view of linguistic variation and change for English. The website will be designed with a responsive interface to allow easy access for fixed computers and mobile devices, and will be built and maintained indefinitely at Glasgow, given the strong expertise and experience within the GlasgowU Digital Humanities Research Network and GULP. Equally important, throughout the duration of the project, SPADE will ensure a lively and interactive relationship with the international public, through updates, announcements and activities via social and digital media (e.g. website, Twitter, Instagram, WordPress blog), and local, national and international public engagement. All members of the project team expect to take part in public engagement events, public talks, and media releases publicising the stages of the research project. We will also prepare and release our software as open-source with full documentation. We will work with both academic, but also skilled non-academic users, to ensure that we develop interfaces which are user friendly for a large range of users. The workshop in the final year of the project at a key Digital Humanities conference will ensure that we communicate effectively not only academic users, but also non-academic beneficiaries wishing to work with, and search through, spoken language corpora.

Funded Value:

£160,991

Funded Period:

Aug 17 - Aug 20

Funder:

ESRC

Project Status:

Closed

Project Category:

Research Grant

Project Reference:

ES/R003963/1

Principal Investigator:

Jane StuartSmith

Research Subject:

Languages & Literature (20%)

Linguistics (80%)

Research Topic:

Computational Linguistics (20%)

English Language & Literature (20%)

Language Variation & Change (20%)

Phonetics (20%)

Sociolinguistics (20%)

Organisations

People	ORCID iD
Jane StuartSmith (Principal Investigator)
Josef Fruehwald (Co-Investigator)

Publications

Author Name

Title Publication Date Published

|< < 1 2 > >|

10 25 50

Kendall T (2023) Advancements of phonetics in the 21st century: Theoretical issues in sociophonetics in Journal of Phonetics

Macdonald R (2024) Speech Dynamics - Synchronic Variation and Diachronic Change

McAuliffe, Michael (2019) ISCAN: a System for Integrated Phonetic Analyses Across Speech Corpora

Mielke, Jeff (2019) Age Vectors vs. Axes of Intraspeaker Variation in Vowel Formants Measured Automatically From Several English Speech Corpora

Smith J (2024) Language in Britain and Ireland

Sonderegger M (2022) The Open Handbook of Linguistic Data Management

Sonderegger, M. (2023) How variable are English sibilants?

Stuart-Smith, Jane (2019) Large-scale Acoustic Analysis of Dialectal and Social Factors in English /s/-retraction

Tanner J (2019) Vowel duration and the voicing effect across English dialects in Toronto Working Papers in Linguistics

Tanner J (2020) Toward "English" Phonetics: Variability in the Pre-consonantal Voicing Effect Across English Dialects and Speakers. in Frontiers in artificial intelligence

Key Findings
Impact Summary
Further Funding
Research Databases and Models
Software and Technical Products
Engagement Activities


Description	The SPADE project aims to develop accessible speech corpus analysis tools, and then to use these tools to investigate Old and New World Englishes across time (the course of the 20th century) and space (UK and Ireland; US and Canada). The formal phase of the project ended in August 2020, but completion of project deliverables has been impacted by disruption to all aspects of project work by the COVID-19 pandemic. Glasgow University awarded an Extension Funding until September 2021, in order to enable formal deposit with ESRC Data Archive, and continue analysis/interpretation of results, prep datasets still coming in delayed by COVID, and work on the Shiny web app for the outreach resource. Since April 2018, we have: (1) developed and released the first version of the Integrated Speech Corpus ANalysis software, ISCAN v 0.1, which was tested within the project and had its first public outing at the NWAV ISCAN workshop in October 2018. ISCAN has now been installed on several computers at McGill, and in remote locations in Glasgow and North Carolina State University, and team-members at Oregon have also been also to successfully worked remotely with it on McGill server. The ISCAN software has two instantiations for users, 'under the hood' via scripts, or through the ISCAN GUI, Graphical User Interface, which enables less experienced users to search, analyse and query multiple corpora. Substantial software development includes: shift to a server-client architecture; work on scaling up and optimisation of corpus importers and analytical procedures; documentation. (2) we have now developed fast, consistent, acoustic analytical tools for the analysis of vowels, rhotics, sibilants and stops, specifically: vowel formants, sibilant spectral characteristics, segment durations, Voice Onset Time for stops, with dynamic (tracks) and static (single point) measures. (3) implemented our data collection using our novel GDPR-compliant Data Transfer Agreement, to make substantial progress with collection of Phase 2 privately held datasets across all three countries - to date 40 of the anticipated 43+ datasets have been collected, and are in different states (from fully measured, to being prepped, to being able to be transferred to our datasets 'library' at McGill). (4) used the new algorithms to import and analyse 42 speech corpora totalling around 8600 speakers for 30 English dialects, for subprojects on vowel duration, as well as vowels, sibilants, and rhotics, comprising the first, large-scale analyses of these sound categories across such a large number of speakers and dialects of English; (5) initiate new subprojects on sibilants, vowel dynamics, Scottish Vowel Length Rule, Scottish rhotics, vowel duration, and VOT; (6) presented 3 posters (WSC5, UDavis, California, June19; LabPhon16, Lisbon, June18), 14 talks, including 3 workshops, and 1 special session (UKLVC13, Glasgow, September2021; JK28 Seoul; ALS2020 Online December2020; LabPhon17 Virtual July 2020; r-atics6, Paris, November2019; BICLCE2019, Bamburg, Sep19; ICPhS19, Melbourne, Aug19; NWAV47, NY, Oct18; Huddersfield, Aug18; Newcastle, Nov18); 3 published conference papers (ICPhS2019, Melbourne, Aug19); 1 poster acceptances; 1 published chapter on data management for SPADE/ISCAN, MIT Open Handbook of Linguistic Data Management; 1 paper published Frontiers in Computational Sociolinguistics; 4 invited keynotes (r-atics7, Lausanne, November2021; 3rd International Conference on Applied Phonetics (ISAPh2021) hosted Tarragona online September2021; Bern, May2020; ICAME, Heidelberg, May2020); 2 future talk acceptances (BAAP2022).
Exploitation Route	Our software for analyzing speech corpora will be ultimately useful to anyone wishing to analyse aspects of spoken language, especially recordings which may be private for ethical reasons, so for those working in the law, digital./communication/IT, education, and healthcare. We are currently developing a Shiny webapp, to demonstrate our data findings in an accessible and useful way to those working with speech, e.g. phoneticians, speech tech, forensic and clinical phonetics. More specifically during the lifetime of the project, we anticipate those working on/with spoken language in different ways to benefit from our methodological developments and research findings.
Sectors	Creative Economy Digital/Communication/Information Technologies (including Software) Education Healthcare Government Democracy and Justice Culture Heritage Museums and Collections
URL	https://spade.glasgow.ac.uk/


Description	We have continued to give presentations on SPADE, the project, the methods, tools and approach at a range of workshops, events and colloquia. These have included both academic and non-academic audiences, including e.g. Oxford University Press Dictionary team, Amazon Tech, forensic phoneticians and caseworkers, and our national and international funders - amongst others. Since 2021 we have been working on a Shiny webapp, which presents the SPADE data (acoustic measures for vowels and consonants) as visual plots over time, and over geographical space, using maps. We presented an updated version of the web app to the international phonetics community, at the major ICPhS2023 conference in Prague, Czech Republic, in August 2023, and being continually invited to present on our findings and methods devised during the project.
First Year Of Impact	2018
Sector	Digital/Communication/Information Technologies (including Software),Education,Government, Democracy and Justice,Security and Diplomacy
Impact Types	Societal


Description	Comment notre voix trahit notre identité : une étude empirique de l'indexicalité dans la forme acoustique du langage (Melanie Lancien, Postdoc.Mobility)
Amount	SFr. 105,600 (CHF)
Funding ID	P500PH_211162
Organisation	Swiss National Science Foundation
Sector	Public
Country	Switzerland
Start	09/2022
End	08/2023


Description	Tracking talker dynamics within and across English dialects (James Tanner, BA Postdoc)
Amount	£269,212 (GBP)
Funding ID	PF21\210105
Organisation	The British Academy
Sector	Academic/University
Country	United Kingdom
Start	08/2023
End	08/2026


Description	Variability in Child Speech (VariCS)
Amount	£655,808 (GBP)
Funding ID	ES/W003244/1
Organisation	Economic and Social Research Council
Sector	Public
Country	United Kingdom
Start	07/2022
End	11/2026


Title	OSF project for SPeech Across Dialects of English (SPADE)
Description	The OSF Project comprises the dissemination component of SPADE. In this first release, we make available acoustic measures for sibilants and durations and static formants for vowels, for 39 corpora (~2200 hours of speech analysed from ~8600 speakers), anonymised where required, with information about dataset generation.
Type Of Material	Database/Collection of data
Year Produced	2020
Provided To Others?	Yes
Impact	This database is not yet completed, given limitations of COVID-19, which impacted on the final 6 months of the SPADE project.
URL	https://osf.io/4jfrm/


Title	ISCAN (Integrated Speech Corpus ANalysis) software v 0.1
Description	A key methodological output of the SPADE project is the development of freely-accessible, integrated speech corpus analysis software, which produces consistent acoustic phonetic measures from multiple spoken language corpora of diverse formats. The first version of ISCAN was released in 30 July 2018. It was extensively tested by the project team until October, when a public tutorial enabling users to carry out a set of tutorial routines took place in October 2018.
Type Of Technology	Software
Year Produced	2018
Open Source License?	Yes
Impact	ISCAN software was introduced to sociolinguists and phoneticians at the NWAV47 conference special workshop, 18 October 2018. The software was greeted with substantial enthusiasm, given the quality of the acoustic analysis, the speed of analysis across large speech corpora, and the fact that it can work with multiple diverse corpus formats. Members of the workshop requested an extension to languages other than English - ISCAN is not a language-specific tool, but we are currently working to ensure analyses (e.g. vowel analyses) can be carried out irrespective of language heritage.
URL	https://github.com/MontrealCorpusTools/ISCAN


Title	PolyglotDB version 1.2.1
Description	PolyglotDB is a Python package for storing and querying large speech corpora. It constructs various kinds of database, and has a consistent Python API for interacting with the various underlying databases. The online documentation is available at http://polyglotdb.readthedocs.io/en/latest/.
Type Of Technology	Software
Year Produced	2021
Open Source License?	Yes
Impact	PolyglotDB was devised for the SPADE project, as an adaptation from the Montreal Corpus Tools. This version of the software (1.2.1) was released in 2021.
URL	https://github.com/MontrealCorpusTools/PolyglotDB


Title	SPeech Across Dialects of English (Shiny app)
Description	The SPADE Shiny app is an interactive web app, which enables users to display and explore the acoustic data for English speech sounds, through visualisation on geographical maps, and in plots over time, by one of more dialects. Users can also hear synthesized versions of particular vowel sounds across different dialects and timepoints; and see representations of acoustic vowel spaces for dialects, and speakers within dialects.
Type Of Technology	Webtool/Application
Year Produced	2021
Open Source License?	Yes
Impact	Substantial interest from the phonetics community and colleagues when the Shiny app has been presented.
URL	https://shiny.chass.ncsu.edu/spade/stable/


Description	19th SIGMORPHON Workshop on Computational Research in Phonetics: Multidimensional acoustic variation in vowels across English dialects
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Talk given at 19th SIGMORPHON Workshop on Computational Research in Phonetics (14 July 2022). Presented by James Tanner, on behalf of James Tanner, Morgan Sonderegger, Jane Stuart-Smith and The SPADE Consortium. The paper was extremely well received and was awarded the prize for Best Paper.
Year(s) Of Engagement Activity	2022
URL	https://sigmorphon.github.io/workshops/2022/program/


Description	Keynote talk - /r/ you listening? Methodological challenges and theoretical insights from investigating a 'rhotic' English dialect. Methods in Dialectology XVIII. La Trobe University. Melbourne. 1-5 July 2024
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Invited keynote for international conference on sociolinguistics methods. A great deal of interest in long series of research studies was noted.
Year(s) Of Engagement Activity	2024


Description	Keynote talk - Beyond the auld alliance? Using automated processing to identify variation and change in French and Scottish /r/. LSRL: Romance languages in a rapidly-changing world. ENS. Paris Saclay. 26 June 2023.
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Invited Keynote talk which presented the first collaborative research between SPADE and the French ANR-funded DIPVAR project (Adda-Decker; Lamel; Vasilescu). Substantial interest in articulatory research on rhotic sounds, and in the SPADE research on Scottish rhotics.
Year(s) Of Engagement Activity	2023


Description	Keynote talk - Digging into English vowels with (a) SPADE: Reflections on vowel duration and quality from corpus phonetics. Phonetics and Phonology Denmark (PPDK). University of Copenhagen. 17-18 November 2022.
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Invited Keynote talk to the Danish Phonetics and Phonology Conference (PPDK). Substantial interest in the SPADE research, and the Shiny outreach web app for demonstrating acoustic speech data across geographical dialects of English.
Year(s) Of Engagement Activity	2022


Description	Keynote talk - What can speakers tell us about speech? 20th International Congress of Phonetic Sciences (ICPhS 2023). Prague Congress Center, Czech Republic. 7-11 August 2023.
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	Keynote talk at the major international phonetics conference. Featured both articulatory research and recent SPADE research, including the Shiny web app which maps acoustic measures across geographical dialects of English.
Year(s) Of Engagement Activity	2023


Description	Keynote talk at 3rd International Symposium on Applied Phonetics (ISAPh2021). Moving targets: Insights into speaker and dialect variability from articulatory and acoustic phonetic studies across English
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Keynote talk at 3rd International Symposium on Applied Phonetics (ISAPh2021) , virtual, hosted by Univertat Rovira i Virgili, Tarragona. 6 September 2021. Moving targets: Insights into speaker and dialect variability from articulatory and acoustic phonetic studies across English
Year(s) Of Engagement Activity	2021
URL	https://wwwa.fundacio.urv.cat/congressos/isaph2021/#:~:text=We%20are%20very%20excited%20to%20be%20ba...


Description	SPADE at BAAP 2018
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	National
Primary Audience	Postgraduate students
Results and Impact	Presentation of first analysis of /s/ results at the BAAP 2018 Colloquium. A number of people have now offered to allow us to work with their speech datasets.
Year(s) Of Engagement Activity	2017
URL	https://blogs.kent.ac.uk/baap/talks-day-2/


Description	SPADE at CLiS
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	National
Primary Audience	Other audiences
Results and Impact	Digging into speech corpora: Introducing SPADE, a new initiative for mining spoken datasets on a large scale is the keynote presentation at the Corpus Linguistics in Scotland event Corpus linguistics and cross-disciplinarity in Glasgow.
Year(s) Of Engagement Activity	2017
URL	https://cdn.evbuc.com/eventlogos/175853256/schedule.jpg


Description	SPADE at NWAV46
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	SPADE is included in the workshop Sociolinguistics and forensic speech science: knowledge- and data-sharing at the New Ways of Analyzing Variation 46 conference in Madison, Wisconsin.
Year(s) Of Engagement Activity	2017
URL	https://dept.english.wisc.edu/nwav46/wp-content/uploads/2016/09/NWAV-46-Booklet-Nov3.pdf


Description	SPADE at the 174th Meeting of the Acoustical Society of America
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	SPADE was included in the presentation Sociophonetic trends in studies of Southern English at the 174th meeting of the Acoustical Society of America in New Orleans.
Year(s) Of Engagement Activity	2017
URL	http://acousticalsociety.org/sites/default/files/fullweek.pdf


Description	Talk at virtual LabPhon17 - Desperately seeking 'English' sibilants: Discovering dialect norms and speaker variability for /s S/ from large-scale, multi-dialect analysis
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	Oral presentation at virtual LabPhon17: Desperately seeking 'English' sibilants: Discovering dialect norms and speaker variability for /s S/ from large-scale, multi-dialect analysis. Hosted UBC, Vancouver. 6-8 July 2020.
Year(s) Of Engagement Activity	2020
URL	https://spade.glasgow.ac.uk/wp-content/uploads/2020/06/StuartSmith_slides.pdf


Description	interview for 'accents' programme as part of BBC Radio 4's Broadcasting House; broadcast 18 November 2018
Form Of Engagement Activity	A press release, press conference or response to a media enquiry/interview
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Public/other audiences
Results and Impact	Personal invitation to contribute to a segment on 'accents, variation and change' for BBC Radio 4's Broadcasting House programme, which was broadcast on Sunday 18 November 2018. The programme producer, Dearbhail Starr, was fascinated by the material which I contributed, which covered phonetic variation and change in Scottish English, based on research relating to the articulatory phonetic investigation of speech, and large-scale analysis of speech variation across English accents.
Year(s) Of Engagement Activity	2018
URL	https://www.bbc.co.uk/programmes/m00018sg


Description	interview for Darren McGarvey's Class Wars BBC Scotland
Form Of Engagement Activity	A broadcast e.g. TV/radio/film/podcast (other than news/press)
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Media (as a channel to the public)
Results and Impact	Interview on social class and accent features for Darren McGarvey's Class Wars programme. The segment which included me talking about speech and accent prejudice, which directly rested on the funded-research projects reported for via ResearchFish, was then put onto Facebook by BBC Scotland, and to date has received 24k responses, 3.8k comments: https://www.facebook.com/BBCScotland/posts/4341338109229254
Year(s) Of Engagement Activity	2021
URL	https://www.bbc.co.uk/programmes/m000s7hd


Description	keynote talk: R Three Ways: Capturing variability in word-final /r/ in Scottish English, r-atics7, 18 November 2021
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Invited keynote talk at ratics7, 18 November 2021, University of Lausanne. Following this, Melanie Lancien submitted an application for a 3-year PostDoc Swiss Foundation Fellowship at Glasgow, to work on the SPADE data.
Year(s) Of Engagement Activity	2021


Description	poster presentation on /s/-retraction across English, LabPhon16, University of Lisbon, 19-22 June 2018
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	Poster presentation of 'Dialectal and social factors affect the phonetic bases of English /s/-retraction', authors: Jane Stuart-Smith, Morgan Sonderegger, Michael McAuliffe, Rachel Macdonald, Jeff Mielke, Erik Thomas and Robin Dodsworth, at 16th Conference on Laboratory Phonology - LabPhon16, held at University of Lisbon, 19-22 June 2018. First presentation of SPADE research on /s/-retraction, from 400+ speakers, to the international laboratory phonology community. Great deal of interest in both the research, but also our new Integrated Speech Corpus ANalysis software, ISCAN. Led to offers of more datasets for the SPADE project.
Year(s) Of Engagement Activity	2018


Description	poster presentation on vowel variation across English, LabPhon16, University of Lisbon, 19-22 June 2018
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	Poster presentation of 'Age vectors vs. axes of intraspeaker variation for North American and Scottish English vowel formants', authors: Erik Thomas, Jeff Mielke, Josef Fruehwald, Jordan Holley, Michael McAuliffe, Morgan Sonderegger, Jane Stuart-Smith, Robin Dodsworth and Tyler Kendall, at 16th Conference on Laboratory Phonology - LabPhon16, held at University of Lisbon, 19-22 June 2018. First presentation of SPADE research on vowel spaces and vowel trajectories in varieties of English, from 400+ speakers, to the international laboratory phonology community. Great deal of interest in both the research, but also our new Integrated Speech Corpus ANalysis software, ISCAN. Led to offers of more datasets for the SPADE project.
Year(s) Of Engagement Activity	2018


Description	talk LingLunch, Laboratoire de Linguistique Formelle(LLF). Université Paris Cité (2 June 2022): Sound perspectives for inferring social meaning? Speech and speaker dynamics over a century of Scottish English.
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Talk to the general linguistics seminar for the Laboratoire de Linguistique Formelle, at Universite Paris Cite (2 June 2022). The theoretical and methodological perspectives presented during the talk generated a lively discussion amongst the participants, who were present physically and also by zoom. I was subsequently invited to apply for a prestigious Labex International Chair, to be held at LLF, for June 2023.
Year(s) Of Engagement Activity	2022


Description	talk at JK28 Satellite Workshop: Viewing accent variation from a large corpus perspective-Rhoticity in Scottish English
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	invited talk at Experimental Phonetics and Phonology at 28th Japanese/Korean Linguistics Conference (JK28), hosted by Seoul University. 28 December 2020.
Year(s) Of Engagement Activity	2020


Description	talk at workshop ALS2020 Online: Persuading birds of a feather to flock together - reflections on managing and measuring diverse speech corpora in SPADE
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	invited talk at workshop on 'Harnessing mobile technologies for the creation of new speech corpora from remote communities' at the Australian Lingustic Society ALS 2020 Online, 14-15 December 2020.
Year(s) Of Engagement Activity	2020
URL	https://als.asn.au/Conference/Program


Description	talk on /s/-retraction across English, at NWAV47, New York University, 18-21 October 2018
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	Talk on 'Dialectal and social factors affect the phonetic bases of English /s/-retraction', authors: Jane Stuart-Smith, Morgan Sonderegger, Michael McAuliffe, Rachel Macdonald, Jeff Mielke, Erik Thomas and Robin Dodsworth, at NWAV47, New York University, 18-21 October 2018. First presentation of SPADE research on /s/-retraction, from 400+ speakers, to the international sociolinguistics community. Great deal of interest in both the research, but also our new Integrated Speech Corpus ANalysis software, ISCAN. Led to offers of more datasets for the SPADE project.
Year(s) Of Engagement Activity	2018
URL	https://wp.nyu.edu/nwav47/


Description	talk on ISCAN software for SPADE, at Linguistics Seminar, 8 November 2018, University of Newcastle
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Stuart-Smith was invited to give a talk on the ISCAN (Integrated Speech Corpus ANalysis) software for SPADE, at Linguistics Seminar, 8 November 2018, University of Newcastle. The Newcastle speech group has a strong emphasis on Arabic phonetics, so the talk led to discussion about how to create a similar kind of cross-Arabic speech project. The ISCAN software can be used for any language, not only English - the focus on English for SPADE relates to the need for a specific focus for this first 3-year project.
Year(s) Of Engagement Activity	2018


Description	talk on ISCAN software for SPADE, at the WYRED project Data Sharing Event, 2 August 2018, University of Huddersfield
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	National
Primary Audience	Professional Practitioners
Results and Impact	The SPADE project was the key focus of the talk, 'Developing tools for data sharing: Polyglot + ISCAN (Integrated Speech Corpus ANalysis)' given by Stuart-Smith at the WYRED project Data Sharing Event held on 2 August 2018, at the University of Huddersfield. This was a satellite event of the 27th Annual Conference of the International Association for Forensic Phonetics and Acoustics (IAFPA). Participants ranged from practising forensic phoneticians to those working in the forensic and general speech technology industry. Very great interest was shown in the project, both research aims, but also ISCAN software and its potential for mining spoken language corpora, also for forensic purposes, without the need for visually inspecting or hearing speech, and hence very useful for ethically-restricted corpora such as police interviews.
Year(s) Of Engagement Activity	2018
URL	http://wyredproject.co.uk/data-sharing-satellite-event/


Description	talk on vowel variation across English, at NWAV47, New York University, 18-21 October 2018
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	Talk on 'Age vectors vs. axes of intraspeaker variation for North American and Scottish English vowel formants', authors: Jeff Mielke, Josef Fruehwald, Erik Thomas, Michael McAuliffe, Morgan Sonderegger and Robin Dodsworth, at NWAV47, New York University, 18-21 October 2018. First presentation of SPADE research on vowel variation and vowel space in English, from 400+ speakers, to the international sociolinguistics community. Great deal of interest in both the research, but also our new Integrated Speech Corpus ANalysis software, ISCAN. Led to offers of more datasets for the SPADE project.
Year(s) Of Engagement Activity	2018
URL	https://wp.nyu.edu/nwav47/


Description	talk to CLILLAC-ARP: R Three Ways: Capturing the dynamics of Scottish word-final /r/, using DCT and GAMMs. CLILLAC-ARP. University Cite Paris.30 May 2022
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Talk to the phonetics researchers at all levels, in the CLILLAC-ARP lab at Universite Cite Paris.
Year(s) Of Engagement Activity	2022


Description	talk to McGill Phonology group: R Three Ways: Capturing the dynamics of Scottish word-final /r/. P* Reading Group. McGill University. 25 April 2023.
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Talk to the P-seminar at McGill University; engaged discussion, followed by request from McGill PhD student, Massimo Lipari, to do a research visit to the Glasgow Phonetics lab.
Year(s) Of Engagement Activity	2023


Description	talk to the Research Seminar on Phonetics and Phonology (SRPP), at the Laboratoire de Phonetique et Phonologie, Sourbonne Nouvelle, Paris. 22 April 2022.
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Postgraduate students
Results and Impact	Talk at the main research seminar for phonetics and phonology for the Laboratoire de Phonetique et Phonologie (LPP), at the Sourbonne Nouvelle, Paris, on 22 April 2022. The seminar was attended by undergraduate and postgraduate students, as well as postdocs and researchers at LPP, as well as several other Paris labs. The talk generated a good deal of discussion, especially relating to the notion of gender and ethnicity construction through phonetic variation.
Year(s) Of Engagement Activity	2022


Description	workshop on Integrated Speech Corpus ANalysis (ISCAN) for SPADE, at NWAV47, New York University, 18 October 2018
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	Hands-on workshop, with interactive tutorial, and presentation, on our new software system for SPADE, at NWAV47, New York University, 18 October 2018: Integrated Speech Corpus ANalysis - ISCAN: A new tool for large-scale, cross-corpus, sociolinguistic analysis: Jane Stuart-Smith, Morgan Sonderegger, Michael McAuliffe . Well attended by sociolinguists working on English and other languages, including Arabic, Hebrew, French, Brazilian Portuguese and Occitan. Very enthusiastically received by all participants. Feedback during tutorial helped with ongoing development of software for the sociolinguistic community.
Year(s) Of Engagement Activity	2018
URL	https://wp.nyu.edu/nwav47/workshops/

Abstract

Planned Impact

Organisations

People

ORCID iD

Publications