Bringing the Social City to the Smart City
Lead Research Organisation:
University of Leeds
Department Name: Sch of Geography
Abstract
Technological developments, such as the rise in GPS enabled devices and Web 2.0 technologies have created social transformations in how we connect and share information through the mass uptake of smart phones and social media platforms (Croitoru et al, 2014). This new generation of mobile technologies work as individual sensors capturing data on a wide range of human behaviours that have been previously hidden. These include data on individual movement, preferences and opinions. Understanding these behaviours is crucial if we are to create a joined up approach to simulating how cities breath and grow. However, considerable work is required in adapting and developing new technologies from machine learning to extract behaviours which can be embedded into cutting-edge modelling techniques. Creating this bridge between 'big' data representing the 'real' world, and simulations producing alternative versions of reality is of value to both academics and policymakers looking to develop new solutions to many of the challenges that today's cities face. To do this we need to understand how factors within the "Social City" (the impact of individual movements and decisions) play out every day in the "Smart City" (data collected from fixed sensors on for example, traffic counts, air pollution or movements of populations).
However, standard "Smart" City understanding assumes previous flows (e.g. traffic at a specific time of the week, energy requirements or pollution levels over a 24-hour period) will be replicated into the future, lacking both adaptability (how does this alter if a major event in the city is happening?) and predictive power (what is the impact on health if all petrol and diesel vehicles are banned?). This disconnection between the Smart and Social city means policymakers are unable to obtain answers to complex interrelated questions such as: what is the optimal transport infrastructure to promote healthy behaviours and reduce the City's carbon footprint?
Being able to answer these questions is increasingly important as cities are facing significant challenges associated with the pressures from rapidly increasing urban populations. These include improving water and transportation infrastructure, air pollution and waste management as well as provision of adequate housing, energy, health care, education and employment. These pressures on future cities has brought the Smart City agenda to dominate many government initiatives with governments and policymakers looking to new forms of (big) micro data to provide innovative solutions to these challenges. While many models have been developed to forecast future transport, housing or healthcare initiatives, most uses are purely empirical: they lack any consideration of the social processes behind the individual generating the data or the impact of their actions and decisions. This Fellowship will explore how machine learning inspired tools can be used to recognise such emergent patterns and processes within micro-level data sources such as data on how individuals move and use city spaces. Along with the additional methodology of Agent Based Models, which allows smart and social city data to be readily combined, this suite of methods will be explored through looking at the case-studies of (i) the impact of air pollution on individuals and (ii) urban mobility. This work will fundamentally transform our ability to show how the social elements of the Smart City can be recognised and understood, and how to bring 'lived experience' to the analysis of Smart City data.
However, standard "Smart" City understanding assumes previous flows (e.g. traffic at a specific time of the week, energy requirements or pollution levels over a 24-hour period) will be replicated into the future, lacking both adaptability (how does this alter if a major event in the city is happening?) and predictive power (what is the impact on health if all petrol and diesel vehicles are banned?). This disconnection between the Smart and Social city means policymakers are unable to obtain answers to complex interrelated questions such as: what is the optimal transport infrastructure to promote healthy behaviours and reduce the City's carbon footprint?
Being able to answer these questions is increasingly important as cities are facing significant challenges associated with the pressures from rapidly increasing urban populations. These include improving water and transportation infrastructure, air pollution and waste management as well as provision of adequate housing, energy, health care, education and employment. These pressures on future cities has brought the Smart City agenda to dominate many government initiatives with governments and policymakers looking to new forms of (big) micro data to provide innovative solutions to these challenges. While many models have been developed to forecast future transport, housing or healthcare initiatives, most uses are purely empirical: they lack any consideration of the social processes behind the individual generating the data or the impact of their actions and decisions. This Fellowship will explore how machine learning inspired tools can be used to recognise such emergent patterns and processes within micro-level data sources such as data on how individuals move and use city spaces. Along with the additional methodology of Agent Based Models, which allows smart and social city data to be readily combined, this suite of methods will be explored through looking at the case-studies of (i) the impact of air pollution on individuals and (ii) urban mobility. This work will fundamentally transform our ability to show how the social elements of the Smart City can be recognised and understood, and how to bring 'lived experience' to the analysis of Smart City data.
Planned Impact
One of the key elements of the Fellowship is to develop economic and societal impact through engagement with a wide range of stakeholders. The Fellow will be based within the Consumer Data Research Centre (CDRC). The CDRC has established a network of partnerships that the Fellow will thread together over a number of activities. There are a wide range of active partnerships within CDRC including Callcredit, Axciom, Sainsbury's, Zoopla, Whenfresh, Heart Research UK, M&S, Virgin Media, YouGov and Link. Access to data and stakeholders from these and other CDRC partners will be available to the Fellow. Engagement with external partners is facilitated through events including an annual User Forum, which brings together more than 60 commercial partners of CDRC.
How will they benefit?
Research project (RP) 1 and PhD 1: Creating tools to uncover hidden emergent trends and processes in spatial data. Can new tools give greater insight to explain why a phenomenon is prevalent in one area, but not another? Can new data tools reveal new insights into current data, and uncover gaps? Beneficiaries from this work include organisations that need to probe deeper into spatial data to understand the interplay of emergent patterns over time. Examples: Local government planning and transportation departments, public health organisations.
RP 2 and PhD 2: Testing methodologies through ABM: ABM gives the opportunity to bring together Smart and Social city data through simulating individuals. This aspect of the work will create new understanding about the utility of new social media data sets and how they can be readily combined with Smart data. This will create impact for those beneficiaries who are interested in both simulating and understanding individual behaviour and the consequences of individual decisions. Examples: Retail (commercial) and health organisations, local and national government.
RP3: Application to case-studies. This part of the Fellowship has the greatest potential impact as the work can be readily applied to specific case-studies to answer questions about the consequences of new policies or interventions. Examples: Can the transportation infrastructure be optimised to encourage active travel and city spaces to be more readily used? Whilst methodological innovations will have the greatest short-term impact on researchers in academic institutions, the medium to long-term impact of this work will feed through into research orientated public sector organisations, such as those interested in health behaviours, mobility patterns, impact of new infrastructure and new economic initiatives. Examples: Town and Country Planning Association; West Yorkshire Combined Authority.
How will I ensure stakeholders will benefit from the research?
Stakeholder advisory meetings: Creation of a stakeholder group, the purpose of which is to promote interactions and exchanges. Meetings will be scheduled for the beginning of the Fellowship and each subsequent year to allow time to shape research questions and identify additional case-studies.
Creation of new resources including training courses: Open access code, online training materials will be made available to professionals and academics alike. Workshops will be held to train stakeholders, academics and CDT PhD students in new methods and data, thus impacting on research capacity and skills.
Seminars: Turing and CDRC both offer seminar programmes at which research findings from this Fellowship will be disseminated. Webinars will also be used to increase the impact and range of engagement with different stakeholders.
Further funding applications: Engaging with stakeholders and building bridges is of prime importance for future funding opportunities. At month 18, a research incubator event will be held at CDRC. This will involve Turing Fellows as well as invited stakeholders and researchers
How will they benefit?
Research project (RP) 1 and PhD 1: Creating tools to uncover hidden emergent trends and processes in spatial data. Can new tools give greater insight to explain why a phenomenon is prevalent in one area, but not another? Can new data tools reveal new insights into current data, and uncover gaps? Beneficiaries from this work include organisations that need to probe deeper into spatial data to understand the interplay of emergent patterns over time. Examples: Local government planning and transportation departments, public health organisations.
RP 2 and PhD 2: Testing methodologies through ABM: ABM gives the opportunity to bring together Smart and Social city data through simulating individuals. This aspect of the work will create new understanding about the utility of new social media data sets and how they can be readily combined with Smart data. This will create impact for those beneficiaries who are interested in both simulating and understanding individual behaviour and the consequences of individual decisions. Examples: Retail (commercial) and health organisations, local and national government.
RP3: Application to case-studies. This part of the Fellowship has the greatest potential impact as the work can be readily applied to specific case-studies to answer questions about the consequences of new policies or interventions. Examples: Can the transportation infrastructure be optimised to encourage active travel and city spaces to be more readily used? Whilst methodological innovations will have the greatest short-term impact on researchers in academic institutions, the medium to long-term impact of this work will feed through into research orientated public sector organisations, such as those interested in health behaviours, mobility patterns, impact of new infrastructure and new economic initiatives. Examples: Town and Country Planning Association; West Yorkshire Combined Authority.
How will I ensure stakeholders will benefit from the research?
Stakeholder advisory meetings: Creation of a stakeholder group, the purpose of which is to promote interactions and exchanges. Meetings will be scheduled for the beginning of the Fellowship and each subsequent year to allow time to shape research questions and identify additional case-studies.
Creation of new resources including training courses: Open access code, online training materials will be made available to professionals and academics alike. Workshops will be held to train stakeholders, academics and CDT PhD students in new methods and data, thus impacting on research capacity and skills.
Seminars: Turing and CDRC both offer seminar programmes at which research findings from this Fellowship will be disseminated. Webinars will also be used to increase the impact and range of engagement with different stakeholders.
Further funding applications: Engaging with stakeholders and building bridges is of prime importance for future funding opportunities. At month 18, a research incubator event will be held at CDRC. This will involve Turing Fellows as well as invited stakeholders and researchers
People |
ORCID iD |
Alison Heppenstall (Principal Investigator / Fellow) |
Publications
An L
(2023)
Modeling agent decision and behavior in the light of data science and artificial intelligence
in Environmental Modelling & Software
Antosz P
(2023)
What do you want theory for? - A pragmatic analysis of the roles of "theory" in agent-based modelling
in Environmental Modelling & Software
Arnold KF
(2019)
Adjustment for time-invariant and time-varying confounders in 'unexplained residuals' models for longitudinal data within a causal framework and associated challenges.
in Statistical methods in medical research
Arnold KF
(2019)
DAG-informed regression modelling, agent-based modelling and microsimulation modelling: a critical comparison of methods for causal inference.
in International journal of epidemiology
Burns L
(2017)
Developing an Individual-level Geodemographic Classification
in Applied Spatial Analysis and Policy
Crooks Andrew
(2018)
Agent-Based Modelling and Geographical Information Systems: A Practical Primer
Franklin RS
(2023)
Making Space in Geographical Analysis.
in Geographical analysis
Gadd S
(2022)
The utility of multilevel models for continuous-time feature selection of spatio-temporal networks
in Computers, Environment and Urban Systems
Gadd S
(2021)
Simplifying the interpretation of continuous time models for spatio-temporal networks
in Journal of Geographical Systems
Description | Work has been focused in three areas: (i) developing methods for understanding movements in space-time data (where and how do people move around cities). We have developed network methods (published) and work is currently undergoing assessing visualisation and ML methods. (ii) initial work has been published looking at understanding causality from the perspective of statistical epidemiology - why and how does something change in a population. This is novel work that links together individual-based modelling with causal inference methods. This work will continue and tools will be produced. (iii) developing methods for quantifying uncertainty. Research has been published on methods for understanding uncertainty in agent-based models and a grant submitted on this topic. Ongoing work will look at novel methods to improve uncertainty in individual-based models (particle and Kalman Filters) as well as emulators. In addition, a new project (QUIPP) came out of this work -- it is focused on generating synthetic data from sensitive data that both preserves utility and privacy. |
Exploitation Route | All the code etc that will be developed will be placed on Github and I am working with the Turing RSE group to create reusable code wherever possible. I am actively collaborating and disseminating the work through multiple channels. |
Sectors | Aerospace Defence and Marine Digital/Communication/Information Technologies (including Software) Energy Environment Financial Services and Management Consultancy Healthcare Retail Transport |
Description | Exascale Agent-based Modelling for PoLicy Evaluation in Real-time (ExAMPLER) |
Amount | £412,695 (GBP) |
Funding ID | EP/Y008839/1 |
Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
Sector | Public |
Country | United Kingdom |
Start | 05/2023 |
End | 11/2024 |
Description | Quantifying Utility and Privacy Preservation in Synthetic Populations (QUiPP) |
Amount | £408,611 (GBP) |
Funding ID | TPS2019\100019 |
Organisation | Alan Turing Institute |
Sector | Academic/University |
Country | United Kingdom |
Start | 09/2019 |
End | 03/2021 |
Description | Real-Time Advanced Data assimilation for Digital Simulation of Numerical Twins on HPC (RADDISH) |
Amount | £400,000 (GBP) |
Organisation | Alan Turing Institute |
Sector | Academic/University |
Country | United Kingdom |
Start | 09/2019 |
End | 03/2021 |
Title | Network Analysis Visualisation |
Description | The tool that is being developed is in two stages: network science tools adapted to handle large databases and (ii) adaption of visualisation tools for understanding what's happening inside of the data. My plan is to pass these tools to the Turing for a Data Scientist to create reusable code. |
Type Of Material | Improvements to research infrastructure |
Year Produced | 2019 |
Provided To Others? | No |
Impact | The tool has not yet been completed. |
Title | Particle filters for agent-based models (uncertainty handling) - RADDISH |
Description | Particle filters are being explored as a method to emulate large numbers of agents and to quantify uncertainty in this approach. |
Type Of Material | Improvements to research infrastructure |
Year Produced | 2020 |
Provided To Others? | Yes |
Impact | No notable impact as of yet, but this research has been linked to the Alan Turing TPS RADDISH project. This involves creating a real-time warning system for hazards such as tsunamis. |
Title | Quantifying Utility and Privacy Preservation in Synthetic Populations (QUiPP) |
Description | This tool is aimed at creating individual-level synthetic populations from sensitive data sets that preserves the internal structures and preserves individual privacy. |
Type Of Material | Improvements to research infrastructure |
Year Produced | 2020 |
Provided To Others? | Yes |
Impact | This tool is still in its initial phase. We are hoping to create a demonstrator in the next few months then work with health and financial data providers. |
URL | https://github.com/alan-turing-institute/QUIPP-pipeline/projects |
Title | A dataset of dockless bike sharing trips and road network for Nanchang, China |
Description | The dataset includes the data and scripts in the paper " Yang Y, Heppenstall A, Turner AGD and Comber A. A spatiotemporal and graph-based analysis of dockless bike sharing patterns to understand urban flows over the last mile. Paper accepted for publication in Computers Environment and Urban Systems (July 2019)". It contains (1) Road Network and dockless bike trips of 10 weekdays in Nanchang, China (2) Scripts used to produce the result in the paper. |
Type Of Material | Database/Collection of data |
Year Produced | 2019 |
Provided To Others? | Yes |
Title | Dataset of the enhanced demand prediction models in bike-sharing systems using graph structural information |
Description | The dataset underpins the findings reported in Yang, Y., Heppenstall, A., Turner, A., & Comber, A. (2020). Using graph structural information about flows to enhance short-term demand prediction in bike-sharing systems. Computers, Environment and Urban Systems, 83, 101521. https://doi.org/10.1016/j.compenvurbsys.2020.101521 The enhanced machine learning models, feature importance models are included along with the bike-sharing data. |
Type Of Material | Database/Collection of data |
Year Produced | 2020 |
Provided To Others? | Yes |
URL | http://archive.researchdata.leeds.ac.uk/803/ |
Title | Drive Cycle Data from the 3D Urban Traffic Simulator (ABM) in Unity (version 1.1.0) |
Description | This repository contains datasets from the 3D Urban Traffic Simulator ABM:Each vehicle type, PHEV and ICEV were simulated across nine different speed limit adherence and density scenarios. The output data columns are:AgentID: the unique agent identifier.xAxisPos: the x-axis position of the agent.zAxisPos: the y-axis position of the agent.collisions: the total number of times a vehicle has collided with another.topSpeed(mph): the top speed the vehicle is set to achieve throughout its drive cycle.currentSpeed(mph): the current speed of the vehicle in mph.distanceOfTravel(meters): the distance the vehicle has travelled over its drive cycle.raycastLength: the length at which the vehicle can identify objects, 1 - very short to 10 high vision distance.tractionControl: traction control initiated; some vehicles will have traction control, others will not; this is entirely arbitrary.VelocityMagnitude(BETA): the magnitude of the velocity for the vehicle.VehicleMass: the vehicle weight in kg.Downforce: the force applied to the vehicle to create more grip, 0.1 - small force to 10.0 - more force.date-time: date and timestamps for each data point collected, currently its milliseconds and multiple actions by the raw physics engine can occur throughout the simulation. Therefore, large amounts of data points are collected for each run in a short space of time.File path:Data/PHEV or ICEV/- MC1: 10 Vehicles, all vehicles do not adhere to speed limits.- MC2: 10 Vehicles, half of which do not adhere to speed limits.- MC3: 10 Vehicles, all adhere to speed limits.- MC4: 50 Vehicles, all vehicles do not adhere to speed limits.- MC5: 50 Vehicles, half of which do not adhere to speed limits.- MC6: 50 Vehicles, all adhere to speed limits.- MC7: 100 Vehicles, all vehicles do not adhere to speed limits.- MC8: 100 Vehicles, half of which do not adhere to speed limits.- MC9: 100 Vehicles, all adhere to speed limits. |
Type Of Material | Database/Collection of data |
Year Produced | 2021 |
Provided To Others? | Yes |
URL | https://figshare.com/articles/dataset/Drive_Cycle_Data_from_the_3D_Urban_Traffic_Simulator_ABM_in_Un... |
Description | GP Project work |
Organisation | NHS England |
Country | United Kingdom |
Sector | Public |
PI Contribution | We have worked on a prescription data set (pertaining to asthma) to interrogate different prescribing practices during the Covid Pandemic. |
Collaborator Contribution | We have worked with a GP, gaining her expertise within this area. |
Impact | There are journal publications that are being currently written. |
Start Year | 2020 |
Description | Improbable |
Organisation | Improbable |
Country | United Kingdom |
Sector | Private |
PI Contribution | I have helped them to develop more sophisticated models in return and methods to quantify uncertainty. |
Collaborator Contribution | Improbable have contributed software engineering time and visualisation expertise to the project. I have helped them to develop more sophisticated models in return and methods to quantify uncertainty. |
Impact | This work is ongoing. |
Start Year | 2020 |
Description | Modelling Uncertainty |
Organisation | Improbable |
Country | United Kingdom |
Sector | Private |
PI Contribution | This work looks at quantifying uncertainty through the use of probabilistic programming for improving the forecasting accuracy of simulation tools. This directly feeds into the work undertaken by the Fellowship and links to other funded work. |
Collaborator Contribution | They have funded two intern projects and a PhD studentship as well as giving full access to their probabilistic framework (Keanu). |
Impact | So far this work has resulted in the developed of two conference papers and software development. |
Start Year | 2018 |
Description | Procter & Gamble |
Organisation | Procter & Gamble |
Department | Newcastle Innovation Centre |
Country | United Kingdom |
Sector | Private |
PI Contribution | This is a piece of work applying the methods developed within the grant to research questions generated by the partner. I am supervising an intern to undertake the research. |
Collaborator Contribution | The partners have funded an intern based at the UoL and are supply expertise and access to data sets. |
Impact | This internship is due to commence in April 2021 for 6 months. |
Start Year | 2020 |
Description | Attendance at the monthly Turing Urban Analytics group |
Form Of Engagement Activity | A formal working group, expert panel or dialogue |
Part Of Official Scheme? | No |
Geographic Reach | National |
Primary Audience | Other audiences |
Results and Impact | Monthly meeting and exchange of ideas about research funding and development of ideas for the Turing Urban Analytics programme. |
Year(s) Of Engagement Activity | 2019,2020,2021,2022 |
Description | Chair of an international conference |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | Co-Chair of the GIScience conference, one of the largest computational geography conferences in the field. 450 delegates attended. |
Year(s) Of Engagement Activity | 2023 |
URL | https://giscience2023.github.io |
Description | Conference talk, International Microsimulation Association, University of Vienna |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | Challenges of using microsimulation for simulating the impact of interventions on populations. IMS. University of Vienna, Austria. Jan 2024. |
Year(s) Of Engagement Activity | 2024 |
URL | https://ima-2024.wifo.ac.at |
Description | Expert Data Advisor: JBC |
Form Of Engagement Activity | A formal working group, expert panel or dialogue |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Policymakers/politicians |
Results and Impact | I am an invited expert on the JBC panel advising on data and methods used to analyse COVID data. This feeds directly into the SAGE group and government policy on COVID. |
Year(s) Of Engagement Activity | 2020,2021 |
URL | https://www.gov.uk/government/groups/joint-biosecurity-centre |
Description | Invited keynote Expos Utrecht |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | Keynote presentation - Challenges and opportunities of using ABMs for simulating social system. |
Year(s) Of Engagement Activity | 2023 |
URL | https://exposome.nl/news-events/news/highlights-from-the-exposome-nl-conference-2023.html |
Description | Invited talk at a symposium at the University of Zurich |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Postgraduate students |
Results and Impact | Invited talk at University of Zurich. Challenges and opportunities of using ABMs for simulating social systems. |
Year(s) Of Engagement Activity | 2023 |
Description | Scientific Chair of AGILE conference |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Policymakers/politicians |
Results and Impact | Scientific chair of the AGILE conference, an applied geography conference. Responsible for the scientific theme of the conference. |
Year(s) Of Engagement Activity | 2024 |
URL | https://agile-gi.eu/conference-2024 |
Description | Simulating the effects of income and housing policies on Quality Adjusted Life Expectancy |
Form Of Engagement Activity | A formal working group, expert panel or dialogue |
Part Of Official Scheme? | No |
Geographic Reach | National |
Primary Audience | Professional Practitioners |
Results and Impact | Presentation on actionable insights of using AI tools. |
Year(s) Of Engagement Activity | 2024 |
Description | Talk at GIScience conference |
Form Of Engagement Activity | A formal working group, expert panel or dialogue |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | Talk called Exascale agent-based modelling for policy evaluation in real-time |
Year(s) Of Engagement Activity | 2023 |
URL | https://giscience2023.github.io |
Description | Turing workshop on reproducibility |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | Co-organiser of the "Reliability and reproducibility in computational science: Implementing verification, validation and uncertainty quantification in silico" workshop. As a result of this event, I discussed methods with Prof Peter Challenor (Exeter) and wrote and submitted a Turing grant on Uncertainty. |
Year(s) Of Engagement Activity | 2020 |
URL | https://www.turing.ac.uk/events/reliability-and-reproducibility-computational-science |
Description | Workshop on modelling crime demand |
Form Of Engagement Activity | A formal working group, expert panel or dialogue |
Part Of Official Scheme? | No |
Geographic Reach | National |
Primary Audience | Professional Practitioners |
Results and Impact | Workshop to engage policymakers (police) to understand their requirements for a police demand tool. I am involved in this as a Co-investigator, solicited for my expertise in ABM and demand modelling. |
Year(s) Of Engagement Activity | 2020 |