Network Stochastic Processes and Time Series (NeST)
Lead Research Organisation:
Imperial College London
Department Name: Mathematics
Abstract
Dynamic networks occur in many fields of science, technology and medicine, as well as everyday life. Understanding their behaviour has important applications. For example, whether it is to uncover serious crime on the dark web, intrusions in a computer network, or hijacks at global internet scales, better network anomaly detection tools are desperately needed in cyber-security. Characterising the network structure of multiple EEG time series recorded at different locations in the brain is critical for understanding neurological disorders and therapeutics development. Modelling dynamic networks is of great interest in transport applications, such as for preventing accidents on highways and predicting the influence of bad weather on train networks. Systematically identifying, attributing, and preventing misinformation online requires realistic models of information flow in social networks.
Whilst simple random networks theory is well-established in maths and computer science, the recent explosion of dynamic network data has exposed a large gap in our ability to process real-life networks. Classical network models have led to a body of beautiful mathematical theory, but do not always capture the rich structure and temporal dynamics seen in real data, nor are they geared to answer practitioners' typical questions, e.g. relating to forecasting, anomaly detection or data ethics issues. Our NeST programme will develop robust, principled, yet computationally feasible ways of modelling dynamically changing networks and the statistical processes on them.
Some aspects of these problems, such as quantifying the influence of policy interventions on the spread of misinformation or disease, require advances in probability theory. Dynamic network data are also notoriously difficult to analyse. At a computational level, the datasets are often very large and/or only available "on the stream". At a statistical level, they often come with important collection biases and missing data. Often, even understanding the data and how they may relate to the analysis goal can be challenging. Therefore, to tackle these research questions in a systematic way we need to bring probabilists, statisticians and application domain experts together.
NeST's six-year programme will see probabilists and statisticians with theoretical, computational, machine learning and data science expertise, collaborate across six world-class institutes to conduct leading and impactful research. In different overlapping groups, we will tackle questions such as: How do we model data to capture the complex features and dynamics we observe in practice? How should we conduct exploratory data analysis or, to quote a famous statistician, "Looking at the data to see what it seems to say" (Tukey, 1977)? How can we forecast network data, or detect anomalies, changes, trends? To ground techniques in practice, our research will be informed and driven by challenges in many key scientific disciplines through frequent interaction with industrial & government partners in energy, cyber-security, the environment, finance, logistics, statistics, telecoms, transport, and biology. A valuable output of work will be high-quality, curated, dynamic network datasets from a broad range of application domains, which we will make publicly available in a repository for benchmarking, testing & reproducibility (responsible innovation), partly as a vehicle to foster new collaborations. We also have a strategy to disseminate knowledge through a diverse range of scientific publication routes, high-quality free software (e.g. R packages, Python notebooks accompanying data releases), conferences, patents and outreach activities. NeST will also carefully nurture and develop the next generation of highly-trained and research-active people in our area, which will contribute strongly to satisfying the high demand for such people in industry, government and academia.
Whilst simple random networks theory is well-established in maths and computer science, the recent explosion of dynamic network data has exposed a large gap in our ability to process real-life networks. Classical network models have led to a body of beautiful mathematical theory, but do not always capture the rich structure and temporal dynamics seen in real data, nor are they geared to answer practitioners' typical questions, e.g. relating to forecasting, anomaly detection or data ethics issues. Our NeST programme will develop robust, principled, yet computationally feasible ways of modelling dynamically changing networks and the statistical processes on them.
Some aspects of these problems, such as quantifying the influence of policy interventions on the spread of misinformation or disease, require advances in probability theory. Dynamic network data are also notoriously difficult to analyse. At a computational level, the datasets are often very large and/or only available "on the stream". At a statistical level, they often come with important collection biases and missing data. Often, even understanding the data and how they may relate to the analysis goal can be challenging. Therefore, to tackle these research questions in a systematic way we need to bring probabilists, statisticians and application domain experts together.
NeST's six-year programme will see probabilists and statisticians with theoretical, computational, machine learning and data science expertise, collaborate across six world-class institutes to conduct leading and impactful research. In different overlapping groups, we will tackle questions such as: How do we model data to capture the complex features and dynamics we observe in practice? How should we conduct exploratory data analysis or, to quote a famous statistician, "Looking at the data to see what it seems to say" (Tukey, 1977)? How can we forecast network data, or detect anomalies, changes, trends? To ground techniques in practice, our research will be informed and driven by challenges in many key scientific disciplines through frequent interaction with industrial & government partners in energy, cyber-security, the environment, finance, logistics, statistics, telecoms, transport, and biology. A valuable output of work will be high-quality, curated, dynamic network datasets from a broad range of application domains, which we will make publicly available in a repository for benchmarking, testing & reproducibility (responsible innovation), partly as a vehicle to foster new collaborations. We also have a strategy to disseminate knowledge through a diverse range of scientific publication routes, high-quality free software (e.g. R packages, Python notebooks accompanying data releases), conferences, patents and outreach activities. NeST will also carefully nurture and develop the next generation of highly-trained and research-active people in our area, which will contribute strongly to satisfying the high demand for such people in industry, government and academia.
Organisations
- Imperial College London (Lead Research Organisation)
- OFFICE FOR NATIONAL STATISTICS (Collaboration, Project Partner)
- Government Communications Headquarters (GCHQ) (Collaboration)
- EDF Energy (Collaboration)
- BT Group (Collaboration)
- Pasteur Institute, Paris (Collaboration)
- Royal Mail (Project Partner)
- Microsoft Research Ltd (Project Partner)
- FNA (Financial Network Analytics) (Project Partner)
- Microsoft (Project Partner)
- EDF (International) (Project Partner)
- GCHQ (Project Partner)
- Securonix (Project Partner)
- British Telecommunications plc (Project Partner)
Publications


Annie Gray
(2023)
Hierarchical clustering with dot products recovers hidden tree structure


Armbruster S
(2024)
Network-based time series modeling for COVID-19 incidence in the Republic of Ireland
in Applied Network Science

Chang J
(2021)
Modelling matrix time series via a tensor CP-decomposition

Chang, J.
(2024)
Edge differentially private estimation in the ß-model via jittering and method of moments
in The Annals of Statistics

Gallagher I
(2023)
Spectral Embedding of Weighted Graphs
in Journal of the American Statistical Association

Hallgren K
(2024)
Changepoint Detection on a Graph of Time Series
in Bayesian Analysis

Han Y
(2021)
Simultaneous Decorrelation of Matrix Time Series

Hannah Sansford
(2023)
Implications of sparsity and high triangle density for graph representation learning
Description | AI Hub |
Amount | £10,000,000 (GBP) |
Funding ID | EP/Y007484/1 |
Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
Sector | Public |
Country | United Kingdom |
Start | 02/2024 |
End | 01/2029 |
Description | EPSRC DTP Scholarship |
Amount | £100,000 (GBP) |
Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
Sector | Public |
Country | United Kingdom |
Start | 09/2024 |
End | 10/2028 |
Title | Spectral Embedding of Weighted Graphs |
Description | When analyzing weighted networks using spectral embedding, a judicious transformation of the edge weights may produce better results. To formalize this idea, we consider the asymptotic behavior of spectral embedding for different edge-weight representations, under a generic low rank model. We measure the quality of different embeddings-which can be on entirely different scales-by how easy it is to distinguish communities, in an information-theoretical sense. For common types of weighted graphs, such as count networks or p-value networks, we find that transformations such as tempering or thresholding can be highly beneficial, both in theory and in practice. Supplementary materials for this article are available online. |
Type Of Material | Database/Collection of data |
Year Produced | 2023 |
Provided To Others? | Yes |
URL | https://tandf.figshare.com/articles/dataset/Spectral_Embedding_of_Weighted_Graphs/23557217/1 |
Title | Spectral Embedding of Weighted Graphs |
Description | When analyzing weighted networks using spectral embedding, a judicious transformation of the edge weights may produce better results. To formalize this idea, we consider the asymptotic behavior of spectral embedding for different edge-weight representations, under a generic low rank model. We measure the quality of different embeddings-which can be on entirely different scales-by how easy it is to distinguish communities, in an information-theoretical sense. For common types of weighted graphs, such as count networks or p-value networks, we find that transformations such as tempering or thresholding can be highly beneficial, both in theory and in practice. Supplementary materials for this article are available online. |
Type Of Material | Database/Collection of data |
Year Produced | 2023 |
Provided To Others? | Yes |
URL | https://tandf.figshare.com/articles/dataset/Spectral_Embedding_of_Weighted_Graphs/23557217 |
Description | Analysis of neural firing patterns |
Organisation | Pasteur Institute, Paris |
Country | France |
Sector | Charity/Non Profit |
PI Contribution | Partnership just begun |
Collaborator Contribution | Partnership just begun |
Impact | Multi-disciplinary. Life sciences, bioinformatics, statistics. |
Start Year | 2024 |
Description | Analysis of telecommunications data |
Organisation | BT Group |
Department | BT Research |
Country | United Kingdom |
Sector | Private |
PI Contribution | Partnership has just begun |
Collaborator Contribution | Partnership has just begun |
Impact | Partnership has just begun |
Start Year | 2024 |
Description | EDF in Paris |
Organisation | EDF Energy |
Department | EDF Innovation and Research |
Country | France |
Sector | Private |
PI Contribution | We and the EDF team led by Dr Y Goude have worked together to develop the curve regression methodology and time series PCA h to tackle the new challenges in forecasting the loads due to the increase of renewable energy and the development of small distributed production units, as well as the changing consumption behavour due to plug-in (hybrid)electric vehicles, heat pumps and personal storage capacities. |
Collaborator Contribution | We and the EDF team have worked together to develop the curve linear regression methodology and have tested it with EDF data. |
Impact | Four publications, and one software package in R. |
Start Year | 2010 |
Description | Network analysis |
Organisation | Government Communications Headquarters (GCHQ) |
Country | United Kingdom |
Sector | Public |
PI Contribution | Our algorithm, Unfolded Spectral Embedding, is now implemented for large-scale dynamic graph visualisation |
Collaborator Contribution | Our algorithm, Unfolded Spectral Embedding, is now implemented for large-scale dynamic graph visualisation |
Impact | Software (not currently open access) |
Start Year | 2022 |
Description | Office for National Statistics work on various projects including migration statistics |
Organisation | Office for National Statistics |
Country | United Kingdom |
Sector | Private |
PI Contribution | Partnership has just begun |
Collaborator Contribution | Partnership has just begun |
Impact | Partnership has just begun |
Start Year | 2024 |
Description | Office of National Statistics Partnership |
Organisation | Office for National Statistics |
Country | United Kingdom |
Sector | Private |
PI Contribution | We are analysing data of direct debits and direct credits at a business sector level. To this purpose we have developed a novel model for time series on networks. It has resulted in a paper and in some conference presentations. Moreover representatives from the Department of Business and Trade have shown an interest in this work and we are in the process of expanding it to nowcast GDP-like figures. |
Collaborator Contribution | This is a partnership which has been facilitated by the Alan Turing Institute. Together with Mihai Cucuringu I supervise a PDRA, Anastasia Mantziou. The ONS provided access to a proprietary data set. It also provided in-house expertise in biweekly meetings. |
Impact | Mantziou, A., Cucuringu, M., Meirinhos, V., & Reinert, G. (2023). The GNAR-edge model: a network autoregressive model for networks with time-varying edge weights. Journal of Complex Networks, 11(6), cnad039. Multidisciplinary, includes statistics and economics |
Start Year | 2021 |
Title | GNAR: Methods for Fitting Network Time Series Models |
Description | Simulation of, and fitting models for, Generalised Network Autoregressive (GNAR) time series models which take account of network structure, potentially with exogenous variables. Such models are described in Knight et al. (2020) and Nason and Wei (2021) . Diagnostic tools for GNAR(X) models can be found in Nason et al (2023) . |
Type Of Technology | Software |
Year Produced | 2023 |
Open Source License? | Yes |
Impact | Difficult to ascertain |
URL | https://cran.r-project.org/web/packages/GNAR/index.html |
Description | An invited talk at Conference on "Recent Advances in Statistics and Data Science" in Rutgers |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Study participants or study members |
Results and Impact | Conference on Recent Advances in Statistics and Data Science with a Celebration of Professors Regina Liu and Cun-Hui Zhang's Special Birthdays |
Year(s) Of Engagement Activity | 2023 |
URL | https://statistics.rutgers.edu/news-events/conferences/684-conference-on-recent-advances-in-statisti... |
Description | Invited talk at 2023 IMS International Conference on Statistics and Data Science, Lisbon |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Study participants or study members |
Results and Impact | The objective of ICSDS is to bring together researchers in statistics and data science from academia, industry, and government in a stimulating setting to exchange ideas on the developments of modern statistics, machine learning, and broadly defined theory, methods, and applications in data science. |
Year(s) Of Engagement Activity | 2023 |
URL | https://www.icsds2023.com/ |
Description | Invited talk at Conference on "Statistical Foundations of Data Science and Applications" in Princeton |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Study participants or study members |
Results and Impact | The conference was in honour of Professor Jianqing Fan's 60 birthday attended by over 300 academics, students and people working in industry, |
Year(s) Of Engagement Activity | 2023 |
URL | https://fan60.princeton.edu/ |
Description | Invited talk at Conference on 2023 Kansas Econometrics Workshop, Kansas |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Study participants or study members |
Results and Impact | This workshop consists of a series of yearly workshops focusing on recent developments of econometrics theories and methodologies as well as applications in economics and finance and other applied fields such as data sciences and statistics. The main purpose of the econometrics workshop series at KU is to promote methodological and theoretical research as well as applications in modern econometrics and statistics as well as data science, and to provide a forum for researchers, including Ph.D. students, to come together to interact through social discussions and presentations. |
Year(s) Of Engagement Activity | 2023 |
URL | https://econometrics.ku.edu/ |
Description | Invited talk at Prof. Carey Priebe 60th birthday conference |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Other audiences |
Results and Impact | Talk |
Year(s) Of Engagement Activity | 2023 |
URL | https://brinmrc.umd.edu/programs/workshops/fall23/fall23-workshop-statistics.html |
Description | Invited talk at The OMI Machine Learning in Financial Econometrics, Oxford Man Institute |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Study participants or study members |
Results and Impact | The workshop is to to the dissemination of cutting-edge ideas in economics, financial industry using machine learning tools. |
Year(s) Of Engagement Activity | 2023 |
URL | https://web.cvent.com/event/78dec7d3-ee2d-4ddb-b14d-b05e782bb209/summary |
Description | Keynote talk |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Postgraduate students |
Results and Impact | Keynote talk, at ICT Innovations 2023: 15th ICT Innovations Conference 2023, Ohrid, North Macedonia, Title: "Synthetic Networks" This conference is a key conference for graduate students in North Macedonia. |
Year(s) Of Engagement Activity | 2023 |
URL | https://ictinnovations.org/ |
Description | Patrick Rubin-Delanchy seconder of the vote of thanks JRSSB Discussion paper: "Root and community inference on the latent growth process of a network" |
Form Of Engagement Activity | A formal working group, expert panel or dialogue |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Other audiences |
Results and Impact | PRD seconder of the vote of thanks JRSSB Discussion paper: "Root and community inference on the latent growth process of a network |
Year(s) Of Engagement Activity | 2023 |
URL | https://rss.org.uk/training-events/events/events-2023/rss-events/root-and-community-inference-on-the... |
Description | Poster at Imperial showcase |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | Local |
Primary Audience | Policymakers/politicians |
Results and Impact | Poster at the Imperial Natural Sciences Showcase 2023 on "Modelling a COVID-19 Time Series as a Generalised Network Autoregressive Process" |
Year(s) Of Engagement Activity | 2023 |
URL | https://www.imperial.ac.uk/events/163172/natural-sciences-showcase-2023-2/ |
Description | SNS email list |
Form Of Engagement Activity | Engagement focused website, blog or social media channel |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Postgraduate students |
Results and Impact | Gesine Reinert set up an email list for social network science |
Year(s) Of Engagement Activity | 2023 |
URL | https://www.jiscmail.ac.uk/cgi-bin/webadmin?A0=SNS |
Description | Seminar (UCL) |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | Regional |
Primary Audience | Postgraduate students |
Results and Impact | Invited speaker talk at UCL national poster competition in Statistics. Talk on "Network Time Series" |
Year(s) Of Engagement Activity | 2023 |
URL | https://tsoo-math.github.io/ucl2/grst/2023-poster.html |
Description | Seminar at Queen's University Belfast |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | Regional |
Primary Audience | Policymakers/politicians |
Results and Impact | Mathematical Sciences Research Centre at Queen's University Belfast seminar on "Network Time Series" |
Year(s) Of Engagement Activity | 2024 |
URL | https://www.qub.ac.uk/research-centres/msrc/events/ |
Description | Talk at St Andrews |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | Local |
Primary Audience | Professional Practitioners |
Results and Impact | Departmental Seminar given by affiliate Prof Mario Cortina Borja (UCL) on "Modelling high-dimensional time series with generalised network autoregressive processes " |
Year(s) Of Engagement Activity | 2023 |
URL | https://stats.wp.st-andrews.ac.uk/seminars/ |
Description | Talk at national postgraduate conference |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | National |
Primary Audience | Postgraduate students |
Results and Impact | Conference talk given by grant affiliate Chiara Boetti (University of Bath) on "Long Memory Network Time Series" |
Year(s) Of Engagement Activity | 2023 |
Description | Temporal Graph Learning Workshop at NeurIPS 2023 |
Form Of Engagement Activity | A formal working group, expert panel or dialogue |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Other audiences |
Results and Impact | Patrick Rubin-Delanchy invited to be panelist at the Temporal Graph Learning Workshop at NeurIPS 2023. Alex Modell (PDRA) took his place. |
Year(s) Of Engagement Activity | 2023 |
URL | https://sites.google.com/view/tglworkshop-2023/home |