Census Innovation at CeLSIUS

Lead Research Organisation: UNIVERSITY COLLEGE LONDON

Department Name: Epidemiology and Public Health

Abstract

This project aims to develop two aspects of Census data.
1) Addressing the gap in provision with regard to access and utilising the most restricted access UK Census data (other than the longitudinal studies) by providing enhanced user information guidance, training and user support

2) Creation of a fake (impossible) longitudinal England and Wales dataset to enable users to explore the data, select variables and draft code for their analyses

Gap in provision

This currently exists with regard to support for using secure UK Census data other than the longitudinal studies which are currently supported by three organisations CeLSIUS in England and Wales, NILS-RSU in Northern Ireland and SLS-DSU in Scotland - collectively they form UKCenLS. Less restricted access versions of the data are currently well supported by the UK Data Service but there is no support for the most restricted access data sets other than limited user information and data storage and research clearance. CeLSIUS aims to further support two groups of data: secure origin-destination or 'flow' data (migration, commuting, student migration, and second residence sets) from 2011 and 2021/2, and de-identified individual and household microdata from 1961 to 2021.

This project seeks to assess feasibility and affordability of these potential new support services and provide the required user information guidance and training in particular for the 2021/2 data where none yet exists.

This service will support researchers who wish to use UK census 2021/2 migration and commuting and individual data for social science-led research, but also compare change over time.

CeLSIUS has considerable experience in supporting the user of the other restricted access census data and will ensure that no data that could identify an individual becomes public.

Creation of a fake longitudinal data set

Currently it is hard for users to know what variables are in the Office for National Statistics Longitudinal Study (ONS LS) and is getting harder with each Census. There are approximately 600 variables per Census and in some cases hundreds of categories for a variable.

The current solutions: a) ask CeLSIUS and/or ONS b) use the data dictionary c) look at the Census forms. It is hard for users to choose the best variable options when they are so many options. This means users may select the wrong variables for their project and then must reapply for any additional variables which causes delays and places administrative burden on ONS and CeLSIUS. Also we have users who never see the data set and we do analysis for them and no users ever see the full data set so they do not always know what is best to select.

We aim to create a individual level complete fake longitudinal data set to be openly available. This has imaginary people from 1971-2021 with impossible characteristics covering all options in the Censuses. This open data set will reduce burden on users and the user support team of the ONS LS. It will also be a useful tool for training.

Funded Value:

£384,576

Funded Period:

Mar 24 - Mar 26

Funder:

ESRC

Project Status:

Active

Project Category:

Research and Innovation

Project Reference:

ES/Z502741/1

Principal Investigator:

Nicola Shelton

Research Topic:

Unclassified

Organisations

People	ORCID iD
Nicola Shelton (Principal Investigator)
Oliver Duke-Williams (Co-Investigator)	http://orcid.org/0000-0001-8050-1881
Adam Dennett (Co-Investigator)

Publications

Author Name

Title Publication Date Published

10 25 50

Policy Influence
Collaboration
Engagement Activities


Description	Initial steering meeting with the ONS 2021 Census Production team
Geographic Reach	National
Policy Influence Type	Contribution to new or improved professional practice


Description	Participant in three rounds of ESRC/MRC PRUK Delphi surveys entitled: "Reaching consensus on representativeness and coverage in longitudinal population studies"
Geographic Reach	National
Policy Influence Type	Contribution to a national consultation/review
URL	https://www.ukri.org/opportunity/understanding-coverage-in-uk-longitudinal-population-studies/


Description	Submission to consulation on UK Statistics Assembly
Geographic Reach	National
Policy Influence Type	Contribution to a national consultation/review
Impact	The event will be repeated followings its success. The National Statistician has arranged to meet NS to discuss a BSPS day meeting
URL	https://uksa.statisticsauthority.gov.uk/uk-statistics-assembly-2025/


Description	Meeting with National Statistician
Organisation	Office for National Statistics
Country	United Kingdom
Sector	Private
PI Contribution	Following a meeting with Sir Ian Diamond the National Statistician there has been a decision made to hold a BSPS day meeting taking about the future of popualtion statistics
Collaborator Contribution	I arranged the meeting
Impact	Day meeting to be held May-July 2025
Start Year	2025


Description	Talk at the 2024 BSPS conference Innovation in Census Support at CeLSIUS
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Professional Practitioners
Results and Impact	Paper presented to academic conference - strong interest from users in the innovation and useful feebback received
Year(s) Of Engagement Activity	2024
URL	https://www.lse.ac.uk/international-development/research/british-society-for-population-studies/Asse...

Abstract

Organisations

People

ORCID iD

Publications