Our Data Ourselves

Lead Research Organisation: King's College London
Department Name: Digital Humanities

Abstract

Collectively, we now create more data in two days than we did in all of 2002. Whether it is the 4 B pieces of content shared daily on Facebook, the 200 M tweets, more than 5 B of us intensively produce unprecedented and unfathomable amounts of social data whenever we text, browse, post or generate content on our phones. We call this Big Social Data (BSD), and it entails the symbolic content we generate as will as the metadata our phones emit, tracking our bodies through time and space. Recent revelations about the NSA and GCHQ reveal just how data centric our society has become. Yet you could say we suffer from a data democracy deficit. As such, it is more important than ever that public understanding of our information-rich environment and our quantified selves improves.

Our research project 'Our Data, Ourselves' seeks to democratise BSD. Big data has already been widely identified as a key economic driver. We seek to turn it into a community asset and develop tools, applications, formats and practices which will enable important new research on and using BSD by arts and humanities researchers. We see community as being articulated though social and communicative practices, the very wellspring of BSD. These mediated practices impact traditional communities, those based on place and identity, and create new communities, especially with young people, those 'born digital.'

We will partner with youth coders in the Young Rewired State network as co-researchers, and gain privileged insight into how the mediated connectedness manifested by BSD is transforming communities. We will work with our young co-researchers to develop tools and applications for the capture, storage, and analysis of BSD. We will create an open environment for BSD research and develop an ethical framework for data sharing available for widespread community use. We envision a BSD research commons, respectful of privacy concerns, and engaging young tech-savvy communities to gain a greater understanding of the data-intensive digital culture of mobile environments.

Our research aims to enquire into the elements comprising one's personal archive of BSD. This entails examining the symbolic content we generate and the metadata of our devices, which map our bodies in time and space, as well as the infrastructure, in which data is stored, searched, shared, analysed and mined. This will facilitate a better understanding of the kinds of social connections, information sharing, and normative relations we are developing to and through BSD.

A major challenge is in the development and transfer of the technical skills and knowledge necessary for the capture and analysis of the different forms of BSD, and its transformation into community research data, available to researchers, those who generate it, and their communities. The basic issue raised is whether BSD can be transformed into a public asset and become a creative resource for cultural and economic community development. The basic problem addressed is that we do not own the BSD that we generate. From the moment we tap 'send' it moves and is processed in a highly proprietary environment where it remains inaccessible to us, typically returning only in the form of targeted ads. In short, we seek to transform proprietary and inaccessible BSD into open research data that becomes an accessible community resource.

We develop a freely accessible, open source online market place for tools and applications enabling the extraction of BSD from smart phones. We engage in hackathons to develop these tools, and engage privacy concerns, developing anonymisation technologies and produce policy white papers on ethical data sharing. We seek to turn BSD, which is currently primarily fodder for targeted ads and surveillance into a community resource available for creative use. If it is 'our data ourselves', our BSD commons will empower us to use it in new ways, both in community and by arts and humanities researchers.

Planned Impact

All components of our work deliver direct impact, as detailed in the Pathways to Impact and the Case for Support. The main impact, however, will come from working with Young Rewired State, a network of young coders who have participated in Hack Days for Refugees United, UK Aid, and the Department of Education. Young Rewired State is a leading community organisation in teaching youth coding skills through practical projects in situ, most prominently in their annual Festival of Code. We envision this project as the first step in a longer-term partnership with Young Rewired State, which has coder networks across the UK. Further, Young Rewired State is the philanthropic arm of Rewired State, which is a network of more than 1,000 software developers and designers with whom we intend to actively collaborate in the future.
Through Young Rewired State and other collaborations in the project, namely with the Open Data Institute and the Open Knowledge Foundation, we will engage a range of other stakeholders in digital culture research in mobile environments from social media researchers to mobile phone companies. This will enable us to capture the requirements of different groups within this broad field while forging networks for future research on digital culture, especially in regards to big data.
Our collaboration with Young Rewired Space will create a new interdisciplinary community for arts and humanities research, given its deployment of humanities and arts skills in the analysis of technological space. Further, our focus on hackathons, a co-research practice borrowed from our partner Young Rewired State, affords both innovative and effective means for community engagement.
Our partners will be co-researchers throughout the entire project cycle. They will generate big social data in-community, and work with us to develop the tools and applications for both its capture and analysis. The end result will be an open environment for big social data research that engages young tech-savvy communities in researching the digital culture of mobile environments.
Discussions with our partners identified a range of needs and questions that they considered would benefit from knowledge exchange with arts and humanities research. These can be clustered around the following:
i) The impact and potential of digital technologies, digital culture and digital practices and the forms of communication that surround and inform us
ii) The changing nature of engagement with the mobile environment where young tech-savvy users contribute more and more directly to the development of tools and services
iii) Cultural transformations where academic, cultural and creative organisations are challenged to connect with new forms of knowledge in new environments. These organisations need to consider the new power structures in these environment as well as new audiences, which include spectators but also a range of diverse communities that actively transform these environments.
The primary impact of the project will be providing practical experience and critical thinking about the process and importance of digital culture research in exploring the big data components of digital personalities. We would expect all participants to end the project with:
i) A wider understanding of what constitutes digital personality, both explicit and tacit, in a mobile environment
ii) Applied experience of extracting and representing knowledge in a mobile environment, in particular in relation to personal information
iii) A method and digital assets for involving a wide range young coders to participate and contribute their knowledge and insights in future digital arts and humanities research
vii) A culture, in which arts and humanities research assumes an integral role in the analysis of mobile environments
viii) An understanding of the nature and form of the interactions that best support this culture.

Publications

10 25 50

publication icon
Coté, M. (2014) Advancing Digital Humanities

publication icon
Gill A (2017) Insight workflow: Systematically combining human and computational methods to explore textual data in Journal of the Association for Information Science and Technology

publication icon
Pybus J (2015) Hacking the social life of Big Data in Big Data & Society

 
Description Our research project Our Data, Ourselves seeks to democratise Big Social Data (BSD). Big data has already been widely identified as a key economic driver. We turn it into a community asset and develop tools, applications, formats and practices, which will enable important new research on and using BSD by arts and humanities researchers. We see community as being articulated though social and communicative practices, the very wellspring of BSD.
Exploitation Route The Our Data, Ourselves project seeks to democratise the trails of data we generate, and produce an anonymised community asset; available to researchers, those who generate them, and their communities. We have developed an app to harvest smartphone metadata, developed and tested with young coders from Young Rewired State. Our smartphone app is novel in that it seeks to make data producers aware of the nature and scale of the data they generate.
Sectors Communities and Social Services/Policy,Creative Economy,Digital/Communication/Information Technologies (including Software),Government, Democracy and Justice,Culture, Heritage, Museums and Collections

URL http://www.big-social-data.net
 
Description Impact on the development of mobile ecosystem privacy requirements
First Year Of Impact 2015
Sector Digital/Communication/Information Technologies (including Software),Education,Culture, Heritage, Museums and Collections
Impact Types Cultural,Societal

 
Description AHRC Amplification Award: Digital Transformations
Amount £79,951 (GBP)
Funding ID AH/M002551/1 
Organisation Arts & Humanities Research Council (AHRC) 
Sector Public
Country United Kingdom
Start 09/2015 
End 07/2016
 
Description AHRC Connecting Communities Conference in Cardiff
Amount £1,400 (GBP)
Organisation Arts & Humanities Research Council (AHRC) 
Sector Public
Country United Kingdom
Start 05/2014 
End 05/2014
 
Title YRS Tumblr Blogs 
Description Our YRS subjects have been asked to keep tumbr blogs so that we can measure their perceived mobile use with their actual use, as measured by the MobileMiner App. Our co-researchers have been asked to record every time they use their mobile, laptop, or tablet devices. They have been asked to either take a screen shot or record a brief entry that describes the activity they have engaged in. Our subjects have been asked to do this 5 times throughout the course of project; in addition, we have coordinate three days in which we have specifically told them to use the MobileMiner App while they maintain their media diaries. 
Type Of Material Technology assay or reagent 
Provided To Others? No  
Impact Given young people tend to underestimate their screen time and mobile use, this will help to a) make them more reflexive with regards to their media habits b) provide a measured approach to understanding the gap between perception and reality. Young people have argued that the current discourse to engage them with regards to privacy is weak and ineffective. By gaining a clearer understanding of both their online habits, in relation the amount of data that is both intentionally and unintentionally shared will help to build a stronger curriculum for pedagogical engagement. Attached find an example of our Tumblr accounts. 
URL http://tamara-herbert.tumblr.com/
 
Description 2nd Workshop on Big Humanities Data, held in conjunction with IEEE Big Data 2014 
Organisation University of Maryland
Country United States 
Sector Academic/University 
PI Contribution The workshop addressed applications of big data in the humanities, arts culture, and social science, the challenges and possibilities that such increased scale brings for scholarship in these areas.
Collaborator Contribution We co-organised the IEEE workshop including a funders panel at the end, with funders from Canada, US and UK.
Impact Proceedings in IEEE
Start Year 2014
 
Title MobileMiner 
Description Mobile Miner is an Android app that monitors network activity of other apps on the user's device. The data it collects is made available to users, and also uploaded anonymously to our custom CKAN instance. 
Type Of Technology Software 
Year Produced 2014 
Open Source License? Yes  
Impact Users of the app, especially the young coders involved in our project, will be made aware of the activities of other apps on their devices. The data will be shared with the academic community. 
URL http://kingsbsd.github.io/MobileMiner/
 
Title MobileMinerPlugin 
Description MobileMinerPlugin is a plugin for the CKAN [http://ckan.org/] data-portal that receives data from our MobileMiner Android app and makes it available to users of our CKAN instance. It also processes the data to produce several derived tables that describe the activity of mobile 'phone users and their apps. 
Type Of Technology Webtool/Application 
Year Produced 2014 
Impact This software allows our data to be collected, in order for our Young Rewired State coders to be able to reflect on it and also to share it with the academic community. 
URL https://github.com/kingsBSD/MobileMinerPlugin
 
Description Aarhus University: Big Social Data 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other academic audiences (collaborators, peers etc.)
Results and Impact Talk sparked questions and discussions afterwards leading to important networking opportunities.

The talk led to important networking opportunities, resulting in our invitation to a follow-up conference in London at the Institute for Historical Research at the University of London.
Year(s) Of Engagement Activity 2013
 
Description British Academy of Management: NEMODE Workshop 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other academic audiences (collaborators, peers etc.)
Results and Impact Talk sparked questions and discussion as was anticipated by the organizer. Mark was invited to this discussion given the very limited scope and understanding of big data in the management field.

The talk led to possible interdisciplinary research projects and possibilities around consultation.
Year(s) Of Engagement Activity 2014
 
Description Connecting Communities 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Other academic audiences (collaborators, peers etc.)
Results and Impact The AHRC Connected Communities event in Cardiff, UK was a cross-Council programme designed to help us understand the changing nature of communities in their historical and cultural contexts and the role of communities in sustaining and enhancing our quality of life. We set up a table and spoke with a number of people of the course of two days: profiling and providing information about our research; making important networking links and demonstrating some of our preliminary research results.

Notable impacts included an invitation from to the V&A for their Art and Design Weekend.
Year(s) Of Engagement Activity 2014
URL http://www.ahrc.ac.uk/Funding-Opportunities/Research-funding/Connected-Communities/Pages/Connected-C...
 
Description Digital Design Festival at the V&A 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact We took part in the AHRC Digital Transformation Booth at the V+A Digital Design Festival. It is difficult to count the number of people who attended but it was a very busy Sunday and because of the Snowden revelations our project attracted particular interest.

The result was a range of discussions with publishers, consultancies and the public in general. We are now working with publishers, technology providers and universities on several follow-on projects.
Year(s) Of Engagement Activity 2014
URL http://www.vam.ac.uk/whatson/event/3404/digital-design-weekend-4853/
 
Description Erasmus Programme at Salford University: Big Social Data 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Undergraduate students
Results and Impact Talk sparked questions and discussion afterwards.

Mark was the featured invited speaker, given his expertise in the field of big data. The organizers wanted an arts and humanities perspective on big data, allowing a more interdisciplinary approach for those computer science oriented students and scholars who were in attendance. His participation will lead to our future involvement with the Erasmus programme.
Year(s) Of Engagement Activity 2014
 
Description International Communication Association Talk on Big Social Data 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other academic audiences (collaborators, peers etc.)
Results and Impact Talk sparked discussion about the changing digital ecosystem. Many audience members where impressed by the research that clearly described the increased sophistication of the algorithmic practices of big social data. In addition, the audience was very supportive and interested in our articulation of data literacy.

Notable impacts include further requests for information and extremely positive feedback on the research done so far.
Year(s) Of Engagement Activity 2014
 
Description Istanbul Technical University: What is Big Social Data? 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other academic audiences (collaborators, peers etc.)
Results and Impact Talked sparked questions and discussion afterwards. The talk also sparked important possibilities around a future international research network.

Steps were made towards actualizing a future international research network.
Year(s) Of Engagement Activity 2014
 
Description King's India Institute: Talk on Big Social Data 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Policymakers/politicians
Results and Impact Talk sparked questions and discussion afterwards and expanded understanding of the role of big social data in society.

A number of participants were very keen on the discussion and details were exchanged to follow up with talks in the future.
Year(s) Of Engagement Activity 2014
 
Description Markov Clustering Lightening Talk 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Professional Practitioners
Results and Impact A lightening talk on our use of Markov Clustering for the analysis of Twitter data was given the Neo4J "GraphPub" meet-up, (http://www.meetup.com/graphdb-london/events/178586172/) and the first PyDataLondon meeting (http://www.meetup.com/PyData-London-Meetup/events/179396812/).

Both events simulated discussion, and we confirmed to audience members that our source-code was freely available.
Year(s) Of Engagement Activity 2014
URL http://www.slideshare.net/kingsBSD/twitter-35215472
 
Description Open Data Institute Research Day 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach Regional
Primary Audience Professional Practitioners
Results and Impact Providing and introduction to the research project.

The talk led to important networking links.
Year(s) Of Engagement Activity 2013
 
Description Our Data, Ourselves presentation at EMF Camp 2014 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Public/other audiences
Results and Impact Attended and presented at EMF Camp, (https://www.emfcamp.org/) a hacker/maker camp held near Bletchley. (Slides at: tinyurl.com/bsdemf)

Some members of the audience have downloaded our app, which alerts them to the activities of other apps their devices.
Year(s) Of Engagement Activity 2014
URL http://tinyurl.com/bsdemfcamp
 
Description Parliamentary Office for Science and Technology Big Data Event 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Policymakers/politicians
Results and Impact MPs and peers were given an introduction by Research Councils UK and POST (Parliamentary Office for Science and Technology) to raise awareness of the ways big data research is developing in science and technology.

We reported on how we could transform big data into a community asset for everyone and demoncratise it. We demonstrated the tools to enable important new arts and humanities research on digital culture.
Year(s) Of Engagement Activity 2014
URL http://www.ahrc.ac.uk/News-and-Events/News/Pages/Parliamentary-Event-%E2%80%93-Big-Data-Research-Exh...
 
Description Rutgers University: Data Literacy Not Media Literacy 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other academic audiences (collaborators, peers etc.)
Results and Impact Talked sparked questions afterwards about the rise of third parties and the number data marketing and analytic corporations that are sharing user-generated content. In addition, it sparked a very long discussion around the need to re-evaluate current, outdated media literacy pedagogical strategies. The idea of data-literacy was extremely well received and there was tremendous affirmation for the work we are engaging in.

After my talk I was asked to continue this discussion on a panel for the International Communication Association.
Year(s) Of Engagement Activity 2014
 
Description Talk at the AHRC Digital Transformations Panel at the Digital Humanities Congress in Sheffield 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other academic audiences (collaborators, peers etc.)
Results and Impact Participation in big data panel for the AHRC at the Digital Humanities Congress in Sheffield.

N/A
Year(s) Of Engagement Activity 2014
 
Description Transmediale Conference: Big Social Data Panel 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Other academic audiences (collaborators, peers etc.)
Results and Impact Talked sparked questions and discussion afterward and had a very far reach. It led to the wide spread circulation of our research to a diverse audience of digital artists, activists, general public and academics. The YouTube link of the panel has now reached over 1000 views.

After the talk we were asked to submit an article to Theory, Culture and Society.
Year(s) Of Engagement Activity 2014
URL https://www.youtube.com/watch?v=S8FiMva1cig
 
Description University of Westminster: Metadata: From Stasi to Syntax 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Other academic audiences (collaborators, peers etc.)
Results and Impact Talk sparked questions and discussion afterwards.

We have been asked to submit a chapter based on the talk to Edinburgh University Press.
Year(s) Of Engagement Activity 2014