Linked Conservation Data - phase 2

Lead Research Organisation: University of the Arts London
Department Name: CCW Grad School

Abstract

Phase 2 of the Linked Conservation Data (LCD) project builds on an existing collaboration between University of the Arts London and Stanford Libraries and a number of other high profile US and UK partners as part of a Research Networking project funded under the AHRC's Highlight Notice for UK/US Collaborations in Digital Scholarship in Cultural Institutions.

LCD explores methods for enabling access to knowledge about collections in memory organisations such as museums, galleries, libraries and archives. The work of conservators in these organisations focuses on investigating the structure and condition of objects and treating and protecting them. A core task in conservation is documenting observations made during this work. This creates a wealth of records about material observations, evidence and conclusions on the history of each object. Combining this knowledge with other historical resources such as texts is crucial for researching and interpreting history particularly for contested objects where the narrative from material evidence may be different from the popular understanding of an object. LCD aims to provide ways that conservation documentation can be produced, disseminated and re-used more effectively through Linked Data (https://www.w3.org/standards/semanticweb/data) in order to enable new research and new interpretations through offering researchers enhanced access to conservation data. We anticipate that this will contribute to educational programmes utilising conservation data, the development of improved methods for the protection of our cultural heritage and increased public engagement with collections in memory organisations.

During phase 1, the LCD project:
- addressed the issue of not being able to cross-search conservation records about the same things because of different vocabularies and established a pathway for harmonising and combining conservation vocabularies so that they can work together (https://www.ligatus.org.uk/lcd/output/142),
- studied conservation records and identified some types which are difficult to describe and share and initiated discussions for new proposals to overcome these limitations (https://www.ligatus.org.uk/lcd/output/151),
- raised awareness of the value of data in the profession through workshops and webinars (https://www.ligatus.org.uk/lcd/meetings).

In phase 2 we aim to develop a Linked Data pilot implementation on book conservation which is of interest to partner organisations such as the Bodleian Library, the Library of Congress and the Stanford Libraries with different datasets harmonised into one system. The project builds on its existing partners and brings together national organisations and universities from the UK and US to form a critical mass of activity able to transform scholarship using conservation data. We are also partnering with major professional bodies in conservation who have agreed to co-author and co-sign a policy/strategy document to promote LCD's objectives. We are involving experts in education who will advise the consortium of how conservation data can be used to help educational programmes for schools in memory organisations.

LCD phase 2 is the first step for setting up infrastructure for hosting shared vocabularies and datasets for conservation. We are preparing the consortium for a follow-up phase and we are establishing the foundations for Linked Data projects not only in conservation but potentially in other fields as well. LCD will inform current conservation documentation projects in memory organisations (e.g. three projects are in progress within consortium partners alone). The pilot is essential for testing methods and strengthening the consortium, as partners will work together through cross-disciplinary collaboration. A consortium of this scale with backing from major professional bodies is rare and this is a unique opportunity to make a real change to academic scholarship in memory organisations.

Planned Impact

The benefits of implementing Linked Data for conservation documentation start with the creators of documentation in a department, spread out to the institution, to users of collections and the general public. Sharing conservation data can assist decision making, make research data accessible, improve institutional prioritisation and resource allocation and deepen understanding of material culture for a broad array of audiences. Not implementing Linked Data in Conservation also has consequences. Specialised knowledge remains siloed and inaccessible. Opportunities for collaborative research are lost. Innovation across collections and institutions is stifled. Linked Conservation Data (LCD) presents many opportunities for impact, but these can only be realised if we can build on initial work to invest in developing tools, educating conservation professionals, and running pilots to test early stage implementations.

Who might benefit from this research outside of the academic research community:
- Conservation and documentation professionals
- Managers of collection care departments
- Curators and educators inside and outside memory organisations

How might they benefit:
Conservation and documentation professionals: Conservators assess, survey, repair, prepare items to go on exhibit or to be safely stored. Conservation documentation contains critical information on materials, structure, failure and damage as well as treatment pathways for stabilisation and use. This information is not widely accessible to conservators or others in institutions after initial generation and it is very rarely visible to audiences outside the organisation which generates it.
As conservation documentation becomes linked from institution to institution, insights into a specific creator's or artist's materials and practice are more accessible. Ways to manage the deterioration of a common material are more widely understood. New efficiencies for addressing storage challenges are shared. Through sharing data, conservation and cultural heritage professionals broaden their understanding of common and unique challenges and so are better able to care for collections.
Conservators and documentation professionals will benefit from guidelines for making their records accessible both within and if desired outside of their institutions. They will benefit from research data made possible by the use of Linked Data. They will also benefit from the publication of thesauri within the field as a whole.

Managers of collection care departments: Increased understanding of how conservators currently and in the past have cared for collections, can offer critical information to the leadership of cultural heritage and memory institutions for resource allocation and innovation opportunities. Further guidance from an LCD policy document outlining aims in relation to conservation documentation will also provide critical information to managers and administrators for future planning. This policy document will be co-authored and co-signed by three major international professional organisations who are also dissemination partners for the consortium.

Curators and educators: Curators who study and interpret collections from memory organisations will benefit from new methods of embedding conservation information about the materiality of objects in their collections. This will improve catalogue records and provide additional context for exhibitions and curation activities. Educators within and outside memory organisations will benefit from articulating types of conservation records which can form excellent examples for learning activities. Similar material can produce insights that can be shared with the public who visit museums or use library collections.
 
Description This project allowed the consortium to gain experience with the development of a Linked Data pilot and the partners to explore their own documentation records.
Exploitation Route Institutions wishing to adopt Linked Data for conservation documentation can refer to the pilot as an exemplar.
Sectors Digital/Communication/Information Technologies (including Software),Culture, Heritage, Museums and Collections

URL https://www.ligatus.org.uk/lcd/
 
Description Projects and practicioners in cultural heritage consider the project as reference of good practice. The issue of sharing records in conservation has become more central to the domain discourse. Several organisations are considering the project as core reference when working on their conservation documentation systems.
First Year Of Impact 2021
Sector Digital/Communication/Information Technologies (including Software),Culture, Heritage, Museums and Collections
Impact Types Cultural

 
Description Citation in evidence for the government inquiry for Reproducibility and Research Integrity
Geographic Reach National 
Policy Influence Type Citation in systematic reviews
URL https://committees.parliament.uk/writtenevidence/39704/pdf/
 
Title CIDOC-CRM negative properties 
Description Extension of the CIDOC-CRM to express absence of features from objects. 
Type Of Material Computer model/algorithm 
Year Produced 2021 
Provided To Others? Yes  
Impact Allows expressing conservation data about object which lack features. 
URL https://github.com/linked-conservation-data/crmntp
 
Title Datasets for the pilot of Linked Conservation Data project 
Description Datasets used for the Pilot of the Linked Conservation Data project. This includes documentation reports on bookbinding treatments involving board re-attachment from the following organisations: Bodleian Libraries, the National Archives (UK), the Library of Congress and Stanford Libraries. 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
URL https://figshare.arts.ac.uk/articles/dataset/Datasets_for_the_pilot_of_Linked_Conservation_Data_proj...
 
Title Datasets for the pilot of Linked Conservation Data project 
Description Datasets used for the Pilot of the Linked Conservation Data project. This includes documentation reports on bookbinding treatments involving board re-attachment from the following organisations: Bodleian Libraries, the National Archives (UK), the Library of Congress and Stanford Libraries. 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
Impact Exemplar for sharing conservation documentation as Linked Data. This is currently being used by national memory organisations for designing their conservation documentation systems. 
URL https://figshare.arts.ac.uk/articles/dataset/Datasets_for_the_pilot_of_Linked_Conservation_Data_proj...
 
Title Linked Conservation Data Board Reattachment Pilot 
Description Four conservation studios (Library of Congress, The National Archives UK, Bodleian Library and Stanford Libraries) modeled their records to the CIDOC-CRM and converted them to Linked Data. These records were then integrated in a Linked Data Platform (ResearchSpace) and a number of research questions were expressed and answered based on them. 
Type Of Material Database/Collection of data 
Year Produced 2021 
Provided To Others? Yes  
Impact The pilot helped the consortium partners experience the process of exporting data as Linked Data. The Pilot is a point of reference for conservation Linked Data. 
URL https://lcd.researchspace.org/resource/rsp:Start
 
Title List of conservation sample records 
Description This is a collection of typical conservation records offered by participants of the workshops. 
Type Of Material Database/Collection of data 
Year Produced 2020 
Provided To Others? Yes  
Impact This allows testing the CIDOC-CRM ontology with real life records on a bottom-up approach to ensure that the kind of information that we need to record in conservation can be expressed with the CIDOC-CRM. 
URL https://www.ligatus.org.uk/lcd/sample-schemas
 
Title List of conservation vocabularies 
Description Following a survey with responses from professional in several organisations, a list of popular conservation vocabularies was compiled. 
Type Of Material Database/Collection of data 
Year Produced 2019 
Provided To Others? Yes  
Impact This enables fundamental work on the alignment of terminologies in conservation and subsequently it enables sharing of conservation records. 
URL https://www.ligatus.org.uk/lcd/controlled-vocabularies
 
Title TP and NTP full dataset 
Description This dataset includes statements about manuscripts from the library of St. Catherine Monastery in Sinai and specifically about the existence of leaf markers on each manuscript. The dataset is provided in three formats: CSV, OWL and RDFS.Leaf markers are not individually identified. Only their existence and type is indicated. The dataset is used to demonstrate a method of describing numerous individuals and absence of types in Knowledge Bases. all-records.csv is the original data as collected at the Monastery.all-records-owlcro.owl holds the original data alongside fictional records of individual leaf markers for each book (these do not exist but they are necessary to demonstrate the applicable method)all-records-owlcrop.owl holds the original data onlyThe same logic is followed for the RDF files.Due to the size of this dataset it may be possible to perform reasoning in OWL with it. A small indicative dataset is also available in this repository. 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
URL https://figshare.arts.ac.uk/articles/dataset/TP_and_NTP_full_dataset/19487411/1
 
Title TP and NTP full dataset 
Description This dataset includes statements about manuscripts from the library of St. Catherine Monastery in Sinai and specifically about the existence of leaf markers on each manuscript. The dataset is provided in three formats: CSV, OWL and RDFS.Leaf markers are not individually identified. Only their existence and type is indicated. The dataset is used to demonstrate a method of describing numerous individuals and absence of types in Knowledge Bases. all-records.csv is the original data as collected at the Monastery.all-records-owlcro.owl holds the original data alongside fictional records of individual leaf markers for each book (these do not exist but they are necessary to demonstrate the applicable method)all-records-owlcrop.owl holds the original data onlyThe same logic is followed for the RDF files.Due to the size of this dataset it may be possible to perform reasoning in OWL with it. A small indicative dataset is also available in this repository. 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
URL https://figshare.arts.ac.uk/articles/dataset/TP_and_NTP_full_dataset/19487411
 
Title TP and NTP small dataset 
Description This dataset includes statements about manuscripts from the library of St. Catherine Monastery in Sinai and specifically about the existence of leaf markers on each manuscript. The dataset is provided in three formats: CSV, OWL and RDF.Leaf markers are not individually identified. Only their existence and type is indicated. The dataset is used to demonstrate a method of describing numerous individuals and absence of types in Knowledge Bases. two-records.csv is part of the original data as collected at the Monastery.two-records-owlcro.owl holds part of the original data alongside fictional records of individual leaf markers for each book (these do not exist but they are necessary to demonstrate the applicable method)two-records-owlcrop.owl holds part of the original data onlyThe same logic is followed for the RDF files.The size of this dataset allows performing test reasoning in OWL. A full dataset is also available in this repository. 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
URL https://figshare.arts.ac.uk/articles/dataset/TP_and_NTP_small_dataset/19487468
 
Title TP and NTP small dataset 
Description This dataset includes statements about manuscripts from the library of St. Catherine Monastery in Sinai and specifically about the existence of leaf markers on each manuscript. The dataset is provided in three formats: CSV, OWL and RDF.Leaf markers are not individually identified. Only their existence and type is indicated. The dataset is used to demonstrate a method of describing numerous individuals and absence of types in Knowledge Bases. two-records.csv is part of the original data as collected at the Monastery.two-records-owlcro.owl holds part of the original data alongside fictional records of individual leaf markers for each book (these do not exist but they are necessary to demonstrate the applicable method)two-records-owlcrop.owl holds part of the original data onlyThe same logic is followed for the RDF files.The size of this dataset allows performing test reasoning in OWL. A full dataset is also available in this repository. 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
URL https://figshare.arts.ac.uk/articles/dataset/TP_and_NTP_small_dataset/19487468/1
 
Title Vocabularies for the pilot of Linked Conservation Data project 
Description Vocabularies in SKOS format used for the Pilot of the Linked Conservation Data project. This includes local vocabularies from the following organisations: Bodleian Libraries, the National Archives (UK), the Library of Congress and Stanford Libraries. The vocabularies include partial reconciliation records with the Getty AAT and Language of Binding Thesaurus. 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
Impact Best practice exemplar for sharing conservation vocabularies as Linked Open Data. 
URL https://figshare.arts.ac.uk/articles/dataset/Vocabularies_for_the_pilot_of_Linked_Conservation_Data_...
 
Title Vocabularies for the pilot of Linked Conservation Data project 
Description Vocabularies in SKOS format used for the Pilot of the Linked Conservation Data project. This includes local vocabularies from the following organisations: Bodleian Libraries, the National Archives (UK), the Library of Congress and Stanford Libraries. The vocabularies include partial reconciliation records with the Getty AAT and Language of Binding Thesaurus. 
Type Of Material Database/Collection of data 
Year Produced 2022 
Provided To Others? Yes  
URL https://figshare.arts.ac.uk/articles/dataset/Vocabularies_for_the_pilot_of_Linked_Conservation_Data_...
 
Description New PhD studentship with TNA 
Organisation The National Archives
Country United Kingdom 
Sector Public 
PI Contribution Contributed to a new PhD studentship on a subject relevant to Linked Conservation Data.
Collaborator Contribution Contributed to a new PhD studentship on a subject relevant to Linked Conservation Data.
Impact New PhD studentship.
Start Year 2020
 
Description Technical work with the Linked Art consortium 
Organisation University of Oxford
Country United Kingdom 
Sector Academic/University 
PI Contribution LCD provides conservation advice and use cases to the modelling meetings of the Linked Art consortium.
Collaborator Contribution Linked Art members contribute to the modelling workshops organised by LCD.
Impact LCD modelling workshops Linked Art modelling meetings
Start Year 2020
 
Description Conservation Data Policy workshop 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Policymakers/politicians
Results and Impact Discussion on the adoption of the policy primer and template produced by the project.
Year(s) Of Engagement Activity 2020
URL https://www.ligatus.org.uk/lcd/meeting/218
 
Description Education workshop 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact To explore the potential of Linked Data for conservation education and outreach activities in museums and school
Year(s) Of Engagement Activity 2020
URL https://www.ligatus.org.uk/lcd/meeting/212
 
Description Modelling workshop (phase 2) 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Workshop for introducing conservators to Linked Data and the benefits of sharing records. Also testing the capacity of the CIDOC-CRM to express conservation data based on contributed sample records from the participants.
Year(s) Of Engagement Activity 2021
URL https://www.ligatus.org.uk/lcd/meeting/modelling-2