Fair and modular blockchain data infrastructure for open science and society (FairOnChain)
Lead Research Organisation:
Imperial College London
Department Name: Computing
Abstract
Public blockchains, such as Bitcoin and Ethereum, are publicly accessible by design, but their data cannot be easily accessed and analysed without proper structure and indexing. The objective of this project is to develop a publicly accessible infrastructure that enables easy access and searchability of blockchain data in accordance with the FAIR principles (Findable Accessible Interoperable Replicable) of open science. This will promote complete transparency and reproducibility of scientific analysis results in the blockchain field - something that does not exist today - facilitating the growth of new and existing applications and collaborations.
At present, structured analyses are typically performed using proprietary solutions and databases, which make reproducibility and sharing of data within the scientific community challenging and expensive. Additionally, even though scientific studies often perform common operations to collect data systematically, the tools and libraries developed and used are rarely shared with the broader scientific community. As a result, similar research carried out by different institutions and groups often requires the re-implementation of the same software tools, leading to wasted resources and an inability to reproduce and compare results.
As part of this project, we plan to provide the scientific community with:
(a) Publicly accessible and expandable datasets and infrastructure that include structured, daily updated blockchain transaction data. Researchers will be able to access raw transaction data and community-maintained, enriched datasets in a uniform and open manner, promoting the availability and reuse of these complex data.
(b) An open-source software framework and standardized data access APIs that enable effective querying, annotating, and referencing of data, and building well-described reusable workflows and pipelines that will facilitate the exchange and replication of scientific results
according to the FAIR principles of open science.
This project also aims to provide an effective solution and tool for the European Commission to certify blockchain transactions as required by the recently voted (08/12/2022) European tax rules, also known as the eighth Directive on Administrative Cooperation (DAC8), and for which a public and generally accepted solution does not yet exist.
At present, structured analyses are typically performed using proprietary solutions and databases, which make reproducibility and sharing of data within the scientific community challenging and expensive. Additionally, even though scientific studies often perform common operations to collect data systematically, the tools and libraries developed and used are rarely shared with the broader scientific community. As a result, similar research carried out by different institutions and groups often requires the re-implementation of the same software tools, leading to wasted resources and an inability to reproduce and compare results.
As part of this project, we plan to provide the scientific community with:
(a) Publicly accessible and expandable datasets and infrastructure that include structured, daily updated blockchain transaction data. Researchers will be able to access raw transaction data and community-maintained, enriched datasets in a uniform and open manner, promoting the availability and reuse of these complex data.
(b) An open-source software framework and standardized data access APIs that enable effective querying, annotating, and referencing of data, and building well-described reusable workflows and pipelines that will facilitate the exchange and replication of scientific results
according to the FAIR principles of open science.
This project also aims to provide an effective solution and tool for the European Commission to certify blockchain transactions as required by the recently voted (08/12/2022) European tax rules, also known as the eighth Directive on Administrative Cooperation (DAC8), and for which a public and generally accepted solution does not yet exist.
Organisations
People |
ORCID iD |
| William Knottenbelt (Principal Investigator) |
| Description | There are a lot of different blockchains of different kinds now in existence, the most famous of which is the Bitcoin blockchain. Making all the data in these blockchains available to the public in a unified and timely manner is a challenge. As a result of this award, we have established new unifying standards for the representation of data from different blockchains and defined new Application Programming Interface standards for researchers and others to access the data. |
| Exploitation Route | Researchers and others will now have access to a variety of data from several different blockchains, all in a common format. This is expected to enable many research efforts since collection of blockchain data is often a significant barrier that involves a lot of cost and time, especially when this needs to be done across blockchains. This will especially enable research into the interoperability of blockchains. |
| Sectors | Digital/Communication/Information Technologies (including Software) Financial Services and Management Consultancy |
| URL | https://www.faironchain.org/ |
| Description | Project overview video |
| Form Of Engagement Activity | A broadcast e.g. TV/radio/film/podcast (other than news/press) |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Public/other audiences |
| Results and Impact | A video to inform and engage the general public about the goals of the FairOnChain project. |
| Year(s) Of Engagement Activity | 2024 |
| URL | https://media.githubusercontent.com/media/FairOnChain/faironchain.github.io/main/assets/video/grid.m... |