DEAL - DEcentrAlised Learning for automated image analysis and biodiversity monitoring
Lead Research Organisation:
Plymouth Marine Laboratory
Abstract
As recognised in the UN Decade of Ocean Science for Sustainable Development (2021-2030) programme (https://oceandecade.org/), frequent and high-quality ocean observations are critical for effective marine management and decision-making. However, at the current time, marine biodiversity remains poorly observed, with sampling methods heavily reliant on infrequent and expensive ship-based observations.
Significant advances have been made in developing marine autonomous imaging platforms that collect data for specific organisms, including microscopic plankton that sit at the base of the ocean food web, and sea floor biota. Such platforms can generate millions of images that have the potential to revolutionise our understanding of marine biodiversity, and to facilitate a step change in marine biodiversity monitoring, allowing fine-scale spatial and temporal trends to be resolved. However, for this revolution to be realised, first high-throughput and high-efficacy classification and analysis tools must also be developed.
Fully supervised machine learning models that are given sufficient, high-quality, labelled data have been demonstrated to achieve reliable, high efficacy classification results. This is a pre-requisite for any operational, automated classification system. The main challenge encountered in the field of marine biodiversity monitoring is obtaining sufficient labelled data to employ a supervised learning approach. At the present time, work on automated image classifiers is highly fragmented, and there is an absence of common standards. Many research teams working on plankton act as individuals, building bespoke classifiers which are trained against limited image data, often from a single instrument. The problem with this approach is that individual researchers are ultimately limited by the amount of data they have been able to collate and label, which leads to results with higher biases that inhibit their incorporation into operational biodiversity monitoring platforms. The models are also incapable of detecting new or rare organisms that weren’t present within the original training data, meaning they are unsuitable for studying important processes such as the emergence of invasive species. Meanwhile, centralised services which require users to share their image data with a single custodian by uploading it to a central server before the images are classified, raise concerns over privacy and ownership of data. The process is also highly inefficient, as it relies on the transfer of potentially large volumes of data and its duplication on multiple servers.
In this project, we will develop a web-based application which addresses these problems by using the Swarm Learning Framework which has been developed by our project partner Hewitt Packard Enterprise (HPE). Swarm Learning allows users to participate in a decentralised, collaborative network without a central server. It makes it possible for users to benefit from each other’s data and learnings, and to collaborate in the building of better classification models with lower biases. At the same time, the system preserves data privacy and reduces inefficiencies and carbon costs associated with the transfer and duplication of large volumes of data. By further partnering with world leaders in plankton and sea floor imaging, we will deliver two operational networks for classifying plankton and sea floor image data. We intend the tool and the initial networks we form to act as catalysts, helping to build communities in which data producers coalesce around a set of shared standards, and cooperate in making marine image data suitable for operational biodiversity monitoring.
Significant advances have been made in developing marine autonomous imaging platforms that collect data for specific organisms, including microscopic plankton that sit at the base of the ocean food web, and sea floor biota. Such platforms can generate millions of images that have the potential to revolutionise our understanding of marine biodiversity, and to facilitate a step change in marine biodiversity monitoring, allowing fine-scale spatial and temporal trends to be resolved. However, for this revolution to be realised, first high-throughput and high-efficacy classification and analysis tools must also be developed.
Fully supervised machine learning models that are given sufficient, high-quality, labelled data have been demonstrated to achieve reliable, high efficacy classification results. This is a pre-requisite for any operational, automated classification system. The main challenge encountered in the field of marine biodiversity monitoring is obtaining sufficient labelled data to employ a supervised learning approach. At the present time, work on automated image classifiers is highly fragmented, and there is an absence of common standards. Many research teams working on plankton act as individuals, building bespoke classifiers which are trained against limited image data, often from a single instrument. The problem with this approach is that individual researchers are ultimately limited by the amount of data they have been able to collate and label, which leads to results with higher biases that inhibit their incorporation into operational biodiversity monitoring platforms. The models are also incapable of detecting new or rare organisms that weren’t present within the original training data, meaning they are unsuitable for studying important processes such as the emergence of invasive species. Meanwhile, centralised services which require users to share their image data with a single custodian by uploading it to a central server before the images are classified, raise concerns over privacy and ownership of data. The process is also highly inefficient, as it relies on the transfer of potentially large volumes of data and its duplication on multiple servers.
In this project, we will develop a web-based application which addresses these problems by using the Swarm Learning Framework which has been developed by our project partner Hewitt Packard Enterprise (HPE). Swarm Learning allows users to participate in a decentralised, collaborative network without a central server. It makes it possible for users to benefit from each other’s data and learnings, and to collaborate in the building of better classification models with lower biases. At the same time, the system preserves data privacy and reduces inefficiencies and carbon costs associated with the transfer and duplication of large volumes of data. By further partnering with world leaders in plankton and sea floor imaging, we will deliver two operational networks for classifying plankton and sea floor image data. We intend the tool and the initial networks we form to act as catalysts, helping to build communities in which data producers coalesce around a set of shared standards, and cooperate in making marine image data suitable for operational biodiversity monitoring.
Organisations
- Plymouth Marine Laboratory (Lead Research Organisation)
- CENTRE FOR ENVIRONMENT, FISHERIES AND AQUACULTURE SCIENCE (Collaboration)
- Joint Nature Conservancy Council (Collaboration)
- UNIVERSITY OF GLASGOW (Collaboration)
- Woods Hole Oceanographic Institution (Collaboration)
- Norwegian Institute of Marine Research (Collaboration)
- The Finnish Environment Institute (Collaboration)
- Hewlett Packard Enterprise (HPE) (Collaboration)
Publications
Clark J
(2025)
The Western Channel Observatory Automated Plankton Imaging and Classification System
in Oceanography
| Description | Collaboration between Plankton Imaging groups |
| Organisation | Centre For Environment, Fisheries And Aquaculture Science |
| Country | United Kingdom |
| Sector | Public |
| PI Contribution | Sharing of marine image data for automatic classification. |
| Collaborator Contribution | Shared data and helped define technical requirements of the application. |
| Impact | The partnership involves sharing expertise in machine learning and marine taxonomy. |
| Start Year | 2024 |
| Description | Collaboration between Plankton Imaging groups |
| Organisation | Hewlett Packard Enterprise (HPE) |
| Country | United Kingdom |
| Sector | Private |
| PI Contribution | Sharing of marine image data for automatic classification. |
| Collaborator Contribution | Shared data and helped define technical requirements of the application. |
| Impact | The partnership involves sharing expertise in machine learning and marine taxonomy. |
| Start Year | 2024 |
| Description | Collaboration between Plankton Imaging groups |
| Organisation | Joint Nature Conservancy Council |
| Country | United Kingdom |
| Sector | Public |
| PI Contribution | Sharing of marine image data for automatic classification. |
| Collaborator Contribution | Shared data and helped define technical requirements of the application. |
| Impact | The partnership involves sharing expertise in machine learning and marine taxonomy. |
| Start Year | 2024 |
| Description | Collaboration between Plankton Imaging groups |
| Organisation | Norwegian Institute of Marine Research |
| Country | Norway |
| Sector | Academic/University |
| PI Contribution | Sharing of marine image data for automatic classification. |
| Collaborator Contribution | Shared data and helped define technical requirements of the application. |
| Impact | The partnership involves sharing expertise in machine learning and marine taxonomy. |
| Start Year | 2024 |
| Description | Collaboration between Plankton Imaging groups |
| Organisation | The Finnish Environment Institute |
| Country | Finland |
| Sector | Academic/University |
| PI Contribution | Sharing of marine image data for automatic classification. |
| Collaborator Contribution | Shared data and helped define technical requirements of the application. |
| Impact | The partnership involves sharing expertise in machine learning and marine taxonomy. |
| Start Year | 2024 |
| Description | Collaboration between Plankton Imaging groups |
| Organisation | University of Glasgow |
| Country | United Kingdom |
| Sector | Academic/University |
| PI Contribution | Sharing of marine image data for automatic classification. |
| Collaborator Contribution | Shared data and helped define technical requirements of the application. |
| Impact | The partnership involves sharing expertise in machine learning and marine taxonomy. |
| Start Year | 2024 |
| Description | Collaboration between Plankton Imaging groups |
| Organisation | Woods Hole Oceanographic Institution |
| Country | United States |
| Sector | Charity/Non Profit |
| PI Contribution | Sharing of marine image data for automatic classification. |
| Collaborator Contribution | Shared data and helped define technical requirements of the application. |
| Impact | The partnership involves sharing expertise in machine learning and marine taxonomy. |
| Start Year | 2024 |
| Description | Interview for ITN on the costs and benefits of AI |
| Form Of Engagement Activity | A press release, press conference or response to a media enquiry/interview |
| Part Of Official Scheme? | No |
| Geographic Reach | National |
| Primary Audience | Public/other audiences |
| Results and Impact | We were interviewed by ITN for a piece there were doing on the energy demands of AI, and its benefits. For the benefits, they discussed the use of AI in classifying marine image data - including that from our APICS cameras. |
| Year(s) Of Engagement Activity | 2025 |
| URL | https://www.itv.com/news/2025-02-10/can-renewable-energy-keep-up-with-the-increasing-power-demands-o... |
| Description | Interview for documentary on plankton |
| Form Of Engagement Activity | A broadcast e.g. TV/radio/film/podcast (other than news/press) |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Public/other audiences |
| Results and Impact | CGTN Europe filmed us for a documentary on plankton. |
| Year(s) Of Engagement Activity | 2025 |
| URL | https://newseu.cgtn.com/news/2025-02-22/RAZOR-What-has-plankton-ever-done-for-us--1B6jKimUOcM/p.html |
| Description | Marine imaging workshop |
| Form Of Engagement Activity | A formal working group, expert panel or dialogue |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Professional Practitioners |
| Results and Impact | Approximately 25 people attended a workshop to discuss end user requirements for a new plankton classifier we are creating. |
| Year(s) Of Engagement Activity | 2025 |
| Description | Ocean Decade meeting in Barcelona, 2024 |
| Form Of Engagement Activity | A talk or presentation |
| Part Of Official Scheme? | No |
| Geographic Reach | International |
| Primary Audience | Policymakers/politicians |
| Results and Impact | Participated in an expert panel on marine plastic pollution at the Ocean Decade meeting. Contributed to a presentation on next generation ocean technology. |
| Year(s) Of Engagement Activity | 2024 |
| URL | https://oceandecade.org/events/2024-ocean-decade-conference/ |
| Description | Royal Society Summer Science Exhibition |
| Form Of Engagement Activity | Participation in an activity, workshop or similar |
| Part Of Official Scheme? | No |
| Geographic Reach | National |
| Primary Audience | Public/other audiences |
| Results and Impact | I stood on the stand called Deep heat: how a warming ocean is challenging life on earth. I spent two days talking to students, the general public, scientists, people from the media and government about ocean science. |
| Year(s) Of Engagement Activity | 2024 |
| URL | https://royalsociety.org/science-events-and-lectures/2024/summer-science-exhibition/all-exhibits/ |