EDGE - Adaptive Deep Learning Hardware for Embedded Platforms

Lead Research Organisation: University of Essex

Department Name: Computer Sci and Electronic Engineering

Abstract

Deep learning (DL) is the key technique in modern artificial intelligence (AI), which has provided state-of-the-art accuracy on many machine-learning based applications. Today, although most of the computational loads of DL systems are still spent running neural networks in data centres, the ubiquity of smartphones, and the upcoming availability of self-contained wearable devices for augmented reality (AR), virtual reality (VR) and autonomous robot systems are placing heavy demands on DL-inference hardware with high energy and computing efficiencies along with rapid development of DL techniques. Recently, we have witnessed a distinct evolution in the types of DL architecture, with more sophisticated network architectures proposed to improve edge AI inference. This includes dynamic network architectures that change with each new input in a data-dependent way, where inputs and internal states are not fixed. Such new architectural concepts in DL are likely to affect the type of hardware architectures that will be required to deliver such capabilities in the future. This project precisely addresses this challenge and proposes to design a flexible hardware architecture that enables adaptive support for a variety of DL algorithms on embedded devices. Primarily, to produce lower cost, lower power and higher processing efficiency DL-inference hardware that can be configured adaptably for dedicated application specifications and operating environments, this will require radical innovation in the optimisation of both the software and the hardware of current DL techniques.

This work aims to perform fundamental research, development and practical demonstrator to enable general support for a variety of DL techniques on embedded edge devices with limited resource and latency budgets. Primarily, this requires radical innovation on the current DL architectures in terms of computing architecture, memory hierarchy and resource utilisation, as well as system latency and throughput: it is particularly important for the modern DL systems that the inference processes are dynamic, such as, the DL inference maybe input-dependent and resource-dependent. The proposal therefore seeks the following three thrusts: First, to build upon the existing work of the PI in optimising machine-learning models for resource-constrained embedded devices, towards achieving the goal that the network model could be dynamically optimised as needed through hardware-aware approximation techniques. Second, with newly-developed adaptive compute acceleration technology in programmable memory hierarchy and adaptive processing hardware, to seek a new ambitious direction to develop a set of context-aware hardware architectures to work closely with the approximation algorithms that can fully utilise the true hardware capabilities. Unlike traditional optimisation techniques for DL hardware inference engines, the proposed work will explore both software and hardware programmability of adaptive compute acceleration technology, in order to maximise the optimisation results for the target application scenarios. Third, this project will work closely with our industry and project partners to produce a practical demonstrator to showcase the effectiveness of the proposed DL framework versus traditional approaches, particularly, evaluating the effectiveness of the framework in real-world mission-critical applications.

Funded Value:

£232,165

Funded Period:

Dec 21 - May 24

Funder:

EPSRC

Project Status:

Closed

Project Category:

Research Grant

Project Reference:

EP/V034111/1

Principal Investigator:

Xiaojun Zhai

Research Subject:

Info. & commun. Technol. (100%)

Research Topic:

Artificial Intelligence (25%)

Fundamentals of Computing (75%)

Organisations

People	ORCID iD
Xiaojun Zhai (Principal Investigator)	http://orcid.org/0000-0002-1030-8311

Publications

Author Name

Title Publication Date Published

|< < 1 2 3 4 > >|

10 25 50

Altai Z (2023) Performance of multiple neural networks in predicting lower limb joint moments using wearable sensors in Frontiers in Bioengineering and Biotechnology

Borowski M (2023) Anomaly Behaviour tracing of CHERI-RISC V using Hardware-Software Co-design

Boukhennoufa I (2022) Wearable sensors and machine learning in post-stroke rehabilitation assessment: A systematic review in Biomedical Signal Processing and Control

Boukhennoufa I (2021) A comprehensive evaluation of state-of-the-art time-series deep learning models for activity-recognition in post-stroke rehabilitation assessment. in Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference

Boukhennoufa I (2022) Pattern Recognition and Artificial Intelligence - Third International Conference, ICPRAI 2022, Paris, France, June 1-3, 2022, Proceedings, Part II

Boukhennoufa I (2022) Predicting the Internal Knee Abduction Impulse During Walking Using Deep Learning. in Frontiers in bioengineering and biotechnology

Boukhennoufa I (2023) A Novel Model to Generate Heterogeneous and Realistic Time-Series Data for Post-Stroke Rehabilitation Assessment. in IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society

Gao C (2023) Modelling and Analysis of FPGA-based MPSoC System with Multiple DNN Accelerators

Gao C (2023) Application Level Resource Scheduling for Deep Learning Acceleration on MPSoC in Journal of Signal Processing Systems

Gao C (2022) Deep Learning on FPGAs with Multiple Service Levels for Edge Computing

Key Findings
Impact Summary
Further Funding
Engagement Activities


Description	This work attempted the initial design on using adaptive software and hardware for accelerating deep learning neural networks, where the new design methods have been investigated to achieve better computing and energy efficiency. The current results of the award has shown potential ability of enabling flexible and adaptivity of deep learning hardware accelerators for highly diverse set of DNNs. In 2023, we have attempted the new design using DFX technology, which allowed us to build a flexible machine learning platform that allows run-time reconfiguration of both software and hardware setting, based on the initial testing, the results were promised. As the work is still on going, we are due to examine this work on newly available adaptive computing platform and wide range of benchmark suite.
Exploitation Route	This work has attracted a large number of SMEs who are interested in using AI at edge. The proposed new method, can achieve high throughput on deep learning application, and it also can reduce energy consumption. Currently, we are working on local SME, to develop a smart solution to enhance vessel management and path management using Edge AI.
Sectors	Digital/Communication/Information Technologies (including Software) Electronics


Description	Majority of the impact from this award is still under development, few local companies were very interested in using the research findings in their business work, they think that is very attractive to resolve their current technical challenges, and they believe it can open new bossiness and reduce operation cost of the current operations. We are working on two local SMEs for demonstrating the technology on their business, and received IUK support to help them build safer operations for business operations.
First Year Of Impact	2023
Sector	Agriculture, Food and Drink,Digital/Communication/Information Technologies (including Software),Environment,Manufacturing, including Industrial Biotechology
Impact Types	Societal Economic


Description	5G4PHealth: Enhanced 5G-Powered Platform for Predictive Preventive Personalized and Participatory Healthcare
Amount	£907,042 (GBP)
Funding ID	10093679
Organisation	Innovate UK
Sector	Public
Country	United Kingdom
Start	03/2024
End	03/2027


Description	EcoRoutePlanner: Dynamic Daily Route Planning and Scheduling for Crew Transfer Vessels in Offshore Wind Farms
Amount	£130,373 (GBP)
Funding ID	10132557
Organisation	Innovate UK
Sector	Public
Country	United Kingdom
Start	11/2024
End	03/2025


Description	IDEAL: Reducing Carbon Footprints of IoT Devices through Extension of Active Lifespans
Amount	£1,493,582 (GBP)
Funding ID	EP/Z533749/1
Organisation	Engineering and Physical Sciences Research Council (EPSRC)
Sector	Public
Country	United Kingdom
Start	03/2025
End	04/2028


Description	Morello-HAT: Morello High-Level API and Tooling
Amount	£1,128,653 (GBP)
Funding ID	EP/X015955/1
Organisation	Engineering and Physical Sciences Research Council (EPSRC)
Sector	Public
Country	United Kingdom
Start	06/2022
End	12/2024


Description	Real-Time Federated Learning at the Wireless Edge via Algorithm-Hardware Co-Design
Amount	£201,497 (GBP)
Funding ID	EP/X019160/1
Organisation	Engineering and Physical Sciences Research Council (EPSRC)
Sector	Public
Country	United Kingdom
Start	02/2023
End	11/2024


Description	University of Essex and Njord Offshore Ltd KTP 22_23 R3
Amount	£180,000 (GBP)
Funding ID	10049632
Organisation	Innovate UK
Sector	Public
Country	United Kingdom
Start	03/2023
End	04/2025


Description	Conference presentation (COINS2024)
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Other audiences
Results and Impact	Presented research work to COINS2024, which engaged with wide audiences from academia and industries.
Year(s) Of Engagement Activity	2024


Description	Conference presentation (FPL23)
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Professional Practitioners
Results and Impact	Presented the work in FPL23, had interesting conversation with professionals from world, helped us refined the project.
Year(s) Of Engagement Activity	2023


Description	Conference presentation (ICAC2024)
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Schools
Results and Impact	Presented work in ICAC2024, which engaged with academics from world wide and industry partners.
Year(s) Of Engagement Activity	2024


Description	Conference presentation (NEWCAS23)
Form Of Engagement Activity	Participation in an activity, workshop or similar
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Professional Practitioners
Results and Impact	presented our work in the conference, made new connections with both academics and industry link.
Year(s) Of Engagement Activity	2023


Description	School Visit (Exeter)
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	National
Primary Audience	Schools
Results and Impact	30 academic and students attended for a research presentation, which sparked questions and discussion afterwards, and increased interest in edge AI and opened potential collaborations.
Year(s) Of Engagement Activity	2024


Description	School Visit (Leicester)
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	National
Primary Audience	Schools
Results and Impact	about 100 students, academics attended the presentation, showing interests to the project.
Year(s) Of Engagement Activity	2023


Description	School Visit (Shannxi Normal University)
Form Of Engagement Activity	A talk or presentation
Part Of Official Scheme?	No
Geographic Reach	International
Primary Audience	Schools
Results and Impact	30 PG students and academics attended the seminar, and the team reported increased interest in this area, opened the joint papers and funding opportunities.
Year(s) Of Engagement Activity	2023

Abstract

Organisations

People

ORCID iD

Publications