EDGE - Adaptive Deep Learning Hardware for Embedded Platforms
Lead Research Organisation:
University of Essex
Department Name: Computer Sci and Electronic Engineering
Abstract
Deep learning (DL) is the key technique in modern artificial intelligence (AI), which has provided state-of-the-art accuracy on many machine-learning based applications. Today, although most of the computational loads of DL systems are still spent running neural networks in data centres, the ubiquity of smartphones, and the upcoming availability of self-contained wearable devices for augmented reality (AR), virtual reality (VR) and autonomous robot systems are placing heavy demands on DL-inference hardware with high energy and computing efficiencies along with rapid development of DL techniques. Recently, we have witnessed a distinct evolution in the types of DL architecture, with more sophisticated network architectures proposed to improve edge AI inference. This includes dynamic network architectures that change with each new input in a data-dependent way, where inputs and internal states are not fixed. Such new architectural concepts in DL are likely to affect the type of hardware architectures that will be required to deliver such capabilities in the future. This project precisely addresses this challenge and proposes to design a flexible hardware architecture that enables adaptive support for a variety of DL algorithms on embedded devices. Primarily, to produce lower cost, lower power and higher processing efficiency DL-inference hardware that can be configured adaptably for dedicated application specifications and operating environments, this will require radical innovation in the optimisation of both the software and the hardware of current DL techniques.
This work aims to perform fundamental research, development and practical demonstrator to enable general support for a variety of DL techniques on embedded edge devices with limited resource and latency budgets. Primarily, this requires radical innovation on the current DL architectures in terms of computing architecture, memory hierarchy and resource utilisation, as well as system latency and throughput: it is particularly important for the modern DL systems that the inference processes are dynamic, such as, the DL inference maybe input-dependent and resource-dependent. The proposal therefore seeks the following three thrusts: First, to build upon the existing work of the PI in optimising machine-learning models for resource-constrained embedded devices, towards achieving the goal that the network model could be dynamically optimised as needed through hardware-aware approximation techniques. Second, with newly-developed adaptive compute acceleration technology in programmable memory hierarchy and adaptive processing hardware, to seek a new ambitious direction to develop a set of context-aware hardware architectures to work closely with the approximation algorithms that can fully utilise the true hardware capabilities. Unlike traditional optimisation techniques for DL hardware inference engines, the proposed work will explore both software and hardware programmability of adaptive compute acceleration technology, in order to maximise the optimisation results for the target application scenarios. Third, this project will work closely with our industry and project partners to produce a practical demonstrator to showcase the effectiveness of the proposed DL framework versus traditional approaches, particularly, evaluating the effectiveness of the framework in real-world mission-critical applications.
This work aims to perform fundamental research, development and practical demonstrator to enable general support for a variety of DL techniques on embedded edge devices with limited resource and latency budgets. Primarily, this requires radical innovation on the current DL architectures in terms of computing architecture, memory hierarchy and resource utilisation, as well as system latency and throughput: it is particularly important for the modern DL systems that the inference processes are dynamic, such as, the DL inference maybe input-dependent and resource-dependent. The proposal therefore seeks the following three thrusts: First, to build upon the existing work of the PI in optimising machine-learning models for resource-constrained embedded devices, towards achieving the goal that the network model could be dynamically optimised as needed through hardware-aware approximation techniques. Second, with newly-developed adaptive compute acceleration technology in programmable memory hierarchy and adaptive processing hardware, to seek a new ambitious direction to develop a set of context-aware hardware architectures to work closely with the approximation algorithms that can fully utilise the true hardware capabilities. Unlike traditional optimisation techniques for DL hardware inference engines, the proposed work will explore both software and hardware programmability of adaptive compute acceleration technology, in order to maximise the optimisation results for the target application scenarios. Third, this project will work closely with our industry and project partners to produce a practical demonstrator to showcase the effectiveness of the proposed DL framework versus traditional approaches, particularly, evaluating the effectiveness of the framework in real-world mission-critical applications.
Organisations
Publications
Boukhennoufa I
(2023)
A Novel Model to Generate Heterogeneous and Realistic Time-Series Data for Post-Stroke Rehabilitation Assessment.
in IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society
Boukhennoufa I
(2022)
Predicting the Internal Knee Abduction Impulse During Walking Using Deep Learning
in Frontiers in Bioengineering and Biotechnology
Boukhennoufa I
(2022)
Wearable sensors and machine learning in post-stroke rehabilitation assessment: A systematic review
in Biomedical Signal Processing and Control
Boukhennoufa I
(2021)
A comprehensive evaluation of state-of-the-art time-series deep learning models for activity-recognition in post-stroke rehabilitation assessment.
in Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
Gao C
(2023)
Application Level Resource Scheduling for Deep Learning Acceleration on MPSoC
in Journal of Signal Processing Systems
Description | This work attempted the initial design on using adaptive software and hardware for accelerating deep learning neural networks, where the new design methods have been investigated to achieve better computing and energy efficiency. The current results of the award has shown potential ability of enabling flexible and adaptivity of deep learning hardware accelerators for highly diverse set of DNNs. In 2023, we have attempted the new design using DFX technology, which allowed us to build a flexible machine learning platform that allows run-time reconfiguration of both software and hardware setting, based on the initial testing, the results were promised. As the work is still on going, we are due to examine this work on newly available adaptive computing platform and wide range of benchmark suite. |
Exploitation Route | This work has attracted a large number of SMEs who are interested in using AI at edge. The proposed new method, can achieve high throughput on deep learning application, and it also can reduce energy consumption. Currently, we are working on local SME, to develop a smart solution to enhance vessel management and path management using Edge AI. |
Sectors | Digital/Communication/Information Technologies (including Software) Electronics |
Description | Majority of the impact from this award is still under development, few local companies were very interested in using the research findings in their business work, they think that is very attractive to resolve their current technical challenges, and they believe it can open new bossiness and reduce operation cost of the current operations. We are working on two local SMEs for demonstrating the technology on their business, and received IUK support to help them build safer operations for business operations. |
First Year Of Impact | 2023 |
Sector | Agriculture, Food and Drink,Digital/Communication/Information Technologies (including Software),Environment,Manufacturing, including Industrial Biotechology |
Impact Types | Societal Economic |
Description | 5G4PHealth: Enhanced 5G-Powered Platform for Predictive Preventive Personalized and Participatory Healthcare |
Amount | £907,042 (GBP) |
Funding ID | 10093679 |
Organisation | Innovate UK |
Sector | Public |
Country | United Kingdom |
Start | 03/2024 |
End | 03/2027 |
Description | Morello-HAT: Morello High-Level API and Tooling |
Amount | £1,128,653 (GBP) |
Funding ID | EP/X015955/1 |
Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
Sector | Public |
Country | United Kingdom |
Start | 06/2022 |
End | 12/2024 |
Description | Real-Time Federated Learning at the Wireless Edge via Algorithm-Hardware Co-Design |
Amount | £201,497 (GBP) |
Funding ID | EP/X019160/1 |
Organisation | Engineering and Physical Sciences Research Council (EPSRC) |
Sector | Public |
Country | United Kingdom |
Start | 02/2023 |
End | 11/2024 |
Description | University of Essex and Njord Offshore Ltd KTP 22_23 R3 |
Amount | £180,000 (GBP) |
Funding ID | 10049632 |
Organisation | Innovate UK |
Sector | Public |
Country | United Kingdom |
Start | 03/2023 |
End | 04/2025 |
Description | Conference presentation (FPL23) |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | Presented the work in FPL23, had interesting conversation with professionals from world, helped us refined the project. |
Year(s) Of Engagement Activity | 2023 |
Description | Conference presentation (NEWCAS23) |
Form Of Engagement Activity | Participation in an activity, workshop or similar |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Professional Practitioners |
Results and Impact | presented our work in the conference, made new connections with both academics and industry link. |
Year(s) Of Engagement Activity | 2023 |
Description | School Visit (Leicester) |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | National |
Primary Audience | Schools |
Results and Impact | about 100 students, academics attended the presentation, showing interests to the project. |
Year(s) Of Engagement Activity | 2023 |
Description | School Visit (Shannxi Normal University) |
Form Of Engagement Activity | A talk or presentation |
Part Of Official Scheme? | No |
Geographic Reach | International |
Primary Audience | Schools |
Results and Impact | 30 PG students and academics attended the seminar, and the team reported increased interest in this area, opened the joint papers and funding opportunities. |
Year(s) Of Engagement Activity | 2023 |