EDGE - Adaptive Deep Learning Hardware for Embedded Platforms

Lead Research Organisation: University of Essex
Department Name: Computer Sci and Electronic Engineering

Abstract

Deep learning (DL) is the key technique in modern artificial intelligence (AI), which has provided state-of-the-art accuracy on many machine-learning based applications. Today, although most of the computational loads of DL systems are still spent running neural networks in data centres, the ubiquity of smartphones, and the upcoming availability of self-contained wearable devices for augmented reality (AR), virtual reality (VR) and autonomous robot systems are placing heavy demands on DL-inference hardware with high energy and computing efficiencies along with rapid development of DL techniques. Recently, we have witnessed a distinct evolution in the types of DL architecture, with more sophisticated network architectures proposed to improve edge AI inference. This includes dynamic network architectures that change with each new input in a data-dependent way, where inputs and internal states are not fixed. Such new architectural concepts in DL are likely to affect the type of hardware architectures that will be required to deliver such capabilities in the future. This project precisely addresses this challenge and proposes to design a flexible hardware architecture that enables adaptive support for a variety of DL algorithms on embedded devices. Primarily, to produce lower cost, lower power and higher processing efficiency DL-inference hardware that can be configured adaptably for dedicated application specifications and operating environments, this will require radical innovation in the optimisation of both the software and the hardware of current DL techniques.

This work aims to perform fundamental research, development and practical demonstrator to enable general support for a variety of DL techniques on embedded edge devices with limited resource and latency budgets. Primarily, this requires radical innovation on the current DL architectures in terms of computing architecture, memory hierarchy and resource utilisation, as well as system latency and throughput: it is particularly important for the modern DL systems that the inference processes are dynamic, such as, the DL inference maybe input-dependent and resource-dependent. The proposal therefore seeks the following three thrusts: First, to build upon the existing work of the PI in optimising machine-learning models for resource-constrained embedded devices, towards achieving the goal that the network model could be dynamically optimised as needed through hardware-aware approximation techniques. Second, with newly-developed adaptive compute acceleration technology in programmable memory hierarchy and adaptive processing hardware, to seek a new ambitious direction to develop a set of context-aware hardware architectures to work closely with the approximation algorithms that can fully utilise the true hardware capabilities. Unlike traditional optimisation techniques for DL hardware inference engines, the proposed work will explore both software and hardware programmability of adaptive compute acceleration technology, in order to maximise the optimisation results for the target application scenarios. Third, this project will work closely with our industry and project partners to produce a practical demonstrator to showcase the effectiveness of the proposed DL framework versus traditional approaches, particularly, evaluating the effectiveness of the framework in real-world mission-critical applications.

Publications

10 25 50

publication icon
Boukhennoufa I (2023) A Novel Model to Generate Heterogeneous and Realistic Time-Series Data for Post-Stroke Rehabilitation Assessment. in IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society

publication icon
Boukhennoufa I (2022) Predicting the Internal Knee Abduction Impulse During Walking Using Deep Learning in Frontiers in Bioengineering and Biotechnology

publication icon
Boukhennoufa I (2022) Wearable sensors and machine learning in post-stroke rehabilitation assessment: A systematic review in Biomedical Signal Processing and Control

publication icon
Boukhennoufa I (2021) A comprehensive evaluation of state-of-the-art time-series deep learning models for activity-recognition in post-stroke rehabilitation assessment. in Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference

publication icon
Gao C (2023) Application Level Resource Scheduling for Deep Learning Acceleration on MPSoC in Journal of Signal Processing Systems

 
Description This work attempted the initial design on using adaptive software and hardware for accelerating deep learning neural networks, where the new design methods have been investigated to achieve better computing and energy efficiency. The current results of the award has shown potential ability of enabling flexible and adaptivity of deep learning hardware accelerators for highly diverse set of DNNs.
In 2023, we have attempted the new design using DFX technology, which allowed us to build a flexible machine learning platform that allows run-time reconfiguration of both software and hardware setting, based on the initial testing, the results were promised.
As the work is still on going, we are due to examine this work on newly available adaptive computing platform and wide range of benchmark suite.
Exploitation Route This work has attracted a large number of SMEs who are interested in using AI at edge. The proposed new method, can achieve high throughput on deep learning application, and it also can reduce energy consumption. Currently, we are working on local SME, to develop a smart solution to enhance vessel management and path management using Edge AI.
Sectors Digital/Communication/Information Technologies (including Software)

Electronics

 
Description Majority of the impact from this award is still under development, few local companies were very interested in using the research findings in their business work, they think that is very attractive to resolve their current technical challenges, and they believe it can open new bossiness and reduce operation cost of the current operations. We are working on two local SMEs for demonstrating the technology on their business, and received IUK support to help them build safer operations for business operations.
First Year Of Impact 2023
Sector Agriculture, Food and Drink,Digital/Communication/Information Technologies (including Software),Environment,Manufacturing, including Industrial Biotechology
Impact Types Societal

Economic

 
Description 5G4PHealth: Enhanced 5G-Powered Platform for Predictive Preventive Personalized and Participatory Healthcare
Amount £907,042 (GBP)
Funding ID 10093679 
Organisation Innovate UK 
Sector Public
Country United Kingdom
Start 03/2024 
End 03/2027
 
Description Morello-HAT: Morello High-Level API and Tooling
Amount £1,128,653 (GBP)
Funding ID EP/X015955/1 
Organisation Engineering and Physical Sciences Research Council (EPSRC) 
Sector Public
Country United Kingdom
Start 06/2022 
End 12/2024
 
Description Real-Time Federated Learning at the Wireless Edge via Algorithm-Hardware Co-Design
Amount £201,497 (GBP)
Funding ID EP/X019160/1 
Organisation Engineering and Physical Sciences Research Council (EPSRC) 
Sector Public
Country United Kingdom
Start 02/2023 
End 11/2024
 
Description University of Essex and Njord Offshore Ltd KTP 22_23 R3
Amount £180,000 (GBP)
Funding ID 10049632 
Organisation Innovate UK 
Sector Public
Country United Kingdom
Start 03/2023 
End 04/2025
 
Description Conference presentation (FPL23) 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact Presented the work in FPL23, had interesting conversation with professionals from world, helped us refined the project.
Year(s) Of Engagement Activity 2023
 
Description Conference presentation (NEWCAS23) 
Form Of Engagement Activity Participation in an activity, workshop or similar
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Professional Practitioners
Results and Impact presented our work in the conference, made new connections with both academics and industry link.
Year(s) Of Engagement Activity 2023
 
Description School Visit (Leicester) 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach National
Primary Audience Schools
Results and Impact about 100 students, academics attended the presentation, showing interests to the project.
Year(s) Of Engagement Activity 2023
 
Description School Visit (Shannxi Normal University) 
Form Of Engagement Activity A talk or presentation
Part Of Official Scheme? No
Geographic Reach International
Primary Audience Schools
Results and Impact 30 PG students and academics attended the seminar, and the team reported increased interest in this area, opened the joint papers and funding opportunities.
Year(s) Of Engagement Activity 2023