EDGE - Adaptive Deep Learning Hardware for Embedded Platforms

Lead Research Organisation: University of Essex
Department Name: Computer Sci and Electronic Engineering

Abstract

Deep learning (DL) is the key technique in modern artificial intelligence (AI), which has provided state-of-the-art accuracy on many machine-learning based applications. Today, although most of the computational loads of DL systems are still spent running neural networks in data centres, the ubiquity of smartphones, and the upcoming availability of self-contained wearable devices for augmented reality (AR), virtual reality (VR) and autonomous robot systems are placing heavy demands on DL-inference hardware with high energy and computing efficiencies along with rapid development of DL techniques. Recently, we have witnessed a distinct evolution in the types of DL architecture, with more sophisticated network architectures proposed to improve edge AI inference. This includes dynamic network architectures that change with each new input in a data-dependent way, where inputs and internal states are not fixed. Such new architectural concepts in DL are likely to affect the type of hardware architectures that will be required to deliver such capabilities in the future. This project precisely addresses this challenge and proposes to design a flexible hardware architecture that enables adaptive support for a variety of DL algorithms on embedded devices. Primarily, to produce lower cost, lower power and higher processing efficiency DL-inference hardware that can be configured adaptably for dedicated application specifications and operating environments, this will require radical innovation in the optimisation of both the software and the hardware of current DL techniques.

This work aims to perform fundamental research, development and practical demonstrator to enable general support for a variety of DL techniques on embedded edge devices with limited resource and latency budgets. Primarily, this requires radical innovation on the current DL architectures in terms of computing architecture, memory hierarchy and resource utilisation, as well as system latency and throughput: it is particularly important for the modern DL systems that the inference processes are dynamic, such as, the DL inference maybe input-dependent and resource-dependent. The proposal therefore seeks the following three thrusts: First, to build upon the existing work of the PI in optimising machine-learning models for resource-constrained embedded devices, towards achieving the goal that the network model could be dynamically optimised as needed through hardware-aware approximation techniques. Second, with newly-developed adaptive compute acceleration technology in programmable memory hierarchy and adaptive processing hardware, to seek a new ambitious direction to develop a set of context-aware hardware architectures to work closely with the approximation algorithms that can fully utilise the true hardware capabilities. Unlike traditional optimisation techniques for DL hardware inference engines, the proposed work will explore both software and hardware programmability of adaptive compute acceleration technology, in order to maximise the optimisation results for the target application scenarios. Third, this project will work closely with our industry and project partners to produce a practical demonstrator to showcase the effectiveness of the proposed DL framework versus traditional approaches, particularly, evaluating the effectiveness of the framework in real-world mission-critical applications.

Publications

10 25 50
publication icon
Boukhennoufa I (2022) Wearable sensors and machine learning in post-stroke rehabilitation assessment: A systematic review in Biomedical Signal Processing and Control

publication icon
Boukhennoufa I (2023) A Novel Model to Generate Heterogeneous and Realistic Time-Series Data for Post-Stroke Rehabilitation Assessment. in IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society

publication icon
Gao C (2023) Application Level Resource Scheduling for Deep Learning Acceleration on MPSoC in Journal of Signal Processing Systems

publication icon
Manning K (2022) Image analysis and machine learning-based malaria assessment system in Digital Communications and Networks

publication icon
Saha S (2022) RASA: Reliability-Aware Scheduling Approach for FPGA-Based Resilient Embedded Systems in Extreme Environments in IEEE Transactions on Systems, Man, and Cybernetics: Systems

publication icon
Saha S (2022) ACCURATE: Accuracy Maximization for Real-Time Multicore Systems With Energy-Efficient Way-Sharing Caches in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

 
Description This work attempted the initial design on using adaptive software and hardware for accelerating deep learning neural networks, where the new design methods have been investigated to achieve better computing and energy efficiency. The current results of the award has shown potential ability of enabling flexible and adaptivity of deep learning hardware accelerators for highly diverse set of DNNs. As the work is still on going, we are due to examine this work on newly available adaptive computing platform.
Exploitation Route This work has attracted a large number of SMEs who are interested in using AI at edge. The proposed new method, can achieve high throughput on deep learning application, and it also can reduce energy consumption. Currently, we are working on local SME, to develop a smart solution to enhance vessel management and path management using Edge AI.
Sectors Digital/Communication/Information Technologies (including Software),Electronics

 
Description Majority of the impact from this award is still under development, few local companies were very interested in using the research findings in their business work, they think that is very attractive to resolve their current technical challenges, and they believe it can open new bossiness and reduce operation cost of the current operations.
Sector Agriculture, Food and Drink,Digital/Communication/Information Technologies (including Software),Environment,Manufacturing, including Industrial Biotechology
Impact Types Societal,Economic

 
Description Morello-HAT: Morello High-Level API and Tooling
Amount £1,128,653 (GBP)
Funding ID EP/X015955/1 
Organisation Engineering and Physical Sciences Research Council (EPSRC) 
Sector Public
Country United Kingdom
Start 07/2022 
End 12/2024
 
Description Real-Time Federated Learning at the Wireless Edge via Algorithm-Hardware Co-Design
Amount £201,497 (GBP)
Funding ID EP/X019160/1 
Organisation Engineering and Physical Sciences Research Council (EPSRC) 
Sector Public
Country United Kingdom
Start 02/2023 
End 11/2024
 
Description University of Essex and Njord Offshore Ltd KTP 22_23 R3
Amount £180,000 (GBP)
Funding ID 10049632 
Organisation Innovate UK 
Sector Public
Country United Kingdom
Start 04/2023 
End 04/2025