Abstract
In recent years, embedded platforms became attractive targets to deploy machine learning (ML) -empowered control and processing systems. However, securing a high performance executability to satisfy the hard real-time constraints of such systems on resource-limited platforms, such as FPGAs, is a challenging task. This paper introduces a methodology for deploying hardware-accelerated, adaptable convolutional neural networks (CNN) on embedded FPGA platforms. It enables automated synthesis of hardware IPs, instantiation of CNN architectures and mapping of the CNN layers processing to hardware IPs. This will result in iteration-free deployment reducing the expensive cost of HW synthesis, so that the acceleration hardware IPs are synthesized once and configured at runtime following the CNN architecture. To demonstrate the applicability and assess the performance of our deployment model, we implemented and deployed a segment-based acceleration of a image classification CNN on Xilinx ZYBO Z7 FPGA board. Among others, we have analyzed the computation performance, accuracy and trade-offs between CNN size, image segmentation size, resources utilization and scalability.
Original language | English |
---|---|
Title of host publication | 2023 IEEE International Conference on High Performance Computing & Communications, Data Science & Systems, Smart City & Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys) : Proceedings |
Editors | Jinjun Chen, Laurence T. Yang |
Number of pages | 8 |
Publisher | IEEE |
Publication date | 2023 |
Pages | 558-565 |
ISBN (Print) | 979-8-3503-3002-1 |
ISBN (Electronic) | 979-8-3503-3001-4 |
DOIs | |
Publication status | Published - 2023 |
Event | IEEE International Conference on High Performance Computing and Communications - Melbourne, Australia Duration: 13 Dec 2023 → 15 Dec 2023 Conference number: 25 |
Conference
Conference | IEEE International Conference on High Performance Computing and Communications |
---|---|
Number | 25 |
Country/Territory | Australia |
City | Melbourne |
Period | 13/12/2023 → 15/12/2023 |
Keywords
- Acceleration
- Adaptive Architectures
- Convolutional Neural Networks
- FPGAs
- Hardware synthesis