Attention-Based Feature Compression for CNN Inference Offloading in Edge Computing

Nan Li, Alexandros Iosifidis, Qi Zhang

Research output: Contribution to book/anthology/report/proceedingArticle in proceedingsResearchpeer-review

5 Citations (Scopus)

Abstract

This paper studies the computational offloading of CNN inference in device-edge co-inference systems. Inspired by the emerging paradigm semantic communication, we propose a novel autoencoder-based CNN architecture (AECNN), for effective feature extraction at end-device. We design a feature compression module based on the channel attention method in CNN, to compress the intermediate data by selecting the most important features. To further reduce communication overhead, we can use entropy encoding to remove the statistical redundancy in the compressed data. At the receiver, we design a lightweight decoder to reconstruct the intermediate data through learning from the received compressed data to improve accuracy. To fasten the convergence, we use a step-by-step approach to train the neural networks obtained based on ResNet-50 architecture. Experimental results show that AECNN can compress the intermediate data by more than 256 × with only about 4% accuracy loss, which outperforms the state-of-the-art work, BottleNet++. Compared to offloading inference task directly to edge server, AECNN can complete inference task earlier, in particular, under poor wireless channel condition, which highlights the effectiveness of AECNN in guaranteeing higher accuracy within time constraint.

Original languageEnglish
Title of host publicationICC 2023 - IEEE International Conference on Communications
EditorsMichele Zorzi, Meixia Tao, Walid Saad
Number of pages6
PublisherIEEE
Publication dateOct 2023
Pages967-972
ISBN (Electronic)978-1-5386-7462-8
DOIs
Publication statusPublished - Oct 2023
EventIEEE International Conference on Communications - La Nuvola Convention Center, Rome, Italy
Duration: 28 May 20231 Jun 2023
https://icc2023.ieee-icc.org/

Conference

ConferenceIEEE International Conference on Communications
LocationLa Nuvola Convention Center
Country/TerritoryItaly
CityRome
Period28/05/202301/06/2023
Internet address
SeriesI E E E International Conference on Communications
ISSN1938-1883

Keywords

  • CNN inference
  • Edge computing
  • feature compression
  • semantic communication
  • service reliability

Fingerprint

Dive into the research topics of 'Attention-Based Feature Compression for CNN Inference Offloading in Edge Computing'. Together they form a unique fingerprint.

Cite this