Dynamic Semantic Compression for CNN Inference in Multi-Access Edge Computing: A Graph Reinforcement Learning-Based Autoencoder

Nan Li*, Alexandros Iosifidis, Qi Zhang

*Corresponding author for this work

Research output: Contribution to journal/Conference contribution in journal/Contribution to newspaperJournal articleResearchpeer-review

Abstract

This paper studies the computational offloading of CNN inference in dynamic multi-access edge computing (MEC) networks. To address the uncertainties in communication time and edge servers’ available capacity, we propose a novel semantic compression method, autoencoder-based CNN architecture (AECNN), for effective semantic extraction and compression in partial offloading. In the semantic encoder, we introduce a feature compression module based on the channel attention mechanism in CNNs, to compress intermediate data by selecting the most informative features. Additionally, to further reduce communication overhead, we leverage entropy encoding to remove the statistical redundancy in the compressed data. In the semantic decoder, we design a lightweight decoder to reconstruct the intermediate data through learning from the received compressed data to improve accuracy. To effectively trade-off communication, computation, and inference accuracy, we design a reward function and formulate the offloading problem of CNN inference as a maximization problem with the goal of maximizing the average inference accuracy and throughput over the long term. To address this maximization problem, we propose a graph reinforcement learning-based AECNN (GRL-AECNN) method, which outperforms existing works DROO-AECNN, GRL-BottleNet++ and GRL-DeepJSCC under different dynamic scenarios. This highlights the advantages of GRL-AECNN in offloading decision-making for CNN inference tasks in dynamic MEC.

Original languageEnglish
JournalIEEE Transactions on Wireless Communications
Volume24
Issue3
Pages (from-to)2157-2172
Number of pages16
ISSN1536-1276
DOIs
Publication statusPublished - Mar 2025

Keywords

  • CNN inference
  • edge computing
  • feature compression
  • graph reinforcement learning
  • semantic communication
  • service reliability

Fingerprint

Dive into the research topics of 'Dynamic Semantic Compression for CNN Inference in Multi-Access Edge Computing: A Graph Reinforcement Learning-Based Autoencoder'. Together they form a unique fingerprint.

Cite this