Abstract
Progressive deep image compression (DIC) with hybrid contexts is an under-investigated problem that aims to jointly maximize the utility of a compressed image for multiple contexts or tasks under variable rates. In this paper, we consider the contexts of image reconstruction and classification. We propose a DIC framework, called residual-enhanced mask-based progressive generative coding (RMPGC), designed for explicit control of the performance within the rate-distortion-classification-perception (RDCP) trade-off. Three independent mechanisms are introduced to yield a semantically structured latent representation that can support parameterized control of rate and context adaptation. Experimental results show that the proposed RMPGC outperforms a benchmark DIC scheme using the same generative adversarial nets (GANs) backbone in all six metrics related to classification, distortion, and perception. Moreover, RMPGC is a flexible framework that can be applied to different neural network backbones. Some typical implementations are given and shown to outperform the classic BPG codec and four state-of-the-art DIC schemes in classification and perception metrics, with a slight degradation in distortion metrics. Our proposal of a nonlinear-neural-coded and richly structured latent space makes the proposed DIC scheme well suited for image compression in wireless communications, multi-user broadcasting, and multi-tasking applications.
Original language | English |
---|---|
Pages (from-to) | 72-89 |
Number of pages | 18 |
Journal | IEEE Journal on Selected Areas in Communications |
Volume | 41 |
Issue number | 1 |
Early online date | 5 Dec 2022 |
DOIs | |
Publication status | Published - Jan 2023 |
Keywords
- Deep image compression
- generative image compression
- image semantics
- progressive compression
ASJC Scopus subject areas
- Computer Networks and Communications
- Electrical and Electronic Engineering