Learned image compression

Last update: Dec 04, 2022

Overview

Pytorch code of our recent work A Unified End-to-End Framework for Efficient Deep Image Compression.

We first release the code for Variational image compression with a scale hyperprior, we will update our code to our full implementaion of our paper.

Prerequisites

You should install the libraries of this repo.

pip install -r requirements.txt

Data Preparation

We need to first prepare the training and validation data. The trainging data is from flicker.com. You can obtain the training data according to description of CompressionData.

The validation data is the popular kodak dataset.

bash data/download_kodak.sh

Training

For high bitrate (4096, 6144, 8192), the out_channel_N is 192 and the out_channel_M is 320 in 'config_high.json'. For low bitrate (256, 512, 1024, 2048), the out_channel_N is 128 and the out_channel_M is 192 in 'config_low.json'.

Details

PSNR experiments.

For high bitrate of 8192, we first train from scratch as follows.

CUDA_VISIBLE_DEVICES=0 python train.py --config examples/example/config_high.json -n baseline_8192 --train flicker_path --val kodak_path

For other high bitrate (4096, 6144), we use the converged model of 8192 as pretrain model and set the learning rate as 1e-5. The training iterations are set as 500000.

The low bitrate (256, 512, 1024, 2048) training process follows the same strategy.

MS-SSIM experiments

You should change the distorsion loss to (1-MS_SSIM), and fine-tune the pretrained model optimized by PSNR to accelerate the training process. You can find more details in our released paper. The training strategy is similar.

If your find our code is helpful for your research, please cite our paper. Besides, this code is only for research.

@article{liu2020unified,
  title={A Unified End-to-End Framework for Efficient Deep Image Compression},
  author={Liu, Jiaheng and Lu, Guo and Hu, Zhihao and Xu, Dong},
  journal={arXiv preprint arXiv:2002.03370},
  year={2020}
}

Learned image compression

Related tags

Overview

Overview

Content

Prerequisites

Data Preparation

Training

Details

PSNR experiments.

MS-SSIM experiments

Owner

Jiaheng Liu

📚 Papermill is a tool for parameterizing, executing, and analyzing Jupyter Notebooks.

Implementation of StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation in PyTorch

[CVPRW 2022] Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network

The official implementation code of "PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction."

An investigation project for SISR.

[WACV 2020] Reducing Footskate in Human Motion Reconstruction with Ground Contact Constraints

The open source code of SA-UNet: Spatial Attention U-Net for Retinal Vessel Segmentation.

IJON is an annotation mechanism that analysts can use to guide fuzzers such as AFL.

Constructing interpretable quadratic accuracy predictors to serve as an objective function for an IQCQP problem that represents NAS under latency constraints and solve it with efficient algorithms.

The 3rd place solution for competition

E2C implementation in PyTorch

A Temporal Extension Library for PyTorch Geometric

KDD CUP 2020 Automatic Graph Representation Learning: 1st Place Solution

Autonomous racing with the Anki Overdrive

Python implementation of NARS (Non-Axiomatic-Reasoning-System)

Learning to Draw: Emergent Communication through Sketching

Audio Domain Adaptation for Acoustic Scene Classification using Disentanglement Learning

Back to Event Basics: SSL of Image Reconstruction for Event Cameras

robomimic: A Modular Framework for Robot Learning from Demonstration

A curated list of awesome Model-Based RL resources