PyTorch Implementation of the SuRP algorithm by the authors of the AISTATS 2022 paper "An Information-Theoretic Justification for Model Pruning"

Last update: Dec 08, 2022

Overview

An Information-Theoretic Justification for Model Pruning

PyTorch Implementation of the SuRP algorithm by the authors of the AISTATS 2022 paper "An Information-Theoretic Justification for Model Pruning".

An Information-Theoretic Justification for Model Pruning
Berivan Isik, Tsachy Weissman, Albert No
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022.

1) Train the baseline model:

To train the baseline model to be compressed, set trainer=Classifier. To try this for ResNet-20, run:

python3 main.py --trainer=Classifier --config=cifar_resnet20/config.yaml

To test the baseline model, run:

python3 main.py --trainer=Classifier --config=cifar_resnet20/config.yaml --test

2) One-shot (non-iterative) reconstruction with SuRP:

To compress the baseline model with SuRP non-iteratively, change the experiment id exp_id of the target model and target sparsity ratio sparsity: [sparsity of the input model, target sparsity] in the recon.yaml file accordingly. Then, run:

python3 main.py --trainer=Reconstruction --config=cifar_resnet20/recon.yaml

3) Iterative reconstruction with SuRP:

To compress the baseline model with SuRP iteratively, apply SuRP several times following a sparsity schedule. Each time, modify exp_id and sparsity: [sparsity of the input model, target sparsity], accordingly. To retrain the sparse models before applying SuRP again, set retrain: True. And run:

python3 main.py --trainer=ReconFromFile --config=cifar_resnet20/recon.yaml

References

If you find this work useful in your research, please consider citing our paper:

@article{isik2021rate,
  title={Rate-Distortion Theoretic Model Compression: Successive Refinement for Pruning},
  author={Isik, Berivan and No, Albert and Weissman, Tsachy},
  journal={arXiv preprint arXiv:2102.08329},
  year={2021}
}

PyTorch Implementation of the SuRP algorithm by the authors of the AISTATS 2022 paper "An Information-Theoretic Justification for Model Pruning"

Related tags

Overview

An Information-Theoretic Justification for Model Pruning

1) Train the baseline model:

2) One-shot (non-iterative) reconstruction with SuRP:

3) Iterative reconstruction with SuRP:

References

Owner

Berivan Isik

CAST: Character labeling in Animation using Self-supervision by Tracking

Source Code for ICSE 2022 Paper - ``Can We Achieve Fairness Using Semi-Supervised Learning?''

This repository contains code released by Google Research.

NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥

Codebase for the solution that won first place and was awarded the most human-like agent in the 2021 NeurIPS Competition MineRL BASALT Challenge.

FluxTraining.jl gives you an endlessly extensible training loop for deep learning

Neural Magic Eye: Learning to See and Understand the Scene Behind an Autostereogram, arXiv:2012.15692.

Code for the paper “The Peril of Popular Deep Learning Uncertainty Estimation Methods”

Official Implementation of SWAGAN: A Style-based Wavelet-driven Generative Model

Sudoku solver - A sudoku solver with python

View model summaries in PyTorch!

ML model to classify between cats and dogs

Solving Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge

Official code for "Decoupling Zero-Shot Semantic Segmentation"

Storchastic is a PyTorch library for stochastic gradient estimation in Deep Learning

(ICCV'21) Official PyTorch implementation of Relational Embedding for Few-Shot Classification

On Out-of-distribution Detection with Energy-based Models

Official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR)

On Effective Scheduling of Model-based Reinforcement Learning

The Implicit Bias of Gradient Descent on Generalized Gated Linear Networks