Revisiting Dynamic Convolution via Matrix Decomposition (ICLR 2021)

A PyTorch implementation of dynamic convolution decomposition (DCD). If you use this code in your research, please consider citing:

@article{li2021revisiting,
  title={Revisiting Dynamic Convolution via Matrix Decomposition},
  author={Li, Yunsheng and Chen, Yinpeng and Dai, Xiyang and Liu, Mengchen and Chen, Dongdong and Yu, Ye and Yuan, Lu and Liu, Zicheng and Chen, Mei and Vasconcelos, Nuno},
  journal={arXiv preprint arXiv:2103.08756},
  year={2021}
}

Requirements

Hardware: PC with an NVIDIA Titan GPU.
Software: Ubuntu 16.04, CUDA 10.0, Anaconda3, PyTorch 1.0.0
Python packages:
- conda install --quiet --yes pytorch==1.0.0 torchvision==0.2.1 cuda100 -c pytorch
- pip install tensorboard tensorboardX pillow==6.1

Evaluate DCD on ImageNet

Pre-trained models are available for download: ResNet-50 and MobileNetV2x1.0.

DCD for ResNet-50

python main.py -a resnet50_dcd -d /path/to/imagenet/ -b 256 -c /path/to/output -j 48 --input-size 224 --dropout 0.1 --weight /path/to/resnet50_dcd.pth.tar --evaluate

DCD for MobileNetV2x1.0

python main.py -a mobilenetv2_dcd -d /path/to/imagenet/ -b 512 -c /path/to/output --width-mult 1.0 -j 48 --input-size 224 --dropout 0.1 --fc-squeeze 16 --weight mv2x1.0_dcd.pth.tar --evaluate
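For readers checking the evaluation output, the sketch below shows how top-k accuracy is conventionally computed on ImageNet. It assumes the evaluation reports top-1 and top-5 accuracy, which this README does not state; `topk_accuracy` is a hypothetical helper written for illustration and is not taken from `main.py`.

```python
def topk_accuracy(scores, labels, k=1):
    """Percentage of samples whose true label is among the k highest scores.

    scores: list of per-class score lists, one per sample.
    labels: list of integer class indices, one per sample.
    """
    correct = 0
    for row, label in zip(scores, labels):
        # Indices of the k largest scores for this sample.
        topk = sorted(range(len(row)), key=lambda i: row[i], reverse=True)[:k]
        if label in topk:
            correct += 1
    return 100.0 * correct / len(labels)


# Toy data with 3 samples and 3 classes (illustrative values only).
scores = [[0.1, 0.7, 0.2], [0.5, 0.3, 0.2], [0.2, 0.3, 0.5]]
labels = [1, 1, 0]
print(topk_accuracy(scores, labels, k=1))
print(topk_accuracy(scores, labels, k=2))
```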

Train DCD on ImageNet

DCD for ResNet-50

CUDA_VISIBLE_DEVICES=0,1,2,3 python main.py -a resnet50_dcd -d /path/to/imagenet/ -b 256 --epochs 120 --lr-decay schedule --lr 0.1 --wd 1e-4 -c /path/to/output -j 48 --input-size 224 --label-smoothing 0.1 --dropout 0.1 --mixup 0.2
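The `--label-smoothing 0.1` and `--mixup 0.2` flags above correspond to two common regularizers. The following is a minimal pure-Python sketch of what they usually mean, not the repository's implementation: it assumes the uniform label-smoothing variant (weight `eps` spread over all classes) and mixup with a mixing weight drawn from Beta(alpha, alpha). The helper names `smooth_one_hot` and `mixup` are hypothetical.

```python
import random


def smooth_one_hot(label, num_classes, eps=0.1):
    """Soft target: eps / num_classes on every class, plus (1 - eps) on the true class."""
    off_value = eps / num_classes
    target = [off_value] * num_classes
    target[label] += 1.0 - eps
    return target


def mixup(x_a, y_a, x_b, y_b, alpha=0.2, rng=random):
    """Blend two samples and their soft targets with lam ~ Beta(alpha, alpha)."""
    lam = rng.betavariate(alpha, alpha)
    x = [lam * a + (1.0 - lam) * b for a, b in zip(x_a, x_b)]
    y = [lam * a + (1.0 - lam) * b for a, b in zip(y_a, y_b)]
    return x, y, lam


# Two toy "images" (flattened to 2 values) from a 4-class problem.
y_a = smooth_one_hot(0, num_classes=4, eps=0.1)
y_b = smooth_one_hot(3, num_classes=4, eps=0.1)
x, y, lam = mixup([1.0, 0.0], y_a, [0.0, 1.0], y_b, alpha=0.2, rng=random.Random(0))
print(lam, x, y)
```

With a small alpha such as 0.2, the Beta distribution puts most of its mass near 0 and 1, so most mixed samples stay close to one of the two originals.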

DCD for MobileNetV2x1.0

CUDA_VISIBLE_DEVICES=0,1,2,3 python main.py -a mobilenetv2_dcd -d /path/to/imagenet/ --epochs 300 --lr-decay cos --lr 0.1 --wd 2e-5 -c /path/to/output --width-mult 1.0 -j 48 --input-size 224 --label-smoothing 0.1 --dropout 0.2 -b 512 --mixup 0.2 --fc-squeeze 16
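As an illustration of `--lr-decay cos` with `--lr 0.1` and `--epochs 300`, here is a sketch of a standard half-cosine learning-rate schedule. It assumes per-epoch updates, no warmup, and decay to zero; the actual schedule in `main.py` may differ, and `cosine_lr` is a hypothetical name.

```python
import math


def cosine_lr(base_lr, epoch, total_epochs):
    """Half-cosine decay: base_lr at epoch 0, approaching 0 at total_epochs."""
    return 0.5 * base_lr * (1.0 + math.cos(math.pi * epoch / total_epochs))


# Learning rate at a few points of a 300-epoch run starting from 0.1.
for epoch in (0, 75, 150, 225, 299):
    print(epoch, cosine_lr(0.1, epoch, 300))
```

The rate falls slowly at first, fastest around the midpoint (where it equals half the base rate), and flattens out again near the end of training.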

This repository is the official code for dynamic convolution decomposition (DCD).
