The RM operation equivalently converts a ResNet into a VGG-style plain network, which is easier to prune, and it helps RepVGG perform better when the network is deep.

Last update: Jan 04, 2023

Overview

RMNet: Equivalently Removing Residual Connection from Networks

This repository is the official implementation of "RMNet: Equivalently Removing Residual Connection from Networks".
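The idea of removing a residual connection without changing the output can be illustrated with a toy example. The sketch below is a simplification and not the repository's implementation: it assumes 1x1 "convolutions" applied to a single pixel, no batch normalization, and a non-negative block input (as produced by a preceding ReLU). The function names `res_block` and `rm_block` are illustrative only.

```python
# Simplified sketch: 1x1 "convolutions" acting on one pixel, no BatchNorm.
# All names here are illustrative, not taken from the repository.

def relu(v):
    return [max(0.0, a) for a in v]

def matvec(m, v):
    return [sum(w * a for w, a in zip(row, v)) for row in m]

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def res_block(w1, w2, x):
    """Residual block: ReLU(x + W2 @ ReLU(W1 @ x))."""
    branch = matvec(w2, relu(matvec(w1, x)))
    return relu([a + b for a, b in zip(x, branch)])

def rm_block(w1, w2, x):
    """Plain block with the same output, assuming x >= 0 elementwise."""
    n = len(x)
    # Reserve: stack identity rows under W1 so the hidden layer carries x.
    # Because x >= 0, the ReLU leaves these extra channels unchanged.
    w1_rm = w1 + identity(n)
    # Merge: append identity columns to W2 so the carried x is added back.
    w2_rm = [row + eye_row for row, eye_row in zip(w2, identity(n))]
    return relu(matvec(w2_rm, relu(matvec(w1_rm, x))))

w1 = [[0.5, -1.0], [2.0, 0.25], [-0.75, 1.5]]   # 2 -> 3 channels
w2 = [[1.0, -0.5, 0.25], [-2.0, 0.5, 1.0]]      # 3 -> 2 channels
x = [1.5, 0.5]                                   # non-negative, as after a ReLU

print(res_block(w1, w2, x))
print(rm_block(w1, w2, x))
```

Both calls print the same vector. The cost of the plain form is a wider hidden layer (3 + 2 channels here instead of 3), which is why a plain network obtained this way is a natural candidate for pruning.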

Requirements

To install requirements:

pip install torch
pip install torchvision

Training

To train the models in the paper, run this command:

python train.py -a rmrep_69 --dist-url 'tcp://127.0.0.1:23333' --dist-backend 'nccl' --multiprocessing-distributed --world-size 1 --rank 0 --workers 32 [imagenet-folder with train and val folders]
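The last argument of the command is a dataset folder that contains `train` and `val` subfolders. Before launching a long run, a quick check like the following can confirm that layout. This is a hypothetical helper, not part of the repository, and `check_imagenet_layout` is an illustrative name.

```python
from pathlib import Path

def check_imagenet_layout(root):
    """Return the required subfolders ("train", "val") that are missing under root."""
    root = Path(root)
    return [name for name in ("train", "val") if not (root / name).is_dir()]

# Example: check the current directory; replace "." with your dataset folder.
missing = check_imagenet_layout(".")
if missing:
    print("missing subfolders:", ", ".join(missing))
else:
    print("dataset layout looks fine")
```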

Our Pre-trained Models

You can download our pre-trained models, trained on ImageNet, from Google Drive or Baidu Cloud (extraction code: 0mto).

Evaluation

To evaluate our pre-trained models trained on ImageNet, run:

python train.py -a rmrep_69 -e checkpoint/rmrep_69.pth.tar [imagenet-folder with train and val folders]

Results

Our models achieve the following performance:

Help RepVGG achieve better performance even when the depth is large

Arch	Top-1 Accuracy(%)	Top-5 Accuracy(%)	Train FLOPs(G)	Test FLOPs(M)
RepVGG-21	72.508	90.840	2.4	2.1
RepVGG-21(RM 0.25)	72.590	90.924	2.1	2.1
RepVGG-37	74.408	91.900	4.4	4.0
RepVGG-37(RM 0.25)	74.478	91.892	3.9	4.0
RepVGG-69	74.526	92.182	8.6	7.7
RepVGG-69(RM 0.5)	75.088	92.144	6.5	7.7
RepVGG-133	70.912	89.788	16.8	15.1
RepVGG-133(RM 0.75)	74.560	92.000	10.6	15.1

Image Classification on ImageNet

Model name	Top-1 Accuracy(%)	Top-5 Accuracy(%)
RMNeXt 41x5_16	78.498	94.086
RMNeXt 50x5_32	79.076	94.444
RMNeXt 50x6_32	79.57	94.644
RMNeXt 101x6_16	80.07	94.918
RMNeXt 152x6_32	80.356	80.356
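For readers unfamiliar with the Top-1 and Top-5 columns in the tables above, the following is a minimal sketch of how top-k accuracy is commonly computed from per-class scores. It is a plain-Python illustration with made-up toy scores, not the evaluation code used by `train.py`.

```python
def topk_accuracy(scores, labels, k):
    """Percentage of samples whose true label is among the k highest scores."""
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted from highest to lowest score.
        ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        hits += label in ranked[:k]
    return 100.0 * hits / len(labels)

# Toy scores for 4 samples and 3 classes (illustrative values only).
scores = [
    [0.1, 0.7, 0.2],    # highest score: class 1
    [0.5, 0.3, 0.2],    # highest score: class 0
    [0.2, 0.3, 0.5],    # highest score: class 2
    [0.4, 0.35, 0.25],  # highest score: class 0
]
labels = [1, 1, 2, 2]

print(topk_accuracy(scores, labels, 1))  # 50.0
print(topk_accuracy(scores, labels, 2))  # 75.0
```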

Citation

If you find this code useful, please cite the following paper:

@misc{meng2021rmnet,
      title={RMNet: Equivalently Removing Residual Connection from Networks}, 
      author={Fanxu Meng and Hao Cheng and Jiaxin Zhuang and Ke Li and Xing Sun},
      year={2021},
      eprint={2111.00687},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Contributing

Our code is based on RepVGG.
