Accuracy Aligned. Concise Implementation of Swin Transformer

Last update: Dec 16, 2022

Related tags

Overview

Accuracy Aligned. Concise Implementation of Swin Transformer

This repository contains the implementation of Swin Transformer, and the training codes on ImageNet datasets. We have aligned the output of our network with the official one, that is, using the same input and random seed, the output is identical to the official one.

Our implementation is highly based on einops, which makes the implementation more concise, and easy to be understand. (Intuitively, we use only 200 lines of codes compared with ~600 lines of official codes.) Besides, our implementation keeps the same training speed.

Model	Epoch	[email protected](our)	[email protected](our)	[email protected](official)	[email protected](official)	pretrained model
Swin-T	300	81.3	95.5	81.2	95.5	here

Usage

Train on ImageNet:

Train Swin-T

python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py --model Swin_T \
--batch-size 128 --drop-path 0.2 --data-path ~/ILSVRC2012/ --output_dir /data/SwinTransformer_exp/SwinT/

Train Swin-S

python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py --model Swin_S \
--batch-size 128 --drop-path 0.3 --data-path ~/ILSVRC2012/ --output_dir /data/SwinTransformer_exp/SwinS/

Train Swin-B

python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py --model Swin_B \
--batch-size 128 --drop-path 0.5 --data-path ~/ILSVRC2012/ --output_dir /data/SwinTransformer_exp/SwinB/

Reference

The training process involves many training and augmentation tricks, such as stochastic depth, mixup, cutmix and random erasing. I borrow large from Deit (https://github.com/facebookresearch/deit).

Citations

@misc{liu2021swin,
      title={Swin Transformer: Hierarchical Vision Transformer using Shifted Windows}, 
      author={Ze Liu and Yutong Lin and Yue Cao and Han Hu and Yixuan Wei and Zheng Zhang and Stephen Lin and Baining Guo},
      year={2021},
      eprint={2103.14030},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Accuracy Aligned. Concise Implementation of Swin Transformer

Related tags

Overview

Accuracy Aligned. Concise Implementation of Swin Transformer

Usage

Reference

Citations

Owner

FengWang

Style-based Neural Drum Synthesis with GAN inversion

Prior-Guided Multi-View 3D Head Reconstruction

Codebase for the paper titled "Continual learning with local module selection"

Pre-trained Deep Learning models and demos (high quality and extremely fast)

IEEE-CIS Technical Challenge on Predict+Optimize for Renewable Energy Scheduling

The implementation of PEMP in paper "Prior-Enhanced Few-Shot Segmentation with Meta-Prototypes"

CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training

Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback

Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)

smc.covid is an R package related to the paper A sequential Monte Carlo approach to estimate a time varying reproduction number in infectious disease models: the COVID-19 case by Storvik et al

An abstraction layer for mathematical optimization solvers.

Gated-Shape CNN for Semantic Segmentation (ICCV 2019)

repro_eval is a collection of measures to evaluate the reproducibility/replicability of system-oriented IR experiments

Linear algebra python - Number of operations and problems in Linear Algebra and Numerical Linear Algebra

Answering Open-Domain Questions of Varying Reasoning Steps from Text

BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment

PyTorch Connectomics: segmentation toolbox for EM connectomics

CUDA Python Low-level Bindings

This project is the PyTorch implementation of our CVPR 2022 paper:

Source code for our paper "Learning to Break Deep Perceptual Hashing: The Use Case NeuralHash"