PyTorch implementation of NIPS 2017 paper Dynamic Routing Between Capsules

Last update: Dec 24, 2022

Overview

Dynamic Routing Between Capsules - PyTorch implementation

PyTorch implementation of NIPS 2017 paper Dynamic Routing Between Capsules from Sara Sabour, Nicholas Frosst and Geoffrey E. Hinton.

The hyperparameters and data augmentation strategy strictly follow the paper.

Requirements

Only PyTorch with torchvision is required (tested on pytorch 0.2.0 and 0.3.0). Jupyter and matplotlib is required to run the notebook with visualizations.

Usage

Train the model by running

python net.py

Optional arguments and default values:

  --batch-size N          input batch size for training (default: 128)
  --test-batch-size N     input batch size for testing (default: 1000)
  --epochs N              number of epochs to train (default: 250)
  --lr LR                 learning rate (default: 0.001)
  --no-cuda               disables CUDA training
  --seed S                random seed (default: 1)
  --log-interval N        how many batches to wait before logging training
                          status (default: 10)
  --routing_iterations    number of iterations for routing algorithm (default: 3)
  --with_reconstruction   should reconstruction layers be used

MNIST dataset will be downloaded automatically.

Results

The network trained with reconstruction and 3 routing iterations on MNIST dataset achieves 99.65% accuracy on test set. The test loss is still slightly decreasing, so the accuracy could probably be improved with more training and more careful learning rate schedule.

Visualizations

We can create visualizations of digit reconstructions from DigitCaps (e.g. Figure 3 in the paper)

We can also visualize what each dimension of digit capsule represents (Section 5.1, Figure 4 in the paper).

Below, each row shows the reconstruction when one of the 16 dimensions in the DigitCaps representation is tweaked by intervals of 0.05 in the range [−0.25, 0.25].

We can see what individual dimensions represent for digit 7, e.g. dim6 - stroke thickness, dim11 - digit width, dim 15 - vertical shift.

Visualization examples are provided in a jupyter notebook

PyTorch implementation of NIPS 2017 paper Dynamic Routing Between Capsules

Related tags

Overview

Dynamic Routing Between Capsules - PyTorch implementation

Requirements

Usage

Results

Visualizations

Owner

Adam Bielski

An implementation of Equivariant e2 convolutional kernals into a convolutional self attention network, applied to radio astronomy data.

A cross-lingual COVID-19 fake news dataset

A crash course in six episodes for software developers who want to become machine learning practitioners.

Federated Learning Based on Dynamic Regularization

Animation of solving the traveling salesman problem to optimality using mixed-integer programming and iteratively eliminating sub tours

DeepHawkeye is a library to detect unusual patterns in images using features from pretrained neural networks

Official implementation for "Symbolic Learning to Optimize: Towards Interpretability and Scalability"

A colab notebook for training Stylegan2-ada on colab, transfer learning onto your own dataset.

Stock-Prediction - prediction of stock market movements using sentiment analysis and deep learning.

CIFAR-10 Photo Classification

Pytorch implementation of Deep Recursive Residual Network for Super Resolution (DRRN)

Official repo for AutoInt: Automatic Integration for Fast Neural Volume Rendering in CVPR 2021

A pre-trained model with multi-exit transformer architecture.

Like Dirt-Samples, but cleaned up

Official code for "Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer. ICCV2021".

AI Flow is an open source framework that bridges big data and artificial intelligence.

SmoothGrad implementation in PyTorch

XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale

A new codebase for Group Activity Recognition. It contains codes for ICCV 2021 paper: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition and some other methods.

Minimal But Practical Image Classifier Pipline Using Pytorch, Finetune on ResNet18, Got 99% Accuracy on Own Small Datasets.