Unofficial PyTorch Implementation of "Augmenting Convolutional networks with attention-based aggregation"

Last update: Sep 09, 2022

Overview

Pytorch Implementation of Augmenting Convolutional networks with attention-based aggregation

This is the unofficial PyTorch Implementation of "Augmenting Convolutional networks with attention-based aggregation"

reference: https://arxiv.org/pdf/2112.13692.pdf

Prerequisites

PyTorch
PyTorch Lightning
timm
torchmetrics
torchvision
python3
CUDA

Comments

Due to computation limits, CIFAR100 dataset was used in contrast to ImageNet in the original paper.
Since the official code is not released yet, there may be differences in structures and hyperparameters.
- Most of the hidden dimensions were chosen based on guesswork.
MADGRAD was used instead of LAMB optimizer.
(I thought it would be inefficient to use LAMB for small batchsizes in my local machine)
LayerScale will be added soon

Citations

@misc{touvron2021augmenting,
      title={Augmenting Convolutional networks with attention-based aggregation}, 
      author={Hugo Touvron and Matthieu Cord and Alaaeldin El-Nouby and Piotr Bojanowski and Armand Joulin and Gabriel Synnaeve and Hervé Jégou},
      year={2021},
      eprint={2112.13692},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Unofficial PyTorch Implementation of "Augmenting Convolutional networks with attention-based aggregation"

Related tags

Overview

Pytorch Implementation of Augmenting Convolutional networks with attention-based aggregation

Prerequisites

Comments

Citations

Owner

DK

CCCL: Contrastive Cascade Graph Learning.

This repo contains research materials released by members of the Google Brain team in Tokyo.

An automated facial recognition based attendance system (desktop application)

A python implementation of Yolov5 to detect fire or smoke in the wild in Jetson Xavier nx and Jetson nano

Learning a mapping from images to psychological similarity spaces with neural networks.

Talk covering the features of skorch

UAV-Networks-Routing is a Python simulator for experimenting routing algorithms and mac protocols on unmanned aerial vehicle networks.

U-Net for GBM

PolyTrack: Tracking with Bounding Polygons

An Empirical Investigation of Model-to-Model Distribution Shifts in Trained Convolutional Filters

🔊 Audio and fastai v2

Codes for Causal Semantic Generative model (CSG), the model proposed in "Learning Causal Semantic Representation for Out-of-Distribution Prediction" (NeurIPS-21)

DTCN IJCAI - Sequential prediction learning framework and algorithm

Populating 3D Scenes by Learning Human-Scene Interaction https://posa.is.tue.mpg.de/

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

duralava is a neural network which can simulate a lava lamp in an infinite loop.

Official implementation of the Neurips 2021 paper Searching Parameterized AP Loss for Object Detection.

Zero-Shot Text-to-Image Generation VQGAN+CLIP Dockerized

An NLP library with Awesome pre-trained Transformer models and easy-to-use interface, supporting wide-range of NLP tasks from research to industrial applications.

95.47% on CIFAR10 with PyTorch