Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms

Last update: Dec 31, 2021

LESA

Introduction

This repository contains the official implementation of Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms. The code for image classification and object detection is based on axial-deeplab and mmdetection.

Citing LESA

If you find LESA helpful in your project, please consider citing our paper.

@article{yang2021locally,
  title={Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms},
  author={Yang, Chenglin and Qiao, Siyuan and Kortylewski, Adam and Yuille, Alan},
  journal={arXiv preprint arXiv:2107.05637},
  year={2021}
}

Main Results on ImageNet

Please refer to LESA_classification for details.

Method	Model	Top-1 Acc. (%)	Top-5 Acc. (%)
LESA_ResNet50	Download	79.55	94.79
LESA_WRN50	Download	80.18	95.07
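
For readers unfamiliar with the two metrics in the table above, top-k accuracy can be sketched as follows. This is a minimal illustration in plain Python, not the evaluation code used in this repository; the function name `topk_accuracy` and the toy scores are assumptions made for the example. The table reports k=1 and k=5 on ImageNet, while the toy data has only three classes and therefore uses k=1 and k=2.

```python
from typing import Sequence


def topk_accuracy(
    scores: Sequence[Sequence[float]], labels: Sequence[int], k: int = 1
) -> float:
    """Percentage of samples whose true label is among the k highest-scoring classes."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not scores:
        raise ValueError("at least one sample is required")
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted by descending score; keep the k best.
        topk = sorted(range(len(row)), key=lambda c: row[c], reverse=True)[:k]
        hits += label in topk
    return 100.0 * hits / len(labels)


# Hypothetical scores for 4 samples over 3 classes (illustration only).
toy_scores = [
    [0.10, 0.70, 0.20],
    [0.50, 0.30, 0.20],
    [0.20, 0.30, 0.50],
    [0.40, 0.35, 0.25],
]
toy_labels = [1, 2, 2, 1]

top1 = topk_accuracy(toy_scores, toy_labels, k=1)
top2 = topk_accuracy(toy_scores, toy_labels, k=2)
print(f"top-1: {top1:.2f}  top-2: {top2:.2f}")
```

Top-k accuracy never decreases as k grows, which is why the Top-5 column is always at least as high as the Top-1 column.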

Main Results on COCO test-dev

Please refer to LESA_detection for details.

Method	Backbone	Pretrained	Model	Box AP	Mask AP
Mask-RCNN	LESA_ResNet50	Download	Download	44.2	39.6
HTC	LESA_WRN50	Download	Download	50.5	44.4

Credits

This project is based on axial-deeplab and mmdetection.

The relative position embedding is based on bottleneck-transformer-pytorch.

ResNet is based on pytorch/vision. Classification helper functions are based on pytorch-classification.

Owner

Chenglin Yang
