Dual Attention Network for Scene Segmentation (CVPR2019)

Last update: Dec 28, 2022

Related tags

Overview

Dual Attention Network for Scene Segmentation(CVPR2019)

Jun Fu, Jing Liu, Haijie Tian, Yong Li, Yongjun Bao, Zhiwei Fang,and Hanqing Lu

Introduction

We propose a Dual Attention Network (DANet) to adaptively integrate local features with their global dependencies based on the self-attention mechanism. And we achieve new state-of-the-art segmentation performance on three challenging scene segmentation datasets, i.e., Cityscapes, PASCAL Context and COCO Stuff-10k dataset.

Cityscapes testing set result

We train our DANet-101 with only fine annotated data and submit our test results to the official evaluation server.

Updates

2020/9：Renew the code, which supports Pytorch 1.4.0 or later!

2020/8：The new TNNLS version DRANet achieves 82.9% on Cityscapes test set (submit the result on August, 2019), which is a new state-of-the-arts performance with only using fine annotated dataset and Resnet-101. The code will be released in DRANet.

2020/7：DANet is supported on MMSegmentation, in which DANet achieves 80.47% with single scale testing and 82.02% with multi-scale testing on Cityscapes val set.

2018/9：DANet released. The trained model with ResNet101 achieves 81.5% on Cityscapes test set.

Usage

Install pytorch
- The code is tested on python3.6 and torch 1.4.0.
- The code is modified from PyTorch-Encoding.

Clone the resposity

git clone https://github.com/junfu1115/DANet.git 
cd DANet 
python setup.py install

Dataset
- Download the Cityscapes dataset and convert the dataset to 19 categories.
- Please put dataset in folder ./datasets
Evaluation for DANet
- Download trained model DANet101 and put it in folder ./experiments/segmentation/models/
- cd ./experiments/segmentation/
- For single scale testing, please run:
- ```
CUDA_VISIBLE_DEVICES=0,1,2,3 python test.py --dataset citys --model danet --backbone resnet101 --resume  models/DANet101.pth.tar --eval --base-size 2048 --crop-size 768 --workers 1 --multi-grid --multi-dilation 4 8 16 --os 8 --aux --no-deepstem
```
- Evaluation Result
  
  The expected scores will show as follows: DANet101 on cityscapes val set (mIoU/pAcc): 79.93/95.97(ss)
Evaluation for DRANet
- Download trained model DRANet101 and put it in folder ./experiments/segmentation/models/
- Evaluation code is in folder ./experiments/segmentation/
- cd ./experiments/segmentation/
- For single scale testing, please run:
- ```
CUDA_VISIBLE_DEVICES=0,1,2,3 python test.py --dataset citys --model dran --backbone resnet101 --resume  models/dran101.pth.tar --eval --base-size 2048 --crop-size 768 --workers 1 --multi-grid --multi-dilation 4 8 16 --os 8 --aux
```
- Evaluation Result
  
  The expected scores will show as follows: DRANet101 on cityscapes val set (mIoU/pAcc): 81.63/96.62 (ss)

Citation

if you find DANet and DRANet useful in your research, please consider citing:

@article{fu2020scene,
  title={Scene Segmentation With Dual Relation-Aware Attention Network},
  author={Fu, Jun and Liu, Jing and Jiang, Jie and Li, Yong and Bao, Yongjun and Lu, Hanqing},
  journal={IEEE Transactions on Neural Networks and Learning Systems},
  year={2020},
  publisher={IEEE}
}

@inproceedings{fu2019dual,
  title={Dual attention network for scene segmentation},
  author={Fu, Jun and Liu, Jing and Tian, Haijie and Li, Yong and Bao, Yongjun and Fang, Zhiwei and Lu, Hanqing},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  pages={3146--3154},
  year={2019}
}

Acknowledgement

Thanks PyTorch-Encoding, especially the Synchronized BN!

Dual Attention Network for Scene Segmentation (CVPR2019)

Related tags

Overview

Dual Attention Network for Scene Segmentation(CVPR2019)

Introduction

Cityscapes testing set result

Updates

Usage

Citation

Acknowledgement

Owner

Jun Fu

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder,

Remote sensing change detection using PaddlePaddle

Codes for AAAI 2022 paper: Context-aware Health Event Prediction via Transition Functions on Dynamic Disease Graphs

3D2Unet: 3D Deformable Unet for Low-Light Video Enhancement (PRCV2021)

CRISCE: Automatically Generating Critical Driving Scenarios From Car Accident Sketches

Arquitetura e Desenho de Software.

An improvement of FasterGICP: Acceptance-rejection Sampling based 3D Lidar Odometry

SweiNet is an uncertainty-quantifying shear wave speed (SWS) estimator for ultrasound shear wave elasticity (SWE) imaging.

A PyTorch re-implementation of the paper 'Exploring Simple Siamese Representation Learning'. Reproduced the 67.8% Top1 Acc on ImageNet.

Code for Deterministic Neural Networks with Appropriate Inductive Biases Capture Epistemic and Aleatoric Uncertainty

Taichi Course Homework Template

Official code release for ICCV 2021 paper SNARF: Differentiable Forward Skinning for Animating Non-rigid Neural Implicit Shapes.

YOLOPのPythonでのONNX推論サンプル

The code repository for "PyCIL: A Python Toolbox for Class-Incremental Learning" in PyTorch.

NATS-Bench: Benchmarking NAS Algorithms for Architecture Topology and Size

A2LP for short, ECCV2020 spotlight, Investigating SSL principles for UDA problems

Reporting and Visualization for Hazardous Events

CoMoGAN: continuous model-guided image-to-image translation. CVPR 2021 oral.

git《Learning Pairwise Inter-Plane Relations for Piecewise Planar Reconstruction》(ECCV 2020) GitHub:

Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning

Dual Attention Network for Scene Segmentation (CVPR2019)

Related tags

Overview

Dual Attention Network for Scene Segmentation(CVPR2019)

Introduction

Cityscapes testing set result

Updates

Usage

Citation

Acknowledgement

Owner

Jun Fu

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder,

Remote sensing change detection using PaddlePaddle

Codes for AAAI 2022 paper: Context-aware Health Event Prediction via Transition Functions on Dynamic Disease Graphs

3D2Unet: 3D Deformable Unet for Low-Light Video Enhancement (PRCV2021)

CRISCE: Automatically Generating Critical Driving Scenarios From Car Accident Sketches

Arquitetura e Desenho de Software.

An improvement of FasterGICP: Acceptance-rejection Sampling based 3D Lidar Odometry

SweiNet is an uncertainty-quantifying shear wave speed (SWS) estimator for ultrasound shear wave elasticity (SWE) imaging.

A PyTorch re-implementation of the paper 'Exploring Simple Siamese Representation Learning'. Reproduced the 67.8% Top1 Acc on ImageNet.

Code for Deterministic Neural Networks with Appropriate Inductive Biases Capture Epistemic and Aleatoric Uncertainty

Taichi Course Homework Template

Official code release for ICCV 2021 paper SNARF: Differentiable Forward Skinning for Animating Non-rigid Neural Implicit Shapes.

YOLOPのPythonでのONNX推論サンプル

The code repository for "PyCIL: A Python Toolbox for Class-Incremental Learning" in PyTorch.

NATS-Bench: Benchmarking NAS Algorithms for Architecture Topology and Size

A2LP for short, ECCV2020 spotlight, Investigating SSL principles for UDA problems

Reporting and Visualization for Hazardous Events

CoMoGAN: continuous model-guided image-to-image translation. CVPR 2021 oral.

git《Learning Pairwise Inter-Plane Relations for Piecewise Planar Reconstruction》(ECCV 2020) GitHub:

Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder,