Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Last update: Jan 02, 2023

Related tags

Deep Learning Mask2Former

Overview

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation

Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar

[arXiv] [Project] [BibTeX]

Features

A single architecture for panoptic, instance and semantic segmentation.
Support major segmentation datasets: ADE20K, Cityscapes, COCO, Mapillary Vistas.

Installation

See installation instructions.

Getting Started

See Preparing Datasets for Mask2Former.

See Getting Started with Mask2Former.

Advanced usage

See Advanced Usage of Mask2Former.

Model Zoo and Baselines

We provide a large set of baseline results and trained models available for download in the Mask2Former Model Zoo.

License

Shield:

The majority of Mask2Former is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

However portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license, Deformable-DETR is licensed under the Apache-2.0 License.

Citing Mask2Former

If you use Mask2Former in your research or wish to refer to the baseline results published in the Model Zoo, please use the following BibTeX entry.

@article{cheng2021mask2former,
  title={Masked-attention Mask Transformer for Universal Image Segmentation},
  author={Bowen Cheng and Ishan Misra and Alexander G. Schwing and Alexander Kirillov and Rohit Girdhar},
  journal={arXiv},
  year={2021}
}

If you find the code useful, please also consider the following BibTeX entry.

@inproceedings{cheng2021maskformer,
  title={Per-Pixel Classification is Not All You Need for Semantic Segmentation},
  author={Bowen Cheng and Alexander G. Schwing and Alexander Kirillov},
  journal={NeurIPS},
  year={2021}
}

Acknowledgement

Code is largely based on MaskFormer (https://github.com/facebookresearch/MaskFormer).

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Related tags

Overview

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation

Features

Installation

Getting Started

Advanced usage

Model Zoo and Baselines

License

Citing Mask2Former

Acknowledgement

Owner

Meta Research

PyTorch implementation of residual gated graph ConvNets, ICLR’18

Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.

[3DV 2020] PeeledHuman: Robust Shape Representation for Textured 3D Human Body Reconstruction

Breaking the Curse of Space Explosion: Towards Efficient NAS with Curriculum Search

Official repository of the paper Privacy-friendly Synthetic Data for the Development of Face Morphing Attack Detectors

[CVPR 2021] A Peek Into the Reasoning of Neural Networks: Interpreting with Structural Visual Concepts

Streaming Anomaly Detection Framework in Python (Outlier Detection for Streaming Data)

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.

PyTorch implementation of DeepDream algorithm

Product-based-recommendation-system - A product based recommendation system which uses Machine learning algorithm such as KNN and cosine similarity

Pytorch implementation of paper "Learning Co-segmentation by Segment Swapping for Retrieval and Discovery"

PyTorch implementation of EfficientNetV2

An adaptive hierarchical energy management strategy for hybrid electric vehicles

[NeurIPS'21] "AugMax: Adversarial Composition of Random Augmentations for Robust Training" by Haotao Wang, Chaowei Xiao, Jean Kossaifi, Zhiding Yu, Animashree Anandkumar, and Zhangyang Wang.

The official implementation of the paper, "SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning"

[ArXiv 2021] Data-Efficient Instance Generation from Instance Discrimination

SEJE Pytorch implementation

Pytorch implementation of XRD spectral identification from COD database

PoseCamera is python based SDK for human pose estimation through RGB webcam.

This is the pytorch re-implementation of the IterNorm

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Related tags

Overview

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation

Features

Installation

Getting Started

Advanced usage

Model Zoo and Baselines

License

Citing Mask2Former

Acknowledgement

Owner

Meta Research

PyTorch implementation of residual gated graph ConvNets, ICLR’18

Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.

[3DV 2020] PeeledHuman: Robust Shape Representation for Textured 3D Human Body Reconstruction

Breaking the Curse of Space Explosion: Towards Efficient NAS with Curriculum Search

Official repository of the paper Privacy-friendly Synthetic Data for the Development of Face Morphing Attack Detectors

[CVPR 2021] A Peek Into the Reasoning of Neural Networks: Interpreting with Structural Visual Concepts

Streaming Anomaly Detection Framework in Python (Outlier Detection for Streaming Data)

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

PyTorch implementation of DeepDream algorithm

Product-based-recommendation-system - A product based recommendation system which uses Machine learning algorithm such as KNN and cosine similarity

Pytorch implementation of paper "Learning Co-segmentation by Segment Swapping for Retrieval and Discovery"

PyTorch implementation of EfficientNetV2

An adaptive hierarchical energy management strategy for hybrid electric vehicles

[NeurIPS'21] "AugMax: Adversarial Composition of Random Augmentations for Robust Training" by Haotao Wang, Chaowei Xiao, Jean Kossaifi, Zhiding Yu, Animashree Anandkumar, and Zhangyang Wang.

The official implementation of the paper, "SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning"

[ArXiv 2021] Data-Efficient Instance Generation from Instance Discrimination

SEJE Pytorch implementation

Pytorch implementation of XRD spectral identification from COD database

PoseCamera is python based SDK for human pose estimation through RGB webcam.

This is the pytorch re-implementation of the IterNorm

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.