Per-Pixel Classification is Not All You Need for Semantic Segmentation

Last update: Jan 08, 2023

Related tags

Deep Learning MaskFormer

Overview

MaskFormer: Per-Pixel Classification is Not All You Need for Semantic Segmentation

Bowen Cheng, Alexander G. Schwing, Alexander Kirillov

[arXiv] [Project] [BibTeX]

Features

Better results while being more efficient.
Unified view of semantic- and instance-level segmentation tasks.
Support major semantic segmentation datasets: ADE20K, Cityscapes, COCO-Stuff, Mapillary Vistas.
Support ALL Detectron2 models.

Installation

See installation instructions.

Getting Started

See Preparing Datasets for MaskFormer.

See Getting Started with MaskFormer.

Model Zoo and Baselines

We provide a large set of baseline results and trained models available for download in the MaskFormer Model Zoo.

License

Shield:

The majority of MaskFormer is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

However portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license.

Citing MaskFormer

If you use MaskFormer in your research or wish to refer to the baseline results published in the Model Zoo, please use the following BibTeX entry.

@article{cheng2021maskformer,
  title={Per-Pixel Classification is Not All You Need for Semantic Segmentation},
  author={Bowen Cheng and Alexander G. Schwing and Alexander Kirillov},
  journal={arXiv},
  year={2021}
}

Per-Pixel Classification is Not All You Need for Semantic Segmentation

Related tags

Overview

MaskFormer: Per-Pixel Classification is Not All You Need for Semantic Segmentation

Features

Installation

Getting Started

Model Zoo and Baselines

License

Citing MaskFormer

Owner

Facebook Research

Implementation for Paper "Inverting Generative Adversarial Renderer for Face Reconstruction"

Multi-Stage Spatial-Temporal Convolutional Neural Network (MS-GCN)

Tensor-based approaches for fMRI classification

Listing arxiv - Personalized list of today's articles from ArXiv

The repo contains the code to train and evaluate a system which extracts relations and explanations from dialogue.

Official code of ICCV2021 paper "Residual Attention: A Simple but Effective Method for Multi-Label Recognition"

Speckle-free Holography with Partially Coherent Light Sources and Camera-in-the-loop Calibration

WeakVRD-Captioning - Implementation of paper Improving Image Captioning with Better Use of Caption

CC-GENERATOR - A python script for generating CC

Source code and data in paper "MDFEND: Multi-domain Fake News Detection (CIKM'21)"

Pytorch library for seismic data augmentation

shufflev2-yolov5：lighter, faster and easier to deploy

Benchmark for the generalization of 3D machine learning models across different remeshing/samplings of a surface.

Multi-Object Tracking in Satellite Videos with Graph-Based Multi-Task Modeling

Multiple Object Tracking with Yolov5!

a short visualisation script for pyvideo data

Spatial Contrastive Learning for Few-Shot Classification (SCL)

Edison AT is software Depression Assistant personal.

Code for the paper "JANUS: Parallel Tempered Genetic Algorithm Guided by Deep Neural Networks for Inverse Molecular Design"

data/code repository of "C2F-FWN: Coarse-to-Fine Flow Warping Network for Spatial-Temporal Consistent Motion Transfer"