Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation in TensorFlow 2

Last update: Dec 16, 2021

Related tags

Deep Learning Mask2Former

Overview

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation in TensorFlow 2

Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar [arXiv]

Features

A single architecture for three tasks: panoptic, instance and semantic segmentation. This straightforward mini project was built as part of the main project, IST: A TensorFlow 2 compatible instance segmentation toolbox, with the purpose of adapting recent research into segmentation approaches into TensorFlow.
Support common benchmark datasets: ADE20K, Cityscapes, COCO, Mapillary Vistas.

Getting started

Project is currently being built, with SwinTransformerV1 and SwinTransformerV2 and a few bits and pieces ready.

License

Shield:

The majority of MaskFormer is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

However portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license.

Citation

@article{cheng2021mask2former,
  title={Masked-attention Mask Transformer for Universal Image Segmentation},
  author={Bowen Cheng and Ishan Misra and Alexander G. Schwing and Alexander Kirillov and Rohit Girdhar},
  journal={arXiv},
  year={2021}
}

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation in TensorFlow 2

Related tags

Overview

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation in TensorFlow 2

Features

Getting started

License

Citation

Owner

Phan Nguyen

Weighing Counts: Sequential Crowd Counting by Reinforcement Learning

A PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks" (KDD 2019).

GRF: Learning a General Radiance Field for 3D Representation and Rendering

Implicit Graph Neural Networks

A simple baseline for 3d human pose estimation in tensorflow. Presented at ICCV 17.

Accurate identification of bacteriophages from metagenomic data using Transformer

Learning trajectory representations using self-supervision and programmatic supervision.

A PaddlePaddle implementation of STGCN with a few modifications in the model architecture in order to forecast traffic jam.

Pytorch Implementation of paper "Noisy Natural Gradient as Variational Inference"

KGDet: Keypoint-Guided Fashion Detection (AAAI 2021)

MPLP: Metapath-Based Label Propagation for Heterogenous Graphs

Hypersearch weight debugging and losses tutorial

Official code for article "Expression is enough: Improving traﬀic signal control with advanced traﬀic state representation"

OBBDetection: an oriented object detection toolbox modified from MMdetection

Use graph-based analysis to re-classify stocks and to improve Markowitz portfolio optimization

Classical OCR DCNN reproduction based on PaddlePaddle framework.

A flag generation AI created using DeepAIs API

Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".

A fast and easy to use, moddable, Python based Minecraft server!

Pytorch implementation of CVPR2021 paper "MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image Generation"