FreeSOLO for unsupervised instance segmentation, CVPR 2022

Last update: Jan 02, 2023

Overview

FreeSOLO: Learning to Segment Objects without Annotations

This project hosts the code for implementing the FreeSOLO algorithm for unsupervised instance segmentation.

FreeSOLO: Learning to Segment Objects without Annotations,
Xinlong Wang, Zhiding Yu, Shalini De Mello, Jan Kautz, Anima Anandkumar, Chunhua Shen, Jose M. Alvarez
In: Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2022
arXiv preprint (arXiv 2202.12181)

Visual Results

Installation

Prerequisites

Linux or macOS with Python >= 3.6
PyTorch >= 1.5 and torchvision that matches the PyTorch installation.
scikit-image
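
The prerequisites above can be verified before building Detectron2. The following is a minimal sketch, not part of the FreeSOLO repository: the helper names `parse_version` and `check_prerequisites` are hypothetical, and the check relies only on each package exposing a `__version__` attribute.

```python
import importlib
import sys


def parse_version(text):
    """Return the leading numeric components of a version string as a tuple.

    Local build tags such as "+cu117" and suffixes such as "a0" are ignored.
    """
    parts = []
    for piece in text.split("+")[0].split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def check_prerequisites():
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []
    if sys.version_info < (3, 6):
        problems.append("Python >= 3.6 is required")
    # (module name, minimum version or None when any version is accepted)
    requirements = (("torch", (1, 5)), ("torchvision", None), ("skimage", None))
    for module_name, minimum in requirements:
        try:
            module = importlib.import_module(module_name)
        except ImportError:
            problems.append(module_name + " is not installed")
            continue
        version = parse_version(getattr(module, "__version__", ""))
        if minimum is not None and version < minimum:
            wanted = ".".join(str(n) for n in minimum)
            problems.append(module_name + " >= " + wanted + " is required")
    return problems
```

Calling `check_prerequisites()` inside the activated Conda environment returns the list of unmet requirements. It does not check that torchvision matches the installed PyTorch build, so that still needs to be confirmed manually.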

Install PyTorch in Conda env

# create conda env
conda create -n detectron2 python=3.6
# activate the environment
conda activate detectron2
# install PyTorch >=1.5 with GPU
conda install pytorch torchvision -c pytorch

Build Detectron2 from Source

Follow the INSTALL.md to install Detectron2 (commit id 11528ce has been tested).

Datasets

Follow the datasets/README.md to set up the MS COCO dataset.

Pre-trained model

Download the DenseCL pre-trained model from here. Convert it to Detectron2's format and put the converted model under the "training_dir/pre-trained/DenseCL" directory.

python tools/convert-pretrain-to-detectron2.py {WEIGHT_FILE}.pth {WEIGHT_FILE}.pkl

Usage

Free Mask

Download the prepared free masks in JSON format from here and put the file under the "datasets/coco/annotations" directory. Alternatively, generate them yourself:

bash inference_freemask.sh
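
Before training, it can help to confirm that the free-mask file is in place and loads correctly. The sketch below assumes the file follows the COCO annotation layout (top-level "images" and "annotations" keys), which this README does not state explicitly. The function name `summarize_annotations` is hypothetical, and the file name must be supplied by the caller because it is not given here.

```python
import json
from pathlib import Path


def summarize_annotations(json_path):
    """Load a COCO-style annotation file and return (num_images, num_annotations).

    Assumption: the free masks are stored in the COCO annotation layout.
    """
    path = Path(json_path)
    if not path.is_file():
        raise FileNotFoundError("annotation file not found: " + str(path))
    with path.open("r", encoding="utf-8") as handle:
        data = json.load(handle)
    # Report every missing top-level key at once rather than failing on the first.
    missing = [key for key in ("images", "annotations") if key not in data]
    if missing:
        raise ValueError("missing COCO keys: " + ", ".join(missing))
    return len(data["images"]), len(data["annotations"])
```

Pass the path of the downloaded or generated JSON file under "datasets/coco/annotations". A zero annotation count would suggest that mask generation did not complete.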

Training

# train with free masks
bash train.sh

# generate pseudo labels
bash gen_pseudo_labels.sh

# self-train
bash train_pl.sh
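
The three training steps can be chained so that a failure in one stage stops the run before the next begins. This is a minimal sketch, assuming the scripts are meant to run in the order listed above and from the repository root. The `STAGES` list and the `run_pipeline` helper are illustrative and not part of the repository.

```python
import subprocess

# Stage labels and commands are taken from the steps listed above.
STAGES = [
    ("train with free masks", ["bash", "train.sh"]),
    ("generate pseudo labels", ["bash", "gen_pseudo_labels.sh"]),
    ("self-train", ["bash", "train_pl.sh"]),
]


def run_pipeline(stages=STAGES, runner=subprocess.run):
    """Run each stage in order and return the labels of the completed stages.

    With the default runner, check=True makes subprocess.run raise
    CalledProcessError on a non-zero exit code, so later stages are skipped.
    """
    completed = []
    for label, command in stages:
        print("==>", label + ":", " ".join(command))
        runner(command, check=True)
        completed.append(label)
    return completed
```

Calling `run_pipeline()` launches all three stages. The `runner` parameter exists only so the order of commands can be inspected or tested without starting a training job.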

Testing

Download the trained model from here.

bash test.sh {MODEL_PATH}

Citations

Please consider citing our paper in your publications if the project helps your research. The BibTeX reference is as follows.

@article{wang2022freesolo,
  title={{FreeSOLO}: Learning to Segment Objects without Annotations},
  author={Wang, Xinlong and Yu, Zhiding and De Mello, Shalini and Kautz, Jan and Anandkumar, Anima and Shen, Chunhua and Alvarez, Jose M},
  journal={arXiv preprint arXiv:2202.12181},
  year={2022}
}

Owner

NVIDIA Research Projects