ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

Last update: Jan 03, 2023

Overview

ST++

This is the official PyTorch implementation of our paper:

ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation.
Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi and Yang Gao.

Getting Started

Data Preparation

Pre-trained Model

ResNet-50 | ResNet-101 | DeepLabv2-ResNet-101

Dataset

Pascal | Augmented Masks | Cityscapes | Class Mapped Masks

File Organization

├── ./pretrained
    ├── resnet50.pth
    ├── resnet101.pth
    └── deeplabv2_resnet101_coco_pretrained.pth
    
├── [Your Pascal Path]
    ├── JPEGImages
    └── SegmentationClass    # replace the official folder with above augmented masks 
    
├── [Your Cityscapes Path]
    ├── gtFine               # replace the official folder with above class mapped masks 
    └── leftImg8bit

Training and Testing

export semi_setting='pascal/1_8/split_0'

CUDA_VISIBLE_DEVICES=0,1 python -W ignore main.py \
  --dataset pascal --data-root [Your Pascal Path] \
  --batch-size 16 --backbone resnet50 --model deeplabv3plus \
  --labeled-id-path dataset/splits/$semi_setting/labeled.txt \
  --unlabeled-id-path dataset/splits/$semi_setting/unlabeled.txt \
  --pseudo-mask-path outdir/pseudo_masks/$semi_setting \
  --save-path outdir/models/$semi_setting

This script is for our ST framework. To run ST++, add --plus --reliable-id-path outdir/reliable_ids/$semi_setting.

Acknowledgement

The DeepLabv2 MS COCO pre-trained model is borrowed and converted from AdvSemiSeg. The image partitions are borrowed from Context-Aware-Consistency and PseudoSeg. Part of the training hyper-parameters and network structures are adapted from PyTorch-Encoding. The strong data augmentations are borrowed from MoCo v2 and PseudoSeg.

AdvSemiSeg: https://github.com/hfslyc/AdvSemiSeg.
Context-Aware-Consistency: https://github.com/dvlab-research/Context-Aware-Consistency.
PseudoSeg: https://github.com/googleinterns/wss.
PyTorch-Encoding: https://github.com/zhanghang1989/PyTorch-Encoding.
MoCo: https://github.com/facebookresearch/moco.
OpenSelfSup: https://github.com/open-mmlab/OpenSelfSup.

Thanks a lot for their great works!

Citation

If you find this project useful, please consider citing:

@article{yang2021st++,
  title={ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation},
  author={Yang, Lihe and Zhuo, Wei and Qi, Lei and Shi, Yinghuan and Gao, Yang},
  journal={arXiv preprint arXiv:2106.05095},
  year={2021}
}

ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

Related tags

Overview

ST++

Getting Started

Data Preparation

Pre-trained Model

Dataset

File Organization

Training and Testing

Acknowledgement

Citation

Owner

Lihe Yang

Pose estimation with MoveNet Lightning

"SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer.

TAug :: Time Series Data Augmentation using Deep Generative Models

The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer

Demo project for real time anomaly detection using kafka and python

RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation (CIKM'17)

Repo for "Event-Stream Representation for Human Gaits Identification Using Deep Neural Networks"

An evaluation toolkit for voice conversion models.

Train Yolov4 using NBX-Jobs

一套完整的微博舆情分析流程代码，包括微博爬虫、LDA主题分析和情感分析。

Event queue (Equeue) dialect is an MLIR Dialect that models concurrent devices in terms of control and structure.

Code implementation from my Medium blog post: [Transformers from Scratch in PyTorch]

Object Tracking and Detection Using OpenCV

Dogs classification with Deep Metric Learning using some popular losses

Lolviz - A simple Python data-structure visualization tool for lists of lists, lists, dictionaries; primarily for use in Jupyter notebooks / presentations

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Python Rapid Artificial Intelligence Ab Initio Molecular Dynamics

Implementation for "Domain-Specific Bias Filtering for Single Labeled Domain Generalization"

Repository containing the PhD Thesis "Formal Verification of Deep Reinforcement Learning Agents"