implementation for paper "ShelfNet for fast semantic segmentation"

Last update: Sep 16, 2022

Related tags

Overview

ShelfNet-lightweight for paper (ShelfNet for fast semantic segmentation)

This repo contains implementation of ShelfNet-lightweight models for real-time models on Cityscapes.
For real-time tasks, we achieved 74.8% mIoU on Ctiyscapes dataset, with a speed of 59.2 FPS (61.7 FPS for BiSeNet at 74.7% on a GTX 1080Ti GPU).
For non real-time tasks, we achieved 79.0% mIoU on Cityscapes test set with ResNet34 backbone, suparssing other models (PSPNet and BiSeNet) with largers backbones with ResNet50 or Resnet 101 backbone.
For Non light-weight ShelfNet implementation, refer to another ShelfNet repo.
This branch is the result on Cityscapes experiment, for results on PASCAL, see branch pascal

This repo is based on two implementations Implementation 1 and Implementation 2. This implementation takes about 24h's training on 2 GTX 1080Ti GPU.

Results

Link to results on Cityscapes test set

ShelfNet18-lw real-time: https://www.cityscapes-dataset.com/anonymous-results/?id=b2cc8f49fc3267c73e6bb686425016cb152c8bc34fc09ac207c81749f329dc8d
ShelfNet34-lw non real-time: https://www.cityscapes-dataset.com/anonymous-results/?id=c0a7c8a4b64a880a715632c6a28b116d239096b63b5d14f5042c8b3280a7169d

Data Preparation

Download fine labelled dataset from Cityscapes server, and decompress into ./data folder.
You might need to modify data path here and here

$ mkdir -p data
$ mv /path/to/leftImg8bit_trainvaltest.zip data
$ mv /path/to/gtFine_trainvaltest.zip data
$ cd data
$ unzip leftImg8bit_trainvaltest.zip
$ unzip gtFine_trainvaltest.zip

Two models and the pretrained weights

We provide two models, ShelfNet18 with 64 base channels for real-time semantic segmentation, and ShelfNet34 with 128 base channels for non-real-time semantic segmentation.
Pretrained weights for ShelfNet18 and ShelfNet34.

Requirements

PyTorch 1.1
python3
scikit-image
tqdm

How to run

Find the folder (cd ShelfNet18_realtime or cd ShelfNet34_non_realtime)

training

CUDA_VISIBLE_DEVICES=0,1 python -m torch.distributed.launch --nproc_per_node=2 train.py

evaluate on validation set (Create a folder called res, this folder is automatically created if you train the model. Put checkpoint in resfolder, and make sure the checkpoint name and dataset path match evaluate.py. Change checkpoint name to model_final.pthby default)

python evaluate.py

Running speed

test running speed of ShelfNet18-lw

python test_speed.py

You can modify the shape of input images to test running speed, by modifying here
You can test running speed of different models by modifying here
The running speed is an average of 100 single forward passes, therefore it's possible the speed varies. The code returns the mean running time by default.

implementation for paper "ShelfNet for fast semantic segmentation"

Related tags

Overview

ShelfNet-lightweight for paper (ShelfNet for fast semantic segmentation)

Results

Link to results on Cityscapes test set

Data Preparation

Two models and the pretrained weights

Requirements

How to run

Running speed

Owner

Juntang Zhuang

DRLib：A concise deep reinforcement learning library, integrating HER and PER for almost off policy RL algos.

Towards Understanding Quality Challenges of the Federated Learning: A First Look from the Lens of Robustness

TrackFormer: Multi-Object Tracking with Transformers

Parametric Contrastive Learning (ICCV2021)

Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document corpus.

Code for "PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation" CVPR 2019 oral

AFLFast (extends AFL with Power Schedules)

Rainbow DQN implementation that outperforms the paper's results on 40% of games using 20x less data 🌈

Real-Time Semantic Segmentation in Mobile device

We are More than Our JOints: Predicting How 3D Bodies Move

Speech-Emotion-Analyzer - The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Soomvaar is the repo which 🏩 contains different collection of 👨‍💻🚀code in Python and 💫✨Machine 👬🏼 learning algorithms📗📕 that is made during 📃 my practice and learning of ML and Python✨💥

Towards Debiasing NLU Models from Unknown Biases

Implementation for On Provable Benefits of Depth in Training Graph Convolutional Networks

Source code, datasets and trained models for the paper Learning Advanced Mathematical Computations from Examples (ICLR 2021), by François Charton, Amaury Hayat (ENPC-Rutgers) and Guillaume Lample

Checkout some cool self-projects you can try your hands on to curb your boredom this December!

This repository contains the implementation of the following paper: Cross-Descriptor Visual Localization and Mapping

Sub-Cluster AdaCos: Learning Representations for Anomalous Sound Detection.

Code of the paper "Shaping Visual Representations with Attributes for Few-Shot Learning (ASL)".