ICNet for Real-Time Semantic Segmentation on High-Resolution Images, ECCV2018

Last update: Dec 31, 2022

Related tags

Deep Learning ICNet

Overview

ICNet for Real-Time Semantic Segmentation on High-Resolution Images

by Hengshuang Zhao, Xiaojuan Qi, Xiaoyong Shen, Jianping Shi, Jiaya Jia, details are in project page.

Introduction

Based on PSPNet, this repository is build for evaluation in ICNet. For installation, please follow the description in PSPNet repository (support CUDA 7.0/7.5 + cuDNN v4).

Usage

Clone the repository recursively:

git clone --recursive https://github.com/hszhao/ICNet.git

Build Caffe and matcaffe:

cd $ICNET_ROOT/PSPNet
cp Makefile.config.example Makefile.config
vim Makefile.config
make -j8 && make matcaffe
cd ..

Evaluation mIoU:
- Evaluation code is in folder 'evaluation'.
- Download trained models and put them in folder 'evaluation/model':
  - icnet_cityscapes_train_30k.caffemodel: GoogleDrive
    
    (31M, md5: c7038630c4b6c869afaaadd811bdb539; train on trainset for 30k)
  - icnet_cityscapes_trainval_90k.caffemodel: GoogleDrive
    
    (31M, md5: 4f4dd9eecd465dd8de7e4cf88ba5d5d5; train on trainvalset for 90k)
- Modify the related paths in 'eval_all.m':
  - Mainly variables 'data_root' and 'eval_list', and your image list for evaluation should be similar to that in folder 'evaluation/samplelist' if you use this evaluation code structure.
```
cd evaluation
vim eval_all.m
```
- Run the evaluation scripts:
```
./run.sh
```
Evaluation time:
- To get inference time as accurate as possible, it's suggested to make sure the GPU card with specified ID in script 'test_time.sh' is empty (without other processes executing)
- Run the evaluation scripts:
```
./test_time.sh
```
Results:
- Prediction results will show in folder 'evaluation/mc_result' and the expected scores are:
  - ICNet train on trainset for 30K, evaluated on valset (mIoU/pAcc): 67.7/94.5
  - ICNet train on trainvalset for 90K, evaluated on testset (mIoU): 69.5
- Log information of inference time will be in file 'time.log', approximately 33~36ms on TitanX.
Demo video:
- Video processed by ICNet on cityscapes dataset:
  - Alpha blending with value as 0.5: Video

Citation

If ICNet is useful for your research, please consider citing:

@inproceedings{zhao2018icnet,
  title={ICNet for Real-Time Semantic Segmentation on High-Resolution Images},
  author={Zhao, Hengshuang and Qi, Xiaojuan and Shen, Xiaoyong and Shi, Jianping and Jia, Jiaya},
  booktitle={ECCV},
  year={2018}
}

Questions

Please contact '[email protected]'

ICNet for Real-Time Semantic Segmentation on High-Resolution Images, ECCV2018

Related tags

Overview

ICNet for Real-Time Semantic Segmentation on High-Resolution Images

Introduction

Usage

Citation

Questions

Owner

Hengshuang Zhao

Unofficial implementation of Pix2SEQ

PyTorch code for our paper "Attention in Attention Network for Image Super-Resolution"

Tensorflow implementation for Self-supervised Graph Learning for Recommendation

🌾 PASTIS 🌾 Panoptic Agricultural Satellite TIme Series

Back to Event Basics: SSL of Image Reconstruction for Event Cameras

GNNAdvisor: An Efficient Runtime System for GNN Acceleration on GPUs

Equivariant layers for RC-complement symmetry in DNA sequence data

🚀 PyTorch Implementation of "Progressive Distillation for Fast Sampling of Diffusion Models(v-diffusion)"

ChainerRL is a deep reinforcement learning library built on top of Chainer.

Generative Adversarial Networks for High Energy Physics extended to a multi-layer calorimeter simulation

GUPNet - Geometry Uncertainty Projection Network for Monocular 3D Object Detection

ONNX Runtime Web demo is an interactive demo portal showing real use cases running ONNX Runtime Web in VueJS.

Official PyTorch code for WACV 2022 paper "CFLOW-AD: Real-Time Unsupervised Anomaly Detection with Localization via Conditional Normalizing Flows"

TCNN Temporal convolutional neural network for real-time speech enhancement in the time domain

Back to the Feature: Learning Robust Camera Localization from Pixels to Pose (CVPR 2021)

This repository contains all code and data for the Inside Out Visual Place Recognition task

PyTorch implementation of Neural Dual Contouring.

Author Disambiguation using Knowledge Graph Embeddings with Literals

Virtual Dance Reality Stage is a feature that offers you to share a stage with another user virtually.

BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment