PyTorch implementation of Memory-based semantic segmentation for off-road unstructured natural environments.

Last update: Nov 28, 2022

MemSeg: Memory-based semantic segmentation for off-road unstructured natural environments

Introduction

This repository is a PyTorch implementation of Memory-based semantic segmentation for off-road unstructured natural environments. This work is based on semseg.

The codebase mainly uses ResNet18, ResNet50 and MobileNet-V2 as backbones with an ASPP module, and it can be easily adapted to other basic semantic segmentation structures.

The sample dataset used in the experiments is RUGD.

Requirements

Hardware: a GPU with at least 11 GB of memory

Software: PyTorch >= 1.0.0, Python 3
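The requirements above can be checked before training with a small script. This is a minimal sketch and not part of the repository: the helper names `parse_version` and `check_environment` are hypothetical, and reading "11G" as 11 * 10**9 bytes is an assumption.

```python
import re

# Minimums taken from the requirements above. Treating "11G" as 11 * 10**9
# bytes is an assumption; adjust it if your card reports memory differently.
MIN_TORCH = (1, 0, 0)
MIN_GPU_BYTES = 11 * 10**9


def parse_version(text):
    """Turn '1.13.1+cu117' into (1, 13, 1), ignoring any local suffix."""
    numbers = re.findall(r"\d+", text.split("+")[0])
    numbers = (numbers + ["0", "0", "0"])[:3]  # pad short versions such as '1.0'
    return tuple(int(n) for n in numbers)


def check_environment():
    """Return a list of human-readable problems; an empty list means OK."""
    try:
        import torch
    except ImportError:
        return ["PyTorch is not installed"]
    problems = []
    if parse_version(torch.__version__) < MIN_TORCH:
        problems.append("PyTorch %s is older than 1.0.0" % torch.__version__)
    if not torch.cuda.is_available():
        problems.append("no CUDA device is visible")
    else:
        # total_memory is reported in bytes for the selected device.
        total = torch.cuda.get_device_properties(0).total_memory
        if total < MIN_GPU_BYTES:
            problems.append("GPU 0 has only %.1f GB of memory" % (total / 1e9))
    return problems


if __name__ == "__main__":
    for line in check_environment() or ["environment looks fine"]:
        print(line)
```

Only the first GPU is inspected here; on a multi-GPU machine you may want to loop over `torch.cuda.device_count()`.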

Usage

For installation, follow the steps below, or refer to the instructions described here.

The pretrained ResNet50 backbone model can be downloaded from URL.

Getting Started

Installation

Clone this repository.

git clone https://github.com/youngsjjn/MemSeg.git

Install Python dependencies.

pip install -r requirements.txt

Implementation

Download the dataset (e.g. RUGD) and change the data root path in the config file.

Download the data list for RUGD here.
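Changing the data root can be done by hand, or scripted as in the sketch below. It assumes the config is a YAML file with a `data_root:` entry, as in the upstream semseg configs; that key name, the helper `set_data_root`, and the example fragment are all assumptions, so check them against the actual config file before relying on this.

```python
import re


def set_data_root(config_text, new_root):
    """Return config_text with the value of the first `data_root:` entry replaced.

    `data_root` is an assumed key name. Indentation and every other line of
    the config are left untouched.
    """
    pattern = re.compile(r"^([ \t]*data_root:[ \t]*).*$", re.MULTILINE)
    # A function replacement avoids backslash surprises in Windows-style paths.
    updated, count = pattern.subn(lambda m: m.group(1) + new_root, config_text, count=1)
    if count == 0:
        raise KeyError("no data_root entry found in the config text")
    return updated


# Hypothetical config fragment, used only to demonstrate the helper.
example = "DATA:\n  data_root: dataset/rugd\n  train_list: list/train.txt\n"
print(set_data_root(example, "/data/RUGD"))
```

To apply it to a real file, read the config text, pass it through `set_data_root`, and write the result back.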

Inference

To run inference with pretrained models, download the pretrained networks from my drive and save them in ./exp/rugd/.

Run inference with "ResNet50 + Deeplabv3" without the memory module:

sh tool/test.sh rugd deeplab50

Run inference with "ResNet50 + Deeplabv3" with the memory module:

sh tool/test_mem.sh rugd deeplab50mem

Network	mIoU
ResNet18 + PSPNet	33.42
ResNet18 + PSPNet (Memory)	34.13
ResNet18 + Deeplabv3	33.48
ResNet18 + Deeplabv3 (Memory)	35.07
ResNet50 + Deeplabv3	36.77
ResNet50 + Deeplabv3 (Memory)	37.71
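To make the effect of the memory module easier to read off the table, the following sketch computes the mIoU difference for each network. The numbers are copied from the table above; the function name `memory_gains` is only illustrative.

```python
# mIoU values copied from the table above, keyed by network name.
baseline = {
    "ResNet18 + PSPNet": 33.42,
    "ResNet18 + Deeplabv3": 33.48,
    "ResNet50 + Deeplabv3": 36.77,
}
with_memory = {
    "ResNet18 + PSPNet": 34.13,
    "ResNet18 + Deeplabv3": 35.07,
    "ResNet50 + Deeplabv3": 37.71,
}


def memory_gains(baseline, with_memory):
    """Return the mIoU difference (memory minus baseline) per network."""
    return {name: round(with_memory[name] - baseline[name], 2) for name in baseline}


gains = memory_gains(baseline, with_memory)
for name, gain in gains.items():
    print("%-22s +%.2f mIoU" % (name, gain))
```

With these values, the memory module adds 0.71, 1.59 and 0.94 mIoU points respectively.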

Train

Evaluation is included at the end of training.

Train "ResNet50 + Deeplabv3" without the memory module:

sh tool/train.sh rugd deeplab50

Train "ResNet50 + Deeplabv3" with the memory module:

sh tool/train_mem.sh rugd deeplab50mem

The examples above train or test "ResNet50 + Deeplabv3". To use another network, replace "deeplab50" or "deeplab50mem" with the postfix of the corresponding config file name.

For example, train "ResNet18 + PSPNet" with the memory module:

sh tool/train_mem.sh rugd pspnet18mem
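The commands above follow a regular pattern, which the sketch below spells out. The script names come from this README; the naming rule (architecture, depth, optional "mem" suffix) is inferred from the three experiment names shown here, so a given combination may not have a matching config file. The helpers `experiment_name` and `build_command` are hypothetical.

```python
def experiment_name(architecture, depth, memory=False):
    """Build a postfix such as 'deeplab50' or 'pspnet18mem'."""
    return "%s%d%s" % (architecture, depth, "mem" if memory else "")


def build_command(mode, dataset, architecture, depth, memory=False):
    """Return the shell command for training ('train') or inference ('test')."""
    if mode not in ("train", "test"):
        raise ValueError("mode must be 'train' or 'test'")
    # Memory variants use the *_mem.sh scripts, as in the commands above.
    script = "tool/%s%s.sh" % (mode, "_mem" if memory else "")
    name = experiment_name(architecture, depth, memory)
    return "sh %s %s %s" % (script, dataset, name)


print(build_command("train", "rugd", "pspnet", 18, memory=True))
# sh tool/train_mem.sh rugd pspnet18mem
```

The helper only builds the command string; it does not check that the config file or the scripts exist.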

Citation

If you like our work and use the code or models for your research, please cite our work as follows.

@article{DBLP:journals/corr/abs-2108-05635,
  author    = {Youngsaeng Jin and
               David K. Han and
               Hanseok Ko},
  title     = {Memory-based Semantic Segmentation for Off-road Unstructured Natural
               Environments},
  journal   = {CoRR},
  volume    = {abs/2108.05635},
  year      = {2021},
  url       = {https://arxiv.org/abs/2108.05635},
  eprinttype = {arXiv},
  eprint    = {2108.05635},
  timestamp = {Wed, 18 Aug 2021 19:45:42 +0200},
  biburl    = {https://dblp.org/rec/journals/corr/abs-2108-05635.bib},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}
