New Modeling The Background CodeBase

Last update: Dec 28, 2022

Related tags

Overview

Modeling the Background for Incremental Learning in Semantic Segmentation

This is the updated official PyTorch implementation of our work: "Modeling the Background for Incremental Learning in Semantic Segmentation" accepted at CVPR 2020. For the original implementation, please refer to MiB In the update, we provide:

Support for WandB
Removed Nvidia DDP/AMP for PyTorch ones
Clear and better logging
Fixed MiB parameters in the argparser

We still want to provide users implementations of:

Prototype-based Incremental Few-Shot Semantic Segmentation
PLOP: Learning without Forgetting for Continual Semantic Segmentation
RECALL: Replay-based Continual Learning in Semantic Segmentation
ADD Cityscapes and COCO datasets

Requirements

To install the requirements, use the requirements.txt file:

pip install -r /path/to/requirements.txt

How to download data

In this project we use two dataset, ADE20K and Pascal-VOC 2012. We provide the scripts to download them in data/download_\ .sh. The script takes no inputs but use it in the target directory (where you want to download data).

ImageNet Pretrained Models

After setting the dataset, you download the models pretrained on ImageNet using InPlaceABN. Download the ResNet-101 model (we only need it but you can also download other networks if you want to change it). Then, put the pretrained model in the pretrained folder.

How to perform training

The most important file is run.py, that is in charge to start the training or test procedure. To run it, simpy use the following command:

python -m torch.distributed.launch --nproc_per_node=
   
     run.py --data_root 
    
      --name 
     
       .. other args ..

The default is to use a pretraining for the backbone used, that is searched in the pretrained folder of the project. We used the pretrained model released by the authors of In-place ABN (as said in the paper), that can be found here: link. Since the pretrained are made on multiple-gpus, they contain a prefix "module." in each key of the network. Please, be sure to remove them to be compatible with this code (simply rename them using key = key[7:]). If you don't want to use pretrained, please use --no-pretrained.

There are many options (you can see them all by using --help option), but we arranged the code to being straightforward to test the reported methods. Leaving all the default parameters, you can replicate the experiments by setting the following options.

please specify the data folder using: --data_root
dataset: --dataset voc (Pascal-VOC 2012) | ade (ADE20K)
task: --task, where tasks are
- 15-5, 15-5s, 19-1 (VOC), 100-50, 100-10, 50, 100-50b, 100-10b, 50b (ADE, b indicates the order)
step (each step is run separately): --step, where N is the step number, starting from 0
(only for Pascal-VOC) disjoint is default setup, to enable overlapped: --overlapped
learning rate: --lr 0.01 (for step 0) | 0.001 (for step > 0)
batch size: --batch_size <24/num_GPUs>
epochs: --epochs 30 (Pascal-VOC 2012) | 60 (ADE20K)
method: --method, where names are
- FT, LWF, LWF-MC, ILT, EWC, RW, PI, MIB

For all details please follow the information provided using the help option.

Example commands

LwF on the 100-50 setting of ADE20K, step 0: python -m torch.distributed.launch --nproc_per_node=2 run.py --data_root data --batch_size 12 --dataset ade --name LWF --task 100-50 --step 0 --lr 0.01 --epochs 60 --method LWF

MIB on the 50b setting of ADE20K, step 2: python -m torch.distributed.launch --nproc_per_node=2 run.py --data_root data --batch_size 12 --dataset ade --name MIB --task 100-50 --step 2 --lr 0.001 --epochs 60 --method MIB

LWF-MC on 15-5 disjoint setting of VOC, step 1: python -m torch.distributed.launch --nproc_per_node=2 run.py --data_root data --batch_size 12 --dataset voc --name LWF-MC --task 15-5 --step 1 --lr 0.001 --epochs 30 --method LWF-MC

RW on 15-1 overlapped setting of VOC, step 1: python -m torch.distributed.launch --nproc_per_node=2 run.py --data_root data --batch_size 12 --dataset voc --name LWF-MC --task 15-5s --overlapped --step 1 --lr 0.001 --epochs 30 --method RW

Once you trained the model, you can see the result on tensorboard (we perform the test after the whole training) or you can test it by using the same script and parameters but using the command --test that will skip all the training procedure and test the model on test data.

Cite us

Please, cite the following article when referring to this code/method.

@inProceedings{cermelli2020modeling,
   author = {Cermelli, Fabio and Mancini, Massimiliano and Rota Bul\`o, Samuel and Ricci, Elisa and Caputo, Barbara},
   title  = {Modeling the Background for Incremental Learning in Semantic Segmentation},
   booktitle = {Computer Vision and Pattern Recognition (CVPR)},
   year      = {2020},
   month     = {June}
}

New Modeling The Background CodeBase

Related tags

Overview

Modeling the Background for Incremental Learning in Semantic Segmentation

Requirements

How to download data

ImageNet Pretrained Models

How to perform training

Example commands

Cite us

Owner

Fabio Cermelli

CoNLL-English NER Task (NER in English)

Code for CodeT5: a new code-aware pre-trained encoder-decoder model.

Contains analysis of trends from Fitbit Dataset (source: Kaggle) to see how the trends can be applied to Bellabeat customers and Bellabeat products

In this project, we compared Spanish BERT and Multilingual BERT in the Sentiment Analysis task.

CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training

This is the 25 + 1 year anniversary version of the 1995 Rachford-Rice contest

Levenshtein and Hamming distance computation

Proquabet - Convert your prose into proquints and then you essentially have Vogon poetry

Module for automatic summarization of text documents and HTML pages.

Implementation of Multistream Transformers in Pytorch

Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

This project uses unsupervised machine learning to identify correlations between daily inoculation rates in the USA and twitter sentiment in regards to COVID-19.

A library that integrates huggingface transformers with the world of fastai, giving fastai devs everything they need to train, evaluate, and deploy transformer specific models.

DAGAN - Dual Attention GANs for Semantic Image Synthesis

State-of-the-art NLP through transformer models in a modular design and consistent APIs.

Code repository of the paper Neural circuit policies enabling auditable autonomy published in Nature Machine Intelligence

Recognition of 38 speech commands in russian. Based on Yandex Cup 2021 ML Challenge: ASR

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

A 10000+ hours dataset for Chinese speech recognition

Installation, test and evaluation of Scribosermo speech-to-text engine