This is the official implementation of the paper "Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation".

Last update: May 03, 2022

Related tags

Deep Learning ObjProp

Overview

ObjProp

Introduction

This is the official implementation of the paper "Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation".

Installation

This repo is built using mmdetection. To install the dependencies, first clone the repository locally:

git clone https://github.com/anirudh-chakravarthy/objprop.git

Then, install PyTorch 1.1.0, torchvision 0.3.0, mmcv 0.2.12:

conda install pytorch==1.1.0 torchvision==0.3.0 -c pytorch
pip install mmcv==0.2.12

Then, install the CocoAPI for YouTube-VIS

conda install cython scipy
pip install git+https://github.com/youtubevos/cocoapi.git#"egg=pycocotools&subdirectory=PythonAPI"

Training

First, download and prepare the YouTube-VIS dataset using the following instructions.

To train ObjProp, run the following command:

python3 tools/train.py configs/masktrack_rcnn_r50_fpn_1x_youtubevos_objprop.py

In order to change the arguments such as dataset directory, learning rate, number of GPUs, etc, refer to the following configuration file configs/masktrack_rcnn_r50_fpn_1x_youtubevos_objprop.py.

Inference

To perform inference using ObjProp, run the following command:

python3 tools/test_video.py configs/masktrack_rcnn_r50_fpn_1x_youtubevos_objprop.py [MODEL_PATH] --out [OUTPUT_PATH.json] --eval segm

A JSON file with the inference results will be saved at OUTPUT_PATH.json. To evaluate the performance, submit the result file to the evaluation server.

License

ObjProp is released under the Apache 2.0 license.

Citation

@article{Chakravarthy2021ObjProp,
  author = {Anirudh S Chakravarthy and Won-Dong Jang and Zudi Lin and Donglai Wei and Song Bai and Hanspeter Pfister},  
  title = {Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation},
  journal = {CoRR},
  volume = {abs/2111.07529},
  year = {2021},
  url = {https://arxiv.org/abs/2111.07529}
}

This is the official implementation of the paper "Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation".

Related tags

Overview

ObjProp

Introduction

Installation

Training

Inference

License

Citation

Owner

Anirudh S Chakravarthy

Implementation of Hierarchical Transformer Memory (HTM) for Pytorch

Official code for paper "ISNet: Costless and Implicit Image Segmentation for Deep Classifiers, with Application in COVID-19 Detection"

PyTorch implementation and pretrained models for XCiT models. See XCiT: Cross-Covariance Image Transformer

Happywhale - Whale and Dolphin Identification Silver🥈 Solution (26/1588)

3rd place solution for the Weather4cast 2021 Stage 1 Challenge

PyTorch implementation for paper "Full-Body Visual Self-Modeling of Robot Morphologies".

Annotate datasets with a semi-trained or fully trained YOLOv5 model

Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity

On the adaptation of recurrent neural networks for system identification

Train a state-of-the-art yolov3 object detector from scratch!

BookMyShowPC - Movie Ticket Reservation App made with Tkinter

The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".

A simple Neural Network that predicts the label for a series of handwritten digits

[WACV 2022] Contextual Gradient Scaling for Few-Shot Learning

Disagreement-Regularized Imitation Learning

Semantic segmentation task for ADE20k & cityscapse dataset, based on several models.

Extracting knowledge graphs from language models as a diagnostic benchmark of model performance.

TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classification

TensorFlow implementation of Barlow Twins (Barlow Twins: Self-Supervised Learning via Redundancy Reduction)

Code for "FPS-Net: A convolutional fusion network for large-scale LiDAR point cloud segmentation".