USD-Seg: Learning Universal Shape Dictionary for Realtime Instance Segmentation (2020)

Last update: Nov 28, 2022

Overview

USD-Seg

This project is an implementation of the paper "USD-Seg: Learning Universal Shape Dictionary for Realtime Instance Segmentation", based on the FCOS detector from the MMDetection toolbox.

Introduction

We present a novel explicit shape representation for instance segmentation. The proposed USD-Seg models object shapes with a linear model: sparse coding over a dictionary. It first learns a dictionary from a large collection of shape datasets, so that any shape can be decomposed into a linear combination of dictionary atoms; hence the name "Universal Shape Dictionary". It then adds a simple shape-vector regression head to an ordinary object detector, giving the detector segmentation ability with minimal overhead.
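As a rough sketch of the linear model described above, a flattened mask can be written as a combination of dictionary atoms weighted by a sparse shape vector. The symbols here (m for the flattened mask, D for the dictionary with K atoms d_k, s for the shape vector, lambda for the sparsity weight) are assumed notation, and the objective shown is the standard sparse coding formulation, which may differ in detail from the loss used in the paper.

```latex
% Shape as a linear combination of K dictionary atoms (assumed notation)
\mathbf{m} \;\approx\; D\,\mathbf{s} \;=\; \sum_{k=1}^{K} s_k\,\mathbf{d}_k

% Standard sparse coding objective over N training shapes
\min_{D,\;\{\mathbf{s}_i\}} \;\sum_{i=1}^{N}
  \tfrac{1}{2}\,\lVert \mathbf{m}_i - D\,\mathbf{s}_i \rVert_2^2
  \;+\; \lambda\,\lVert \mathbf{s}_i \rVert_1
```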

License

This project is released under the Apache 2.0 license.

Model

The overall pipeline of USD-Seg: an RGB image is fed to the base detector, which regresses both the detection outputs (bounding box and class) and the shape vector. The mask is then decoded by multiplying the shape vector with the dictionary atoms, followed by resizing and thresholding.
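The decoding step can be illustrated with a minimal, dependency-free sketch. It is not the repository's implementation: the function name decode_mask, the nearest-neighbour resize, and the 0.5 threshold are all assumptions chosen for illustration, and the dictionary is assumed to be a list of flattened square atoms.

```python
def decode_mask(shape_vector, dictionary, side, box_w, box_h, threshold=0.5):
    """Decode a binary mask from a shape vector (illustrative sketch only).

    shape_vector: K coefficients regressed by the detector.
    dictionary:   K atoms, each a flat list of side * side floats (assumed layout).
    box_w, box_h: size in pixels of the predicted bounding box.
    threshold:    binarisation cut-off (0.5 is an assumed value).
    """
    if len(shape_vector) != len(dictionary):
        raise ValueError("shape vector and dictionary must have the same length")

    # Linear combination of atoms: flat[i] = sum_k s_k * d_k[i].
    flat = [0.0] * (side * side)
    for coeff, atom in zip(shape_vector, dictionary):
        for i, value in enumerate(atom):
            flat[i] += coeff * value

    # Nearest-neighbour resize to the box size (an assumed interpolation),
    # followed by thresholding to obtain a binary mask.
    mask = []
    for y in range(box_h):
        src_y = min(side - 1, y * side // box_h)
        row = []
        for x in range(box_w):
            src_x = min(side - 1, x * side // box_w)
            row.append(1 if flat[src_y * side + src_x] > threshold else 0)
        mask.append(row)
    return mask


# Toy example: two 2x2 atoms, decoded into a 4x4 box.
toy_dictionary = [[1.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 1.0]]
toy_mask = decode_mask([0.9, 0.2], toy_dictionary, side=2, box_w=4, box_h=4)
```

Because decoding is a single weighted sum per instance plus a resize, the extra cost on top of the detector stays small, which is consistent with the "minimal overhead" claim in the introduction.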

Installation

Please refer to INSTALL.md for installation and dataset preparation.

Get Started

Please see GETTING_STARTED.md for the basic usage of MMDetection.
We follow the standard MMDetection workflow. To train from scratch, use the USD-Seg configs in /configs/usdseg/.

Citation

If you use this toolbox or benchmark in your research, please cite this project and mmdetection.

@article{USD-Seg,
  title   = {Learning Universal Shape Dictionary for Realtime Instance Segmentation},
  author  = {Tang, Tutian and Xu, Wenqiang and Ye, Ruolin and Yang, Lixin and Lu, Cewu},
  journal = {arXiv preprint arXiv:2012.01050},
  year    = {2020}
}

Contact

This repo is currently maintained by Tutian Tang (@ElectronicElephant) and Ruolin Ye (@YoruCathy). Other core developers include Wenqiang Xu (@WenqiangX). For technical details, please feel free to contact the authors directly by email.

Owner

Ruolin Ye
