DetCo: Unsupervised Contrastive Learning for Object Detection

Last update: Dec 18, 2022

Related tags

Deep Learning DetCo

Overview

DetCo: Unsupervised Contrastive Learning for Object Detection

arxiv link

News

Sparse RCNN+DetCo improves from 45.0 AP to 46.5 AP(+1.5) with 3x+ms train. See details in SparseRCNN.
Pretrained weights has been released.

Highlights

State-of-the-art transfer performance on dense prediction tasks.
Improving 1.6/1.2/1.0 AP than supervised ImageNet pretrain on Mask RCNN-C4/FPN/RetinaNet with COCO 1x schedule.
Comprehensively improving most instance-level detection and semantic segmentation tasks.

Pipeline

Performances

Install

Same as OpenSelfSup.

Codes

Pretext Task Pretrain

Coming Soon.

Transfer to Downstream tasks

We provide training scripts on COCO, because the performance of COCO is more stable than VOC and Cityscapes. See results in Table 3-5 and Table 13.

We provide Mask RCNN-C4, Mask RCNN-FPN and RetinaNet with 12k, 90k and 180k iterations.

First, you need to download model(.pkl) to benchmarks/detection/pths, and convert pretrain model to detectron2_version. See this script.

Second, start training and testing.

sh tools_local/dist_test_coco.sh $PTH $WORK_DIR

For example:

sh tools_local/dist_test_coco.sh benchmarks/detection/pths/detco_200ep_AA.pkl benchmarks/detection/work_dirs/detco_AA

Download Models

DetCo-200ep: [Google Drive], [Baidu Drive] Fetch Code: okfp

DetCo-200ep-AA: [Google Drive], [Baidu Drive] Fetch Code: fg7h

Citations

Please consider citing our paper in your publications if the project helps your research. BibTeX reference is as follows.

@misc{xie2021detco,
      title={DetCo: Unsupervised Contrastive Learning for Object Detection}, 
      author={Enze Xie and Jian Ding and Wenhai Wang and Xiaohang Zhan and Hang Xu and Zhenguo Li and Ping Luo},
      year={2021},
      eprint={2102.04803},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Acknowledges

We would like to thank Huawei AI Theory Group to support 200+ V100 GPUs for this research project without which this work would not be possible.

License

For academic use, this project is licensed under the 2-clause BSD License - see the LICENSE file for details. For commercial use, please contact the authors.

DetCo: Unsupervised Contrastive Learning for Object Detection

Related tags

Overview

DetCo: Unsupervised Contrastive Learning for Object Detection

News

Highlights

Pipeline

Performances

Install

Codes

Pretext Task Pretrain

Transfer to Downstream tasks

Download Models

Citations

Acknowledges

License

Owner

Enze Xie

A curated list of the latest breakthroughs in AI (in 2021) by release date with a clear video explanation, link to a more in-depth article, and code.

Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.

We have implemented shaDow-GNN as a general and powerful pipeline for graph representation learning. For more details, please find our paper titled Deep Graph Neural Networks with Shallow Subgraph Samplers, available on arXiv (https//arxiv.org/abs/2012.01380).

Unofficial PyTorch implementation of TokenLearner by Google AI

PiCIE: Unsupervised Semantic Segmentation using Invariance and Equivariance in clustering (CVPR2021)

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".

Weakly Supervised End-to-End Learning (NeurIPS 2021)

A criticism of a recent paper on buggy image downsampling methods in popular image processing and deep learning libraries.

Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.

Codes for "CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation"

Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)

Ankou: Guiding Grey-box Fuzzing towards Combinatorial Difference

[ICCV 2021 Oral] PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers

Sample Code for "Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL"

Code for EmBERT, a transformer model for embodied, language-guided visual task completion.

Code for the paper "Adapting Monolingual Models: Data can be Scarce when Language Similarity is High"

Image restoration with neural networks but without learning.

From Canonical Correlation Analysis to Self-supervised Graph Neural Networks

PyToch implementation of A Novel Self-supervised Learning Task Designed for Anomaly Segmentation