R3Det based on mmdet 2.19.0

Last update: Dec 15, 2022

Overview

R³Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object

Installation

# install mmdetection first if you haven't installed it yet. (Refer to mmdetection for details.)
pip install mmdet==2.19.0

# install r3det (Compiling rotated ops is a little time-consuming.)
pip install -r requirements.txt
pip install -v -e .

It is best to use an opencv-python version greater than 4.5.1, because its angle representation changed in 4.5.1. All experiments below were run with 4.5.3.
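As a minimal sketch, the version requirement above could be checked before training. The helper names `version_tuple` and `newer_than` are hypothetical and not part of this repository; the comparison is strict because the text asks for a version greater than 4.5.1.

```python
import re


def version_tuple(version):
    """Turn a string such as '4.5.3' into a tuple of ints, e.g. (4, 5, 3)."""
    return tuple(int(part) for part in re.findall(r"\d+", version)[:3])


def newer_than(version, minimum="4.5.1"):
    """Return True if `version` is strictly greater than `minimum`."""
    return version_tuple(version) > version_tuple(minimum)
```

After `import cv2`, calling `newer_than(cv2.__version__)` should tell you whether the installed build satisfies the recommendation.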

Quick Start

Please change the data paths in the configs to point to your own data.

# train
CUDA_VISIBLE_DEVICES=0 PORT=29500 \
./tools/dist_train.sh configs/rretinanet/rretinanet_obb_r50_fpn_1x_dota_v3.py 1

# submission
CUDA_VISIBLE_DEVICES=0 PORT=29500 \
./tools/dist_test.sh configs/rretinanet/rretinanet_obb_r50_fpn_1x_dota_v3.py \
        work_dirs/rretinanet_obb_r50_fpn_1x_dota_v3/epoch_12.pth 1 --format-only \
        --eval-options submission_dir=work_dirs/rretinanet_obb_r50_fpn_1x_dota_v3/Task1_results

For the DOTA dataset, please crop the original images into 1024×1024 patches with an overlap of 200 pixels by running:

python tools/split/img_split.py --base_json \
       tools/split/split_configs/dota1_0/ss_trainval.json

python tools/split/img_split.py --base_json \
       tools/split/split_configs/dota1_0/ss_test.json

Please change the paths in ss_trainval.json and ss_test.json to your own paths. (The split tool is forked from BboxToolkit, which is faster than DOTA_Devkit.)
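To illustrate what a 1024-pixel patch with a 200-pixel overlap means, here is a minimal sketch that computes the patch start offsets along one image axis. The function name `window_starts` is hypothetical, and shifting the last patch back so it ends at the image border is an assumption; how img_split.py actually treats the border is not described here.

```python
def window_starts(length, size=1024, gap=200):
    """Start offsets of sliding windows that cover [0, length) on one axis."""
    if length <= size:
        # The image is no larger than one patch along this axis.
        return [0]
    stride = size - gap  # 824 for the 1024/200 setting
    starts = list(range(0, length - size, stride))
    # Assumption: the final window is moved back so it ends at the border.
    starts.append(length - size)
    return starts
```

For a 2000-pixel side this yields offsets 0, 824 and 976, so the last two patches overlap by more than 200 pixels.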

Angle Representations

Three angle representations are built in and can be switched freely in the config.

v1 (from R³Det): [-PI/2, 0)
v2 (from S²ANet): [-PI/4, 3PI/4)
v3 (from OBBDetection): [-PI/2, PI/2)
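The ranges listed here can be made concrete with a small sketch that maps an arbitrary box (w, h, theta) into one of them. This is not the repository's implementation; `regularize` and `ANGLE_RANGES` are hypothetical names, and the sketch relies only on two geometric facts: rotating a rectangle by PI leaves it unchanged, and rotating it by PI/2 while swapping width and height also leaves it unchanged. Any additional width/height constraint a representation may impose is not modelled.

```python
import math

# Half-open angle ranges [low, high) in radians, one per representation.
ANGLE_RANGES = {
    "v1": (-math.pi / 2, 0.0),
    "v2": (-math.pi / 4, 3 * math.pi / 4),
    "v3": (-math.pi / 2, math.pi / 2),
}


def regularize(w, h, theta, version):
    """Return an equivalent (w, h, theta) with theta inside the chosen range."""
    low, high = ANGLE_RANGES[version]
    period = high - low  # PI/2 for v1, PI for v2 and v3
    # Whole periods to subtract so that theta falls in [low, high).
    k = math.floor((theta - low) / period)
    theta = theta - k * period
    # v1 shifts in quarter turns: an odd number of them swaps width and height.
    if version == "v1" and k % 2 != 0:
        w, h = h, w
    return w, h, theta
```

For example, the box (4, 2, 0) is outside the v1 range and becomes (2, 4, -PI/2), which describes the same rectangle.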

The three angle representations differ in poly2obb, obb2poly, obb2xyxy, obb2hbb, hbb2obb, etc. Following the three papers above, their box coders also differ:

DeltaXYWHAOBBoxCoder
- v1: None
- v2: Constrained angle + Projection of dx and dy + Normalized with PI
- v3: Constrained angle and length&width + Projection of dx and dy
DeltaXYWHAHBBoxCoder
- v1: None
- v2: Constrained angle + Normalized with PI
- v3: Constrained angle and length&width + Normalized with 2PI
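To show what "projection of dx and dy" and "normalized with PI" can mean in a delta coder, here is a rough sketch of an encoding step for boxes given as (cx, cy, w, h, theta). It is not the code of DeltaXYWHAOBBoxCoder: the function name `encode_obb_delta` and its parameters are hypothetical, the projection formula is the common rotate-into-the-proposal-frame form and is assumed rather than taken from this repository, and the angle and length/width constraints are left out.

```python
import math


def encode_obb_delta(proposal, target, project_xy=True, angle_norm=math.pi):
    """Encode `target` relative to `proposal`; boxes are (cx, cy, w, h, theta)."""
    px, py, pw, ph, pa = proposal
    gx, gy, gw, gh, ga = target
    ox, oy = gx - px, gy - py
    if project_xy:
        # Express the centre offset in the proposal's rotated frame.
        dx = (math.cos(pa) * ox + math.sin(pa) * oy) / pw
        dy = (-math.sin(pa) * ox + math.cos(pa) * oy) / ph
    else:
        # Plain axis-aligned offsets, as in horizontal box coders.
        dx, dy = ox / pw, oy / ph
    dw = math.log(gw / pw)
    dh = math.log(gh / ph)
    # Angle difference scaled by a normalisation constant (PI by default).
    da = (ga - pa) / angle_norm
    return dx, dy, dw, dh, da
```

With projection enabled, a shift along the proposal's own long axis shows up only in dx regardless of how the proposal is rotated, which is one plausible reason the choice of coder affects the regression targets.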

We believe that the different coders are the key reason for the different baselines reported in different papers. The good news is that all of the above coders can be switched freely in R3Det. In addition, R3Det provides 4 NMS ops and 3 IoU calculators for rotation detection, as follows:

nms.type
- v1: v1
- v2: v2
- v3: v3
- mmcv: mmcv
iou_calculator
- v1: RBboxOverlaps2D_v1
- v2: RBboxOverlaps2D_v2
- v3: RBboxOverlaps2D_v3
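As a hedged illustration, the options listed above could be kept consistent by deriving them from a single version string. Only the key names `nms`, `type` and `iou_calculator` and the listed values come from the text; the variable `angle_version` is hypothetical, and where these dicts sit inside a full config file is an assumption, so check an existing config under configs/ before copying this.

```python
# Hypothetical config fragment: the nesting inside a real config is assumed.
angle_version = 'v3'  # one of 'v1', 'v2', 'v3'

# NMS op matching the chosen angle representation ('mmcv' is also listed).
nms = dict(type=angle_version)

# IoU calculator matching the chosen angle representation.
iou_calculator = dict(type=f'RBboxOverlaps2D_{angle_version}')
```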

Performance

DOTA1.0 (Task1)

Model	Backbone	Lr schd	MS	RR	Angle	box AP	Official	Download
RRetinaNet HBB	R50-FPN	1x	-	-	v1	65.19	65.73	Baidu:0518/Google
RRetinaNet OBB	R50-FPN	1x	-	-	v3	68.20	69.40	Baidu:0518/Google
RRetinaNet OBB	R50-FPN	1x	-	-	v2	68.64	68.40	Baidu:0518/Google
R³Det	R50-FPN	1x	-	-	v1	70.41	70.66	Baidu:0518/Google
R³Det*	R50-FPN	1x	-	-	v1	70.86	-	Baidu:0518/Google

MS means multi-scale image splitting.
RR means random rotation.

Citation

@inproceedings{yang2021r3det,
    title={R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object},
    author={Yang, Xue and Yan, Junchi and Feng, Ziming and He, Tao},
    booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
    volume={35},
    number={4},
    pages={3163--3171},
    year={2021}
}

Owner

SJTU-Thinklab-Det
