YOLOv5 in DOTA with CSL_label. (Oriented Object Detection) (Rotation Detection) (Rotated BBox)

Last update: Dec 30, 2022

Overview

YOLOv5_DOTA_OBB

YOLOv5 on the DOTA_OBB dataset with CSL_label (Oriented Object Detection).
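
For readers unfamiliar with CSL (Circular Smooth Label), the general idea is to treat the angle as a classification target over discrete bins and to smooth the one-hot label with a window that wraps around the ends of the angle range. The following is a minimal sketch of that idea, not the code used in this repository: the function name, the Gaussian window, and the values of radius and sigma are assumptions chosen for illustration.

```python
import math


def circular_smooth_label(theta, num_bins=180, radius=6, sigma=2.0):
    """Build a smoothed, circular label vector for an integer angle bin.

    theta    : target angle bin, 0 <= theta < num_bins
    num_bins : number of angle classes (180 matches the range [0, 180))
    radius   : window half-width in bins (hypothetical value)
    sigma    : Gaussian standard deviation in bins (hypothetical value)
    """
    label = [0.0] * num_bins
    for b in range(num_bins):
        # Circular distance: bin 179 is a neighbour of bin 0.
        d = abs(b - theta) % num_bins
        d = min(d, num_bins - d)
        if d <= radius:
            label[b] = math.exp(-(d * d) / (2.0 * sigma * sigma))
    return label


if __name__ == "__main__":
    lab = circular_smooth_label(0)
    # Bins on both sides of the wrap-around receive the same weight.
    print(lab[0], lab[1], lab[179])
```

Because the window wraps around, an angle near 0 and an angle near 179 produce overlapping labels, which is the property that distinguishes this encoding from a plain one-hot angle class.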

Datasets and pretrained checkpoint

Datasets: DOTA
Pretrained checkpoints and demo files:
- train,detect_and_evaluate_demo_files.(6666)
- yolov5x.pt.(6666)
- yolov5l.pt.(6666)
- yolov5m.pt.(6666)
- yolov5s.pt.(6666)
- YOLOv5_DOTAv1.5_OBB.pt.(6666)

Function

train.py. Trains the model.
detect.py. Runs detection, visualizes the results, and writes the detection result txt files.
evaluation.py. Merges the detection results, visualizes them, and evaluates the detector.

Installation (Linux recommended, Windows not recommended)

1. Python 3.8 or later with all requirements.txt dependencies installed, including torch>=1.7. To install run:

$   pip install -r requirements.txt

2. Install swig

$   cd  \.....\yolov5_DOTA_OBB\utils
$   sudo apt-get install swig

3. Build the C++ extension for Python

$   swig -c++ -python polyiou.i
$   python setup.py build_ext --inplace

More detailed explanation

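For details on the implementation and the principles behind it, see my Zhihu article:
YOLOv5_DOTAv1.5 (Rotated Object Detection in Remote Sensing: A Complete Record of Pitfalls).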

Usage Example

1. 'Get Dataset'

Split the DOTA_OBB images and labels, then convert the labels from the DOTA format to the YOLO longside format.
You can refer to hukaixuan19970627/DOTA_devkit_YOLO.
The oriented YOLO longside format is:

$  classid    x_c   y_c   longside   shortside    Θ    Θ∈[0, 180)


* longside: The longest side of the oriented rectangle.

* shortside: The other side of the oriented rectangle.

* Θ: The angle between the longside and the x-axis, measured as the angle the x-axis sweeps when rotated clockwise until it meets the longest side.

WARNING: Images must be square (height = width).
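
To make the longside format concrete, here is a minimal sketch that turns one label (x_c, y_c, longside, shortside, Θ) into the four corner points of the rotated box. It is an illustration only, not code from this repository. It assumes image coordinates with the y-axis pointing down, so that a clockwise rotation of the x-axis by Θ gives the direction (cos Θ, sin Θ); it also assumes all values are in the same units (for example pixels). The function name longside_to_corners is hypothetical.

```python
import math


def longside_to_corners(x_c, y_c, longside, shortside, theta_deg):
    """Return the 4 corners of a box given in longside format.

    Assumes image coordinates (y grows downward) and theta_deg in [0, 180),
    measured clockwise from the x-axis to the long side.
    """
    t = math.radians(theta_deg)
    ux, uy = math.cos(t), math.sin(t)   # unit vector along the long side
    vx, vy = -uy, ux                    # unit vector along the short side
    hl, hs = longside / 2.0, shortside / 2.0
    return [
        (x_c + hl * ux + hs * vx, y_c + hl * uy + hs * vy),
        (x_c - hl * ux + hs * vx, y_c - hl * uy + hs * vy),
        (x_c - hl * ux - hs * vx, y_c - hl * uy - hs * vy),
        (x_c + hl * ux - hs * vx, y_c + hl * uy - hs * vy),
    ]


if __name__ == "__main__":
    # A 4 x 2 box centred at (10, 10) with its long side along the x-axis.
    for corner in longside_to_corners(10, 10, 4, 2, 0):
        print(corner)
```

With Θ = 0 the long side lies along the x-axis; with Θ = 90 the same box stands upright, with its long side along the y-axis.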

2. 'train.py'

Usage is the same as ultralytics/yolov5. It is best to train on the demo files first before training on your custom dataset.
Single GPU training:

$ python train.py  --batch-size 4 --device 0

Multi GPU training (DistributedDataParallel mode):

$ python -m torch.distributed.launch --nproc_per_node 4 train.py --sync-bn --device 0,1,2,3

3. 'detect.py'

Download the demo files, then run the demo to visualize the detection results and write the result txt files.

$  python detect.py

4. 'evaluation.py'

Run the detect.py demo first, then replace the paths below with your own:

evaluation(
        detoutput=r'/....../DOTA_demo_view/detection',
        imageset=r'/....../DOTA_demo_view/row_images',
        annopath=r'/....../DOTA_demo_view/row_DOTA_labels/{:s}.txt'
)
draw_DOTA_image(
        imgsrcpath=r'/...../DOTA_demo_view/row_images',
        imglabelspath=r'/....../DOTA_demo_view/detection/result_txt/result_merged',
        dstpath=r'/....../DOTA_demo_view/detection/merged_drawed'
)

Run the evaluation.py demo to get the evaluation result and to visualize the merged detection results.

$  python evaluation.py

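Feedback

If you run into any problems while using this project, feel free to let me know. You can reach me through the following channels:

Zhihu (@略略略)
For code problems, please open an issue; for other questions, please contact me on Zhihu.

Acknowledgements

Thanks to the following projects (in no particular order):

About the author

  Name  : "胡凯旋" (Hu Kaixuan)
  Describe myself: "Just a slacker"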
