An implementation of RetinaNet in PyTorch.

Last update: Jan 04, 2023

Overview

RetinaNet

Installation
Training
Evaluation
Todo
Credits

Installation

Install PyTorch and torchvision.
For faster data augmentation, install pillow-simd:

pip uninstall -y pillow
pip install pillow-simd
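
To confirm what ended up installed, a small check along these lines may help. It is only a sketch: the helper names are illustrative and not part of this repository, and it assumes that pillow-simd releases carry a `.postN` suffix in their version string, which is a heuristic rather than a guarantee.

```python
import importlib.util


def looks_like_pillow_simd(version: str) -> bool:
    """Heuristic (assumption): pillow-simd versions look like '9.0.0.post1'."""
    return ".post" in version


def installed_packages(names=("torch", "torchvision", "PIL")):
    """Map each top-level module name to whether it can be found, without importing it."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    status = installed_packages()
    for name, found in status.items():
        print(f"{name}: {'found' if found else 'missing'}")
    if status["PIL"]:
        import PIL

        kind = "pillow-simd" if looks_like_pillow_simd(PIL.__version__) else "stock Pillow"
        print(f"Pillow {PIL.__version__} looks like {kind}")
```

Run it with the same interpreter you will use for training, so the result reflects the right environment.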

Training

COCO 2017

First, install pycocotools:

git clone https://github.com/pdollar/coco/
cd coco/PythonAPI
make
python setup.py install
cd ../..
rm -r coco

Then download COCO 2017 into ./datasets/COCO/:

cd datasets
mkdir COCO
cd COCO

If you're using wget:

wget http://images.cocodataset.org/zips/train2017.zip &&
wget http://images.cocodataset.org/zips/val2017.zip &&
wget http://images.cocodataset.org/annotations/annotations_trainval2017.zip

If you're using aria2c (recommended for higher-bandwidth connections and for resumable downloads), tune the maximum number of concurrent downloads (-j) and connections per server (-x) as needed:

aria2c -x 10 -j 10 http://images.cocodataset.org/zips/train2017.zip &&
aria2c -x 10 -j 10 http://images.cocodataset.org/zips/val2017.zip &&
aria2c -x 10 -j 10 http://images.cocodataset.org/annotations/annotations_trainval2017.zip

# Quote the pattern so that unzip, not the shell, expands it across all archives.
unzip '*.zip'
rm *.zip
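
Before starting a long training run, it can be worth checking that the archives unpacked as expected. The sketch below looks for the directories and annotation files that the three official archives contain. Whether train_coco.py reads exactly these paths is an assumption, and `missing_coco_paths` is a hypothetical helper, not part of this repository.

```python
from pathlib import Path

# Entries produced by unpacking the three COCO 2017 archives listed above.
EXPECTED_COCO_PATHS = (
    "train2017",
    "val2017",
    "annotations/instances_train2017.json",
    "annotations/instances_val2017.json",
)


def missing_coco_paths(root):
    """Return the expected entries that do not exist under `root`."""
    root = Path(root)
    return [rel for rel in EXPECTED_COCO_PATHS if not (root / rel).exists()]


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    missing = missing_coco_paths("datasets/COCO")
    print("COCO layout looks complete" if not missing else f"Missing: {missing}")
```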

Then just run:

python train_coco.py

Pascal VOC

Download Pascal VOC 2007 and 2012 into ./datasets/VOC/:

cd datasets
mkdir VOC
cd VOC

If you're using wget:

wget http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtrainval_06-Nov-2007.tar &&
wget http://host.robots.ox.ac.uk/pascal/VOC/voc2012/VOCtrainval_11-May-2012.tar &&
wget http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtest_06-Nov-2007.tar

If you're using aria2c:

aria2c -x 10 -j 10 http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtrainval_06-Nov-2007.tar &&
aria2c -x 10 -j 10 http://host.robots.ox.ac.uk/pascal/VOC/voc2012/VOCtrainval_11-May-2012.tar &&
aria2c -x 10 -j 10 http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtest_06-Nov-2007.tar

# tar reads one archive per invocation, so extract each file in turn.
for f in *.tar; do tar xf "$f"; done
rm *.tar
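
If you would rather extract from Python, for example on a machine without a tar binary, the following sketch unpacks every archive in a directory one at a time. `extract_all_tars` is an illustrative name, not part of this repository. The official VOC archives all unpack into a shared VOCdevkit/ directory.

```python
import tarfile
from pathlib import Path


def extract_all_tars(directory):
    """Extract every .tar file in `directory` into it, one archive at a time."""
    directory = Path(directory)
    # Use the safer "data" extraction filter where this Python version supports it.
    extra = {"filter": "data"} if hasattr(tarfile, "data_filter") else {}
    extracted = []
    for archive in sorted(directory.glob("*.tar")):
        with tarfile.open(archive) as tar:
            tar.extractall(directory, **extra)
        extracted.append(archive.name)
    return extracted


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    print(extract_all_tars("datasets/VOC"))
```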

Then just run:

python train_voc.py

Custom Dataset

Lots to write here. 😉

Evaluation

To run a trained model on a single image:

python eval.py [checkpoint_path] [image_path]

This will create an image (output.jpg) with bounding box annotations.
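
Detections usually need a little cleanup before they are drawn. The sketch below shows one common approach: drop low-scoring boxes and clip the rest to the image bounds. It is not taken from eval.py. The tuple layout `(x1, y1, x2, y2, score)`, the `min_score` default, and the function name are all assumptions made for illustration.

```python
def boxes_to_draw(detections, image_size, min_score=0.5):
    """Filter (x1, y1, x2, y2, score) tuples by score and clip them to the image."""
    width, height = image_size
    kept = []
    for x1, y1, x2, y2, score in detections:
        if score < min_score:
            continue  # too uncertain to display
        # Clamp each coordinate into [0, width] or [0, height].
        x1, x2 = max(0, min(x1, width)), max(0, min(x2, width))
        y1, y2 = max(0, min(y1, height)), max(0, min(y2, height))
        # Skip boxes that collapse to zero area after clipping.
        if x2 > x1 and y2 > y1:
            kept.append((x1, y1, x2, y2, score))
    return kept
```

The surviving tuples can then be passed to whatever drawing routine produces the annotated image.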

Todo

Finish converting the COCO dataset class to work with batches.
Train COCO 2017 for 90,000 iterations and save a reusable checkpoint.
Try training on Pascal VOC and add download instructions.
Produce bounding box outputs for a few sanity check images.
Upload trained weights to GitHub releases.
Train on the 🔮 magic proprietary dataset ✨ .

Owner

Conner Vercellino
