Unsupervised Domain Adaptation for Nighttime Aerial Tracking (CVPR2022)

Last update: Dec 30, 2022

Related tags

Overview

Unsupervised Domain Adaptation for Nighttime Aerial Tracking (CVPR2022)

Junjie Ye, Changhong Fu, Guangze Zheng, Danda Pani Paudel, and Guang Chen. Unsupervised Domain Adaptation for Nighttime Aerial Tracking. In CVPR, pages 1-10, 2022.

Overview

UDAT is an unsupervised domain adaptation framework for visual object tracking. This repo contains its Python implementation.

Paper | NAT2021 benchmark

Testing UDAT

1. Preprocessing

Before training, we need to preprocess the unlabelled training data to generate training pairs.

Download the proposed NAT2021-train set

Customize the directory of the train set in lowlight_enhancement.py and enhance the nighttime sequences

cd preprocessing/
python lowlight_enhancement.py # enhanced sequences will be saved at '/YOUR/PATH/NAT2021/train/data_seq_enhanced/'

Download the video saliency detection model here and place it at preprocessing/models/checkpoints/.

Predict salient objects and obtain candidate boxes

python inference.py # candidate boxes will be saved at 'coarse_boxes/' as .npy

Generate pseudo annotations from candidate boxes using dynamic programming

python gen_seq_bboxes.py # pseudo box sequences will be saved at 'pseudo_anno/'

Generate cropped training patches and a JSON file for training
```
python par_crop.py
python gen_json.py
```

2. Train

Take UDAT-CAR for instance.

Apart from above target domain dataset NAT2021, you need to download and prepare source domain datasets VID and GOT-10K.
Download the pre-trained daytime model (SiamCAR/SiamBAN) and place it at UDAT/tools/snapshot.

Start training

cd UDAT/CAR
export PYTHONPATH=$PWD
python tools/train.py

3. Test

Take UDAT-CAR for instance.

For quick test, you can download our trained model for UDAT-CAR (or UDAT-BAN) and place it at UDAT/CAR/experiments/udatcar_r50_l234.
Start testing
```
python tools/test.py --dataset NAT
```

4. Eval

Start evaluating
```
python tools/eval.py --dataset NAT
```

Demo

Reference

@Inproceedings{Ye2022CVPR,

title={{Unsupervised Domain Adaptation for Nighttime Aerial Tracking}},

author={Ye, Junjie and Fu, Changhong and Zheng, Guangze and Paudel, Danda Pani and Chen, Guang},

booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},

year={2022},

pages={1-10}

}

Acknowledgments

We sincerely thank the contribution of following repos: SiamCAR, SiamBAN, DCFNet, DCE, and USOT.

Contact

If you have any questions, please contact Junjie Ye at [email protected] or Changhong Fu at [email protected].

Unsupervised Domain Adaptation for Nighttime Aerial Tracking (CVPR2022)

Related tags

Overview

Unsupervised Domain Adaptation for Nighttime Aerial Tracking (CVPR2022)

Overview

Testing UDAT

1. Preprocessing

2. Train

3. Test

4. Eval

Demo

Reference

Acknowledgments

Contact

Owner

Intelligent Vision for Robotics in Complex Environment

Animal Sound Classification (Cats Vrs Dogs Audio Sentiment Classification)

Adaptation through prediction: multisensory active inference torque control

Music source separation is a task to separate audio recordings into individual sources

Official code for "Decoupling Zero-Shot Semantic Segmentation"

Tech Resources for Academic Communities

A machine learning project which can detect and predict the skin disease through image recognition.

Repository for "Exploring Sparsity in Image Super-Resolution for Efficient Inference", CVPR 2021

Implementation of experiments in the paper Clockwork Variational Autoencoders (project website) using JAX and Flax

A font family with a great monospaced variant for programmers.

Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).

Implementation of our NeurIPS 2021 paper "A Bi-Level Framework for Learning to Solve Combinatorial Optimization on Graphs".

Large scale embeddings on a single machine.

In this project we use both Resnet and Self-attention layer for cat, dog and flower classification.

Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"

ULMFiT for Genomic Sequence Data

Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.

GPT, but made only out of gMLPs

A facial recognition doorbell system using a Raspberry Pi

CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images

Gas detection for Raspberry Pi using ADS1x15 and MQ-2 sensors