Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Last update: Apr 04, 2022

Related tags

Deep Learning FSAC

Overview

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Main requirements

torch >= 1.0

torchvision >= 0.2.0

Python 3

Environmental settings

This repository is developed using python 3.6.12 on Ubuntu 16.04.5 LTS. The CUDA and pytorch version is 11.2 and 1.7.1. We use one NVIDIA 3090 GPU card for training and testing.

Dataset

PASCAL VOC, Watercolor, Cityscapes, Foggycityscapes -> Please follow the instructions in [Link] to prepare the datasets.

Daytime-Sunny, Dusk-Rainy, and Night-Rainy -> Dataset preparation instruction link [Link].

Code

Faster R-CNN -> Thanks for jwyang [Link]; Fourier Domain Adaptation -> Thanks for Yanchao Yang [Link].

Our Augmentation (Mix+Replace+Extend+Disorder).

Train

To train a faster R-CNN model with vgg16 on pascal_voc:

CUDA_VISIBLE_DEVICES=$GPU_ID python trainval_net.py --dataset pascal_voc --net vgg16 --bs 1 --cuda

And you need to add augmentated data in the loadpath by creating a new dataset_name variable.

Test

To test:

python test_net.py --dataset pascal_voc --net vgg16 --modelpath your modelpath --cuda

Augmentation

Daytime-Sunny -> Dusk-Rainy

Daytime-Sunny -> Night-Rainy

Result

Results on adaptation from Cityscapes to FoggyCityscapes. ‘prsn’, ‘mcycl’, and ‘bcycl’ separately denote ‘person’, ‘motorcycle’, and ‘bicycle’ category.

Results on adaptation from Daytime-sunny to Duskrainy. Here, we directly run the released codes of the compared methods to obtain the results.

Results on Daytime-sunny → Night-rainy.

Results on the compound target domain.

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Related tags

Overview

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Main requirements

Environmental settings

Dataset

Code

Train

Test

Augmentation

Result

Owner

Share a benchmark that can easily apply reinforcement learning in Job-shop-scheduling

Malware Env for OpenAI Gym

ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for which no expressive speech corpus is available.

CTF challenges from redpwnCTF 2021

Analysis of Antarctica sequencing samples contaminated with SARS-CoV-2

A vanilla 3D face modeling on pose-invariant and multi-lightning image data

[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers

Neuron Merging: Compensating for Pruned Neurons (NeurIPS 2020)

SCU OlympicsRunning Baseline

RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

用强化学习DQN算法，训练AI模型来玩合成大西瓜游戏，提供Keras版本和PARL（paddle）版本

Data loaders and abstractions for text and NLP

Implements a fake news detection program using classifiers.

Faster RCNN pytorch windows

Syntax-Aware Action Targeting for Video Captioning

Codes for NeurIPS 2021 paper "Adversarial Neuron Pruning Purifies Backdoored Deep Models"

[NeurIPS 2021] Large Scale Learning on Non-Homophilous Graphs: New Benchmarks and Strong Simple Methods

Open standard for machine learning interoperability

Denoising Diffusion Probabilistic Models

An offline deep reinforcement learning library