4st place solution for the PBVS 2022 Multi-modal Aerial View Object Classification Challenge - Track 1 (SAR) at PBVS2022

Last update: Nov 09, 2022

Overview

A Two-Stage Shake-Shake Network for Long-tailed Recognition of SAR Aerial View Objects

4st place solution for the PBVS 2022 Multi-modal Aerial View Object Classification Challenge - Track 1 (SAR)

Challenge Site

Overview

Synthetic Aperture Radar (SAR) has received more attention due to its complementary superiority on capturing significant information in the remote sensing area. However, for an Aerial View Object Classification (AVOC) task, SAR images still suffer from the long-tailed distribution of the aerial view objects. This disparity dampens the performance of classification methods, especially for the datasensitive deep learning models. In this paper, we propose a two-stage shake-shake network to tackle the long-tailed learning problem. Specifically, it decouples the learning procedure into the representation learning stage and the classification learning stage. Moreover, we apply the test time augmentation (TTA) and a post-processing approach (CAN) to improve the accuracy. In the PBVS 2022 Multi-modal Aerial View Object Classification Challenge Track 1, our method achieves 21.82% and 27.97% accuracy in the development phase and testing phase respectively, which achieves the top-tier among all the participants.

Requirements

Ubuntu (It's only tested on Ubuntu, so it may not work on Windows.)
Python >= 3.7
PyTorch >= 1.4.0
torchvision
```
pip install -r requirements.txt
```

Usage

The first stage training

python train.py --config ./configs/sar10/shake_shake.yaml

You need to change the value of “dataset_dir”, “dataset_dir_val”, under the “dataset” field and “output_dir” under the “train” field in the file “./configs/sar10/shake_shake.yaml”。

The second stage training

python train.py --config ./configs/sar10/shake_shake_fc.yaml

You need to change the value of “dataset_dir”, “dataset_dir_val” under the “dataset” field and “output_dir”, “checkpoint” under the “train” field in the file “./configs/sar10/shake_shake_fc.yaml”。

Test

python predict_TTA.py

You need to change the value of “dataset_dir”, “checkpoint”, under the “test” field in the file “./configs/sar10/shake_shake.yaml”, then you can find the results in file “.result/results.csv”。
You can download the trained model here.

Acknowledge

The codes borrow heavily from hysts/pytorch_image_classification.

4st place solution for the PBVS 2022 Multi-modal Aerial View Object Classification Challenge - Track 1 (SAR) at PBVS2022

Related tags

Overview

A Two-Stage Shake-Shake Network for Long-tailed Recognition of SAR Aerial View Objects

Overview

Requirements

Usage

The first stage training

The second stage training

Test

Acknowledge

Owner

LinpengPan

A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling in Python.

Space Invaders For Python

[ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets"

Node-level Graph Regression with Deep Gaussian Process Models

This is the code for Compressing BERT: Studying the Effects of Weight Pruning on Transfer Learning

SPT_LSA_ViT - Implementation for Visual Transformer for Small-size Datasets

Official implementation for "Style Transformer for Image Inversion and Editing" (CVPR 2022)

HEAM: High-Efficiency Approximate Multiplier Optimization for Deep Neural Networks

Pytorch implementation code for [Neural Architecture Search for Spiking Neural Networks]

Code for the TPAMI paper: "Syntax Customized Video Captioning by Imitating Exemplar Sentences"

Deep Sketch-guided Cartoon Video Inbetweening

The world's largest toxicity dataset.

Code release for Hu et al. Segmentation from Natural Language Expressions. in ECCV, 2016

The source code for CATSETMAT: Cross Attention for Set Matching in Bipartite Hypergraphs

Aerial Imagery dataset for fire detection: classification and segmentation (Unmanned Aerial Vehicle (UAV))

[TPAMI 2021] iOD: Incremental Object Detection via Meta-Learning

Some tentative models that incorporate label propagation to graph neural networks for graph representation learning in nodes, links or graphs.

Code for "Diversity can be Transferred: Output Diversification for White- and Black-box Attacks"

Le dataset des images du projet d'IA de 2021

Imagededup - 😎 Finding duplicate images made easy