Robust Partial Matching for Person Search in the Wild

Last update: Dec 18, 2022

Related tags

Overview

APNet for Person Search

Introduction

This is the code of Robust Partial Matching for Person Search in the Wild accepted in CVPR2020. The Align-to-Part Network(APNet) is proposed to alleviate the misalignment problem occurred in pedestrian detector, facilitating the downstream re-identification task. The code is based on maskrcnn-benchmark.

Quick start

Installation

Please follow the offical installation INSTALL.md. This code does not support the mixed precision training, so feel free to skip the installation of apex.

NOTE: If you meet some problems during the installation, you may find a solution in issues of official maskrcnn-benchmark.

Install APNet

git clone https://github.com/zhongyingji/APNet.git
cd APNet
rm -rf build/
python setup.py build develop

Dataset Preparation

Make sure you have downloaded the dataset of person search like PRW-v16.04.20.

Since the training of APNet relies on the keypoint annotation, we provide the keypoint estimation file by AlphaPose in keypoint_pred/. Copy all the files into the root dir of dataset, like /path_to_prw_dataset/PRW-v16.04.20/:

cp keypoint_pred/* /path_to_prw_dataset/PRW-v16.04.20/

Symlink the path to the dataset to datasets/ as follows:

ln -s /path_to_prw_dataset/PRW-v16.04.20/ maskrcnn_benchmark/datasets/PRW-v16.04.20

Training

APNet composes of three modules, OIM, RSFE and BBA. To train the entire network, you can simply run:

./train.sh

which contains the training scripts of the three modules.

NOTE: Both RSFE and BBA are required to be intialised with the trained OIM. For more details, please check train.sh.

You can alter the scripts in train.sh in the following aspects:

We train OIM on 2 GPUS with batchsize 4. If you encounter out-of-memory (OOM) error, reduce the batchsize by setting SOLVER.IMS_PER_BATCH to a smaller number.
If you want to use 1 GPU, replace the command of OIM with single GPU training script:

python tools/train_net.py --config-file "configs/reid/prw_R_50_C4.yaml" SOLVER.IMS_PER_BATCH 2 TEST.IMS_PER_BATCH 8 OUTPUT_DIR "models/prw_oim"

Test

After each of the module has been trained, you can run exactly the same training script of that module to test the performance.

Citation

If you find this work or code is helpful in your research, please consider citing:

Robust Partial Matching for Person Search in the Wild

Related tags

Overview

APNet for Person Search

Introduction

Quick start

Installation

Dataset Preparation

Training

Test

Citation

Owner

Yingji Zhong

Flexible Option Learning - NeurIPS 2021

Steerable discovery of neural audio effects

A production-ready, scalable Indexer for the Jina neural search framework, based on HNSW and PSQL

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning

Official implementation of Self-supervised Graph Attention Networks (SuperGAT), ICLR 2021.

An pytorch implementation of Masked Autoencoders Are Scalable Vision Learners

Semantic segmentation task for ADE20k & cityscapse dataset, based on several models.

The trained model and denoising example for paper : Cardiopulmonary Auscultation Enhancement with a Two-Stage Noise Cancellation Approach

Code for "Primitive Representation Learning for Scene Text Recognition" (CVPR 2021)

State of the Art Neural Networks for Generative Deep Learning

Rainbow DQN implementation that outperforms the paper's results on 40% of games using 20x less data 🌈

A-ESRGAN aims to provide better super-resolution images by using multi-scale attention U-net discriminators.

Learning Multiresolution Matrix Factorization and its Wavelet Networks on Graphs

Weakly Supervised Learning of Rigid 3D Scene Flow

Generalized Data Weighting via Class-level Gradient Manipulation

BisQue is a web-based platform designed to provide researchers with organizational and quantitative analysis tools for 5D image data. Users can extend BisQue by implementing containerized ML workflows.

P-Tuning v2: Prompt Tuning Can Be Comparable to Finetuning Universally Across Scales and Tasks

Moer Grounded Image Captioning by Distilling Image-Text Matching Model

Implementation of "Semi-supervised Domain Adaptive Structure Learning"

MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens