HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation

Last update: Dec 04, 2022

Related tags

Overview

HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation

Official PyTroch implementation of HPRNet.

HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation,
Nermin Samet, Emre Akbas,
Under review. (arXiv pre-print)

Highlights

HPRNet is a bottom-up, one-stage and hierarchical keypoint regression method for whole-body pose estimation.
HPRNet has the best performance among bottom-up methods for all the whole-body parts.
HPRNet achieves SOTA performance for the face (76.0 AP) and hand (51.2 AP) keypoint estimation.
Unlike two-stage methods, HPRNet predicts whole-body pose in a constant time independent of the number of people in an image.

COCO-WholeBody Keypoint Estimation Results

Model	Body AP	Foot AP	Face AP	Hand AP	Whole-body AP	Download
HPRNet (DLA)	55.2 / 57.1	49.1 / 50.7	74.6 / 75.4	47.0 / 48.4	31.5 / 32.7	model
HPRNet (Hourglass)	59.4 / 61.1	53.0 / 53.9	75.4 / 76.0	50.4 / 51.2	34.8 / 34.9	model

Results are presented without and with test time flip augmentation respectively.
All models are trained on COCO-WholeBody train2017 and evaluated on val2017.
The models can be downloaded directly from Google drive.

Installation

[Optional but recommended] create a new conda environment.
```
conda create --name HPRNet python=3.7
```
And activate the environment.
```
conda activate HPRNet
```

Clone the repo:

HPRNet_ROOT=/path/to/clone/HPRNet
git clone https://github.com/nerminsamet/HPRNet $HPRNet_ROOT

Install PyTorch 1.4.0:

conda install pytorch torchvision cudatoolkit=10.0 -c pytorch

Install the requirements:
```
pip install -r requirements.txt
```

Compile DCNv2 (Deformable Convolutional Networks):

cd $HPRNet_ROOT/src/lib/models/networks/DCNv2
./make.sh

Dataset preparation

Download the images (2017 Train, 2017 Val) from coco website.

Download train and val annotation files.

${COCO_PATH}
|-- annotations
    |-- coco_wholebody_train_v1.0.json
    |-- coco_wholebody_val_v1.0.json
|-- images
    |-- train2017
    |-- val2017

Evaluation and Training

You could find all the evaluation and training scripts in the experiments folder.
For evaluation, please download the pretrained models you want to evaluate and put them in HPRNet_ROOT/models/.
In the case that you don't have 4 GPUs, you can follow the linear learning rate rule to adjust the learning rate.
If the training is terminated before finishing, you can use the same command with --resume to resume training.

Acknowledgement

The numerical calculations reported in this paper were fully performed at TUBITAK ULAKBIM, High Performance and Grid Computing Center (TRUBA resources).

License

HPRNet is released under the MIT License (refer to the LICENSE file for details).

Citation

If you find HPRNet useful for your research, please cite our paper as follows:

N. Samet, E. Akbas, "HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation", arXiv, 2021.

BibTeX entry:

@misc{hprnet,
      title={HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation}, 
      author={Nermin Samet and Emre Akbas},
      year={2021}, 
}

HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation

Related tags

Overview

HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation

Highlights

COCO-WholeBody Keypoint Estimation Results

Installation

Dataset preparation

Evaluation and Training

Acknowledgement

License

Citation

Owner

Nermin Samet

RM Operation can equivalently convert ResNet to VGG, which is better for pruning; and can help RepVGG perform better when the depth is large.

September-Assistant - Open-source Windows Voice Assistant

EPSANet：An Efficient Pyramid Split Attention Block on Convolutional Neural Network

Pytorch implementation for the Temporal and Object Quantification Networks (TOQ-Nets).

[AAAI2022] Source code for our paper《Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning》

Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network)

Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.

Campsite Reservation Finder

OpenFed: A Comprehensive and Versatile Open-Source Federated Learning Framework

Federated Learning Based on Dynamic Regularization

Interactive dimensionality reduction for large datasets

A simple, fully convolutional model for real-time instance segmentation.

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation in TensorFlow 2

BlueFog Tutorials

Official code for "Decoupling Zero-Shot Semantic Segmentation"

Zsseg.baseline - Zero-Shot Semantic Segmentation

A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

Ejemplo Algoritmo Viterbi - Example of a Viterbi algorithm applied to a hidden Markov model on DNA sequence

[ICCV2021] Learning to Track Objects from Unlabeled Videos

Strongly local p-norm-cut algorithms for semi-supervised learning and local graph clustering