MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Last update: Jan 07, 2023

Related tags

Overview

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

This repo is the official implementation of "MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation, Wenhao Li, Hong Liu, Hao Tang, Pichao Wang, Luc Van Gool" in PyTorch.

Dependencies

Cuda 11.1
Python 3.6
Pytorch 1.7.1

Dataset setup

Please download the dataset from Human3.6m website and refer to VideoPose3D to set up the Human3.6M dataset ('./dataset' directory).

${POSE_ROOT}/
|-- dataset
|   |-- data_3d_h36m.npz
|   |-- data_2d_h36m_cpn_ft_h36m_dbb.npz

Download pretrained model

The pretrained model can be found in Google_Drive, please download it and put in the './checkpoint' dictory.

Test the model

To test on pretrained model on Human3.6M:

python main.py --reload --previous_dir 'checkpoint/pretrained'

Here, we compare our MHFormer with recent state-of-the-art methods on Human3.6M dataset. Evaluation metric is Mean Per Joint Position Error (MPJPE) in mm.

Models	MPJPE
VideoPose3D	46.8
PoseFormer	44.3
MHFormer	43.0

Train the model

To train on Human3.6M:

python main.py --train

Citation

If you find our work useful in your research, please consider citing:

@article{li2021mhformer,
  title={MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation},
  author={Li, Wenhao and Liu, Hong and Tang, Hao and Wang, Pichao and Van Gool, Luc},
  journal={arXiv preprint},
  year={2021}
}

Acknowledgement

Our code is extended from the following repositories. We thank the authors for releasing the codes.

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Related tags

Overview

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Dependencies

Dataset setup

Download pretrained model

Test the model

Train the model

Citation

Acknowledgement

Owner

Vegetabird

Implementation of the method described in the Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.

Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.

An evaluation toolkit for voice conversion models.

Pointer-generator - Code for the ACL 2017 paper Get To The Point: Summarization with Pointer-Generator Networks

Protect against subdomain takeover

Multi-task Self-supervised Object Detection via Recycling of Bounding Box Annotations (CVPR, 2019)

SSD: A Unified Framework for Self-Supervised Outlier Detection [ICLR 2021]

TransPrompt - Towards an Automatic Transferable Prompting Framework for Few-shot Text Classification

[arXiv] What-If Motion Prediction for Autonomous Driving ❓🚗💨

Code for: Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification

This repository contains the code for the CVPR 2020 paper "Differentiable Volumetric Rendering: Learning Implicit 3D Representations without 3D Supervision"

MCMC samplers for Bayesian estimation in Python, including Metropolis-Hastings, NUTS, and Slice

[NeurIPS 2021] Large Scale Learning on Non-Homophilous Graphs: New Benchmarks and Strong Simple Methods

Stroke-predictions-ml-model - Machine learning model to predict individuals chances of having a stroke

This repo is to present various code demos on how to use our Graph4NLP library.

Self-Supervised Learning

Self-supervised learning (SSL) is a method of machine learning

Python project to take sound as input and output as RGB + Brightness values suitable for DMX

Disentangled Cycle Consistency for Highly-realistic Virtual Try-On, CVPR 2021

Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"