A simple baseline for 3d human pose estimation in tensorflow. Presented at ICCV 17.

Last update: Jan 03, 2023

Overview

3d-pose-baseline

This is the code for the paper

Julieta Martinez, Rayat Hossain, Javier Romero, James J. Little. A simple yet effective baseline for 3d human pose estimation. In ICCV, 2017. https://arxiv.org/pdf/1705.03098.pdf.

The code in this repository was mostly written by Julieta Martinez, Rayat Hossain and Javier Romero.

We provide a strong baseline for 3d human pose estimation that also sheds light on the challenges of current approaches. Our model is lightweight and we strive to make our code transparent, compact, and easy-to-understand.

Dependencies

Python ≥ 3.5
cdflib
tensorflow 1.0 or later

First of all

Watch our video: https://youtu.be/Hmi3Pd9x1BE
Clone this repository

git clone https://github.com/una-dinosauria/3d-pose-baseline.git
cd 3d-pose-baseline
mkdir -p data/h36m/

Get the data

Go to http://vision.imar.ro/human3.6m/, log in, and download the D3 Positions files for subjects [1, 5, 6, 7, 8, 9, 11], and put them under the folder data/h36m. Your directory structure should look like this

src/
README.md
LICENCE
...
data/
  └── h36m/
    ├── Poses_D3_Positions_S1.tgz
    ├── Poses_D3_Positions_S11.tgz
    ├── Poses_D3_Positions_S5.tgz
    ├── Poses_D3_Positions_S6.tgz
    ├── Poses_D3_Positions_S7.tgz
    ├── Poses_D3_Positions_S8.tgz
    └── Poses_D3_Positions_S9.tgz

Now, move to the data folder, and uncompress all the data

cd data/h36m/
for file in *.tgz; do tar -xvzf $file; done

Finally, download the code-v1.2.zip file, unzip it, and copy the metadata.xml file under data/h36m/

Now, your data directory should look like this:

data/
  └── h36m/
    ├── metadata.xml
    ├── S1/
    ├── S11/
    ├── S5/
    ├── S6/
    ├── S7/
    ├── S8/
    └── S9/

There is one little fix we need to run for the data to have consistent names:

mv h36m/S1/MyPoseFeatures/D3_Positions/TakingPhoto.cdf \
   h36m/S1/MyPoseFeatures/D3_Positions/Photo.cdf

mv h36m/S1/MyPoseFeatures/D3_Positions/TakingPhoto\ 1.cdf \
   h36m/S1/MyPoseFeatures/D3_Positions/Photo\ 1.cdf

mv h36m/S1/MyPoseFeatures/D3_Positions/WalkingDog.cdf \
   h36m/S1/MyPoseFeatures/D3_Positions/WalkDog.cdf

mv h36m/S1/MyPoseFeatures/D3_Positions/WalkingDog\ 1.cdf \
   h36m/S1/MyPoseFeatures/D3_Positions/WalkDog\ 1.cdf

And you are done!

Please note that we are currently not supporting SH detections anymore, only training from GT 2d detections is possible now.

Quick demo

For a quick demo, you can train for one epoch and visualize the results. To train, run

python src/predict_3dpose.py --camera_frame --residual --batch_norm --dropout 0.5 --max_norm --evaluateActionWise --epochs 1

This should take about <5 minutes to complete on a GTX 1080, and give you around 56 mm of error on the test set.

Now, to visualize the results, simply run

python src/predict_3dpose.py --camera_frame --residual --batch_norm --dropout 0.5 --max_norm --evaluateActionWise --epochs 1 --sample --load 24371

This will produce a visualization similar to this:

Training

To train a model with clean 2d detections, run:

python src/predict_3dpose.py --camera_frame --residual --batch_norm --dropout 0.5 --max_norm --evaluateActionWise

This corresponds to Table 2, bottom row. Ours (GT detections) (MA)

Citing

If you use our code, please cite our work

@inproceedings{martinez_2017_3dbaseline,
  title={A simple yet effective baseline for 3d human pose estimation},
  author={Martinez, Julieta and Hossain, Rayat and Romero, Javier and Little, James J.},
  booktitle={ICCV},
  year={2017}
}

Other implementations

Pytorch by @weigq
MXNet/Gluon by @lck1201

Extensions

@ArashHosseini maintains a fork for estimating 3d human poses using the 2d poses estimated by either OpenPose or tf-pose-estimation as input.

License

MIT

A simple baseline for 3d human pose estimation in tensorflow. Presented at ICCV 17.

Related tags

Overview

3d-pose-baseline

Dependencies

First of all

Quick demo

Training

Citing

Other implementations

Extensions

License

Owner

Julieta Martinez

Official PyTorch Implementation of Learning Architectures for Binary Networks

Some experiments with tennis player aging curves using Hilbert space GPs in PyMC. Only experimental for now.

PyTorch implementation code for the paper MixCo: Mix-up Contrastive Learning for Visual Representation

Optimizing synthesizer parameters using gradient approximation

mbrl-lib is a toolbox for facilitating development of Model-Based Reinforcement Learning algorithms.

Self-supervised Augmentation Consistency for Adapting Semantic Segmentation (CVPR 2021)

Continuous Conditional Random Field Convolution for Point Cloud Segmentation

Replication of Pix2Seq with Pretrained Model

DivNoising is an unsupervised denoising method to generate diverse denoised samples for any noisy input image. This repository contains the code to reproduce the results reported in the paper https://openreview.net/pdf?id=agHLCOBM5jP

Unofficial implementation of Perceiver IO: A General Architecture for Structured Inputs & Outputs

Code to accompany the paper "Finding Bipartite Components in Hypergraphs", which is published in NeurIPS'21.

This repository contains the reference implementation for our proposed Convolutional CRFs.

Offline Reinforcement Learning with Implicit Q-Learning

Exemplo de implementação do padrão circuit breaker em python

Official implementation of Few-Shot and Continual Learning with Attentive Independent Mechanisms

natural image generation using ConvNets

This repo is about to create the Streamlit application for given ML model.

Fast and Simple Neural Vocoder, the Multiband RNNMS

PINN(s): Physics-Informed Neural Network(s) for von Karman vortex street

PyTorch implementation of Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets