Keras implementation of PersonLab for Multi-Person Pose Estimation and Instance Segmentation.

Last update: Dec 21, 2022

Overview

PersonLab

This is a Keras implementation of PersonLab for Multi-Person Pose Estimation and Instance Segmentation. The model predicts keypoint heatmaps and several offset fields, from which joint locations, joint connections, and per-pixel instance IDs are computed. See the paper for more details.
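To make the role of the heatmaps and offsets concrete, here is a minimal sketch of the voting idea for a single keypoint type. It is a simplified illustration, not the code in this repository: the function names (hough_vote, argmax_2d), the (dy, dx) offset layout, the threshold, and the nearest-pixel voting are all assumptions made for this example, and the paper describes a smoother accumulation scheme.

```python
def hough_vote(heatmap, offsets, threshold=0.5):
    """Accumulate heatmap probabilities at the pixels the offsets point to.

    heatmap: H x W nested list of probabilities for one keypoint type.
    offsets: H x W nested list of (dy, dx) offsets in pixels (assumed layout).
    Returns an H x W score map; peaks are candidate keypoint locations.
    """
    height, width = len(heatmap), len(heatmap[0])
    scores = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            p = heatmap[y][x]
            if p < threshold:
                continue  # low-confidence pixels do not vote
            dy, dx = offsets[y][x]
            ty, tx = int(round(y + dy)), int(round(x + dx))
            if 0 <= ty < height and 0 <= tx < width:
                scores[ty][tx] += p
    return scores


def argmax_2d(scores):
    """Return the (row, col) of the largest value in a nested list."""
    best = max((v, y, x) for y, row in enumerate(scores) for x, v in enumerate(row))
    return best[1], best[2]


# Toy example: a 5 x 5 map with one keypoint at row 2, column 3.
H, W = 5, 5
target = (2, 3)
heatmap = [[0.0] * W for _ in range(H)]
offsets = [[(0.0, 0.0)] * W for _ in range(H)]
for y in range(H):
    for x in range(W):
        if abs(y - target[0]) <= 1 and abs(x - target[1]) <= 1:
            heatmap[y][x] = 0.9
            offsets[y][x] = (target[0] - y, target[1] - x)

scores = hough_vote(heatmap, offsets)
peak = argmax_2d(scores)
print(peak)  # (2, 3)
```

Every confident pixel near the keypoint casts its vote at the same location, so the score map has a much sharper peak than the raw heatmap.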

Training a model

If you want to use ResNet-101 as the base network, first download the ImageNet initialization weights from here and copy them to your ~/.keras/models/ directory. (Files over 100 MB cannot be hosted on GitHub.)

Next, construct the dataset in the correct format by running the generate_hdf5.py script. Before running it, set the ANNO_FILE and IMG_DIR constants at the top of the script to the paths of the COCO person_keypoints annotation file and the image folder, respectively.
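As an illustration, the two constants might look like the following. The paths are hypothetical placeholders, and the missing_paths helper is not part of generate_hdf5.py; it is only a suggested sanity check to run before starting the conversion.

```python
import os

# Hypothetical paths; replace them with the locations on your machine.
ANNO_FILE = '/data/coco/annotations/person_keypoints_train2017.json'
IMG_DIR = '/data/coco/train2017/'


def missing_paths(anno_file, img_dir):
    """Return a list of problems; the list is empty if both paths look usable."""
    problems = []
    if not os.path.isfile(anno_file):
        problems.append('annotation file not found: ' + anno_file)
    if not os.path.isdir(img_dir):
        problems.append('image folder not found: ' + img_dir)
    return problems


for problem in missing_paths(ANNO_FILE, IMG_DIR):
    print(problem)
```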

Edit config.py to set the training options, e.g. input resolution, number of GPUs, and whether to freeze the batchnorm weights. More advanced options require altering the train.py script. For example, the base network can be changed by adding an argument to the get_personlab() function; see the documentation there.

After everything is configured to your liking, run the train.py script.

Testing a model

See the demo.ipynb for sample inference and visualizations.

Technical Debts

Several parts of this codebase are borrowed from others. These include:

The Resnet-101 in Keras
The augmentation code (which is different from the procedure in the PersonLab paper) and the data iterator code are heavily borrowed from this fork of the Keras implementation of CMU's "Realtime Multi-Person Pose Estimation". (The pose plotting function is also influenced by the one in that repo.)
The Polyak Averaging callback is a lightly modified version of the EMA callback from here.
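For readers unfamiliar with the idea behind the averaging callback, the sketch below shows an exponential moving average (EMA) of weights in plain Python. It is not the callback used in this repository: the WeightEMA class, its method names, and the decay value are assumptions chosen for illustration, and weights are represented as a flat list of floats rather than Keras tensors.

```python
class WeightEMA(object):
    """Hypothetical helper that tracks an exponential moving average of weights."""

    def __init__(self, weights, decay=0.999):
        self.decay = decay
        self.shadow = list(weights)  # averaged copy, kept separate from the live weights

    def update(self, weights):
        """Blend the current weights into the running average and return it."""
        d = self.decay
        self.shadow = [d * s + (1.0 - d) * w for s, w in zip(self.shadow, weights)]
        return self.shadow


# Toy example with a small decay so the effect is visible after two steps.
ema = WeightEMA([0.0, 1.0], decay=0.5)
ema.update([1.0, 1.0])
ema.update([1.0, 1.0])
print(ema.shadow)  # [0.75, 1.0]
```

In a training loop, update would typically be called after each batch, and the averaged weights would be used for evaluation in place of the live ones.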

Environment

This code was tested in the following environment and with the following software versions:

Ubuntu 16.04
CUDA 8.0 with cuDNN 6.0
Python 2.7
TensorFlow 1.7
Keras 2.1.3
OpenCV 2.4.9

Owner

OCTI
