Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device" @ CAD&Graphics2019

Last update: Dec 01, 2022

Overview

PortraitNet

Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device". @ CAD&Graphics 2019

Introduction

We propose a real-time portrait segmentation model, called PortraitNet, that can run effectively and efficiently on mobile device. PortraitNet is based on a lightweight U-shape architecture with two auxiliary losses at the training stage, while no additional cost is required at the testing stage for portrait inference.

Portrait segmentation applications on mobile device.

Experimental setup

Requirements

python 2.7
PyTorch 0.3.0.post4
Jupyter Notebook
pip install easydict matplotlib tqdm opencv-python scipy pyyaml numpy

Download datasets

EG1800 Since several image URL links are invalid in the original EG1800 dataset, we finally use 1447 images for training and 289 images for validation.
Supervise-Portrait Supervise-Portrait is a portrait segmentation dataset collected from the public human segmentation dataset Supervise.ly using the same data process as EG1800.

Training

Network Architecture

Overview of PortraitNet.

Training Steps

Download the datasets (EG1800 or Supervise-Portriat). If you want to training at your own dataset, you need to modify data/datasets.py and data/datasets_portraitseg.py.
Prepare training/testing files, like data/select_data/eg1800_train.txt and data/select_data/eg1800_test.txt.
Select and modify the parameters in the folder of config.
Start the training with single gpu:

cd myTrain
python2.7 train.py

Testing

In the folder of myTest:

you can use EvalModel.ipynb to test on testing datasets.
you can use VideoTest.ipynb to test on a single image or video.

Visualization

Using tensorboard to visualize the training process:

cd path_to_save_model
tensorboard --logdir='./log'

Download models

from Dropbox:

mobilenetv2_eg1800_with_two_auxiliary_losses(Training on EG1800 with two auxiliary losses)
mobilenetv2_supervise_portrait_with_two_auxiliary_losses(Training on Supervise-Portrait with two auxiliary losses)
mobilenetv2_total_with_prior_channel(Training on Human with prior channel)

from Baidu Cloud:

mobilenetv2_eg1800_with_two_auxiliary_losses(Training on EG1800 with two auxiliary losses)
mobilenetv2_supervise_portrait_with_two_auxiliary_losses(Training on Supervise-Portrait with two auxiliary losses)
mobilenetv2_total_with_prior_channel(Training on Human with prior channel)

Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device" @ CAD&Graphics2019

Related tags

Overview

PortraitNet

Introduction

Experimental setup

Requirements

Download datasets

Training

Network Architecture

Training Steps

Testing

Visualization

Download models

Owner

Official implementation of Influence-balanced Loss for Imbalanced Visual Classification in PyTorch.

MOT-Tracking-by-Detection-Pipeline - For Tracking-by-Detection format MOT (Multi Object Tracking), is it a framework that separates Detection and Tracking processes?

Intelligent Video Analytics toolkit based on different inference backends.

Official Pytorch implementation of C3-GAN

OstrichRL: A Musculoskeletal Ostrich Simulation to Study Bio-mechanical Locomotion.

Mercer Gaussian Process (MGP) and Fourier Gaussian Process (FGP) Regression

(CVPR 2022 - oral) Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

A Tensorflow implementation of the Text Conditioned Auxiliary Classifier Generative Adversarial Network for Generating Images from text descriptions

Learning to Identify Top Elo Ratings with A Dueling Bandits Approach

A machine learning library for spiking neural networks. Supports training with both torch and jax pipelines, and deployment to neuromorphic hardware.

Finetune SSL models for MOS prediction

Self-supervised Product Quantization for Deep Unsupervised Image Retrieval - ICCV2021

A heterogeneous entity-augmented academic language model based on Open Academic Graph (OAG)

NaturalCC is a sequence modeling toolkit that allows researchers and developers to train custom models

ElegantRL is featured with lightweight, efficient and stable, for researchers and practitioners.

Greedy Gaussian Segmentation

[CVPR21] LightTrack: Finding Lightweight Neural Network for Object Tracking via One-Shot Architecture Search

A-ESRGAN aims to provide better super-resolution images by using multi-scale attention U-net discriminators.

This code is part of the reproducibility package for the SANER 2022 paper "Generating Clarifying Questions for Query Refinement in Source Code Search".

[ACM MM2021] MGH: Metadata Guided Hypergraph Modeling for Unsupervised Person Re-identification