The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"

Last update: Jan 09, 2023

Related tags

Deep Learning PIRender

Overview

Website | ArXiv | Get Start | Video

PIRenderer

The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering" (ICCV2021)

The proposed PIRenderer can synthesis portrait images by intuitively controlling the face motions with fully disentangled 3DMM parameters. This model can be applied to tasks such as:

Intuitive Portrait Image Editing

Intuitive Portrait Image Control

Pose & Expression Alignment
Motion Imitation

Same & Corss-identity Reenactment
Audio-Driven Facial Reenactment

Audio-Driven Reenactment

News

2021.9.20 Code for PyTorch is available!

Colab Demo

Coming soon

Get Start

1). Installation

Requirements

Python 3
PyTorch 1.7.1
CUDA 10.2

Conda Installation

# 1. Create a conda virtual environment.
conda create -n PIRenderer python=3.6
conda activate PIRenderer
conda install -c pytorch pytorch=1.7.1 torchvision cudatoolkit=10.2

# 2. Install other dependencies
pip install -r requirements.txt

2). Dataset

We train our model using the VoxCeleb. You can download the demo dataset for inference or prepare the dataset for training and testing.

Download the demo dataset

The demo dataset contains all 514 test videos. You can download the dataset with the following code:

./scripts/download_demo_dataset.sh

Or you can choose to download the resources with these links:

Google Driven & BaiDu Driven with extraction passwords ”p9ab“

Then unzip and save the files to ./dataset

Prepare the dataset

The dataset is preprocessed follow the method used in First-Order. You can follow the instructions in their repo to download and crop videos for training and testing.

After obtaining the VoxCeleb videos, we extract 3DMM parameters using Deep3DFaceReconstruction.

The folder are with format as:

${DATASET_ROOT_FOLDER}
└───path_to_videos
		└───train
				└───xxx.mp4
				└───xxx.mp4
				...
		└───test
				└───xxx.mp4
				└───xxx.mp4
				...
└───path_to_3dmm_coeff
		└───train
				└───xxx.mat
				└───xxx.mat
				...
		└───test
				└───xxx.mat
				└───xxx.mat
				...

We save the video and 3DMM parameters in a lmdb file. Please run the following code to do this

python scripts/prepare_vox_lmdb.py \
--path path_to_videos \
--coeff_3dmm_path path_to_3dmm_coeff \
--out path_to_output_dir

3). Training and Inference

Inference

The trained weights can be downloaded by running the following code:

./scripts/download_weights.sh

Or you can choose to download the resources with these links: coming soon. Then save the files to ./result/face

Reenactment

Run the the demo for face reenactment:

python -m torch.distributed.launch --nproc_per_node=1 --master_port 12345 inference.py \
--config ./config/face.yaml \
--name face \
--no_resume \
--output_dir ./vox_result/face_reenactment

The output results are saved at ./vox_result/face_reenactment

Intuitive Control

coming soon

Train

Our model can be trained with the following code

python -m torch.distributed.launch --nproc_per_node=4 --master_port 12345 train.py \
--config ./config/face.yaml \
--name face

Citation

If you find this code is helpful, please cite our paper

@misc{ren2021pirenderer,
      title={PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering}, 
      author={Yurui Ren and Ge Li and Yuanqi Chen and Thomas H. Li and Shan Liu},
      year={2021},
      eprint={2109.08379},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Acknowledgement

We build our project base on imaginaire. Some dataset preprocessing methods are derived from video-preprocessing.

The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"

Related tags

Overview

PIRenderer

News

Colab Demo

Get Start

1). Installation

Requirements

Conda Installation

2). Dataset

Download the demo dataset

Prepare the dataset

3). Training and Inference

Inference

Train

Citation

Acknowledgement

Owner

Ren Yurui

Fast and Context-Aware Framework for Space-Time Video Super-Resolution (VCIP 2021)

Security evaluation module with onnx, pytorch, and SecML.

Custom Implementation of Non-Deep Networks

Local trajectory planner based on a multilayer graph framework for autonomous race vehicles.

StyleSwin: Transformer-based GAN for High-resolution Image Generation

Generic Foreground Segmentation in Images

Guided Internet-delivered Cognitive Behavioral Therapy Adherence Forecasting

Simple Dynamic Batching Inference

OpenMMLab Image and Video Editing Toolbox

Feature board for ERPNext

NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch

NAS-Bench-x11 and the Power of Learning Curves

Functional TensorFlow Implementation of Singular Value Decomposition for paper Fast Graph Learning

Transformer in Vision

yolov5 deepsort 行人车辆跟踪检测计数

Fully Convolutional Refined Auto Encoding Generative Adversarial Networks for 3D Multi Object Scenes

Local Similarity Pattern and Cost Self-Reassembling for Deep Stereo Matching Networks

Only works with the dashboard version / branch of jesse

🛰️ List of earth observation companies and job sites

Credit fraud detection in Python using a Jupyter Notebook

The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"

Related tags

Overview

PIRenderer

News

Colab Demo

Get Start

1). Installation

Requirements

Conda Installation

2). Dataset

Download the demo dataset

Prepare the dataset

3). Training and Inference

Inference

Train

Citation

Acknowledgement

Owner

Ren Yurui

Fast and Context-Aware Framework for Space-Time Video Super-Resolution (VCIP 2021)

Security evaluation module with onnx, pytorch, and SecML.

Custom Implementation of Non-Deep Networks

Local trajectory planner based on a multilayer graph framework for autonomous race vehicles.

StyleSwin: Transformer-based GAN for High-resolution Image Generation

Generic Foreground Segmentation in Images

Guided Internet-delivered Cognitive Behavioral Therapy Adherence Forecasting

Simple Dynamic Batching Inference

OpenMMLab Image and Video Editing Toolbox

Feature board for ERPNext

NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch

NAS-Bench-x11 and the Power of Learning Curves

Functional TensorFlow Implementation of Singular Value Decomposition for paper Fast Graph Learning

Transformer in Vision

yolov5 deepsort 行人 车辆 跟踪 检测 计数

Fully Convolutional Refined Auto Encoding Generative Adversarial Networks for 3D Multi Object Scenes

Local Similarity Pattern and Cost Self-Reassembling for Deep Stereo Matching Networks

Only works with the dashboard version / branch of jesse

🛰️ List of earth observation companies and job sites

Credit fraud detection in Python using a Jupyter Notebook

yolov5 deepsort 行人车辆跟踪检测计数