This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

Last update: Dec 29, 2022

Related tags

Deep Learning AD-NeRF

Overview

AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis

| Project Page | Paper |

PyTorch implementation for the paper "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis"

Prerequisites

You can create an anaconda environment called adnerf with:

conda env create -f environment.yml
conda activate adnerf

PyTorch3D

Recommend install from a local clone

git clone https://github.com/facebookresearch/pytorch3d.git
cd pytorch3d && pip install -e .

Basel Face Model 2009

Put "01_MorphableModel.mat" to data_util/face_tracking/3DMM/; cd data_util/face_tracking; run
```
python convert_BFM.py
```

Train AD-NeRF

Data Preprocess ($id Obama for example)
```
bash process_data.sh Obama
```
- Input: A portrait video at 25fps containing voice audio. (dataset/vids/$id.mp4)
- Output: folder dataset/$id that contains all files for training
Train Two NeRFs (Head-NeRF and Torso-NeRF)
- Train Head-NeRF with command
```
python NeRFs/HeadNeRF/run_nerf.py --config dataset/$id/HeadNeRF_config.txt
```
- Copy latest trainied model from dataset/$id/logs/$id_head to dataset/$id/logs/$id_com
- Train Torso-NeRF with command
```
python NeRFs/TorsoNeRF/run_nerf.py --config dataset/$id/TorsoNeRF_config.txt
```

Run AD-NeRF for rendering

Reconstruct original video with audio input

python NeRFs/TorsoNeRF/run_nerf.py --config dataset/$id/TorsoNeRFTest_config.txt --aud_file=dataset/$id/aud.npy --test_size=300

Drive the target person with another audio input

python NeRFs/TorsoNeRF/run_nerf.py --config dataset/$id/TorsoNeRFTest_config.txt --aud_file=${deepspeechfile.npy} --test_size=-1

Acknowledgments

We use face-parsing.PyTorch for parsing head and torso maps, and DeepSpeech for audio feature extraction. The NeRF model is implemented based on NeRF-pytorch.

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

Related tags

Overview

AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis

| Project Page | Paper |

Prerequisites

Train AD-NeRF

Run AD-NeRF for rendering

Acknowledgments

Owner

BMVC 2021 Oral: code for BI-GCN: Boundary-Aware Input-Dependent Graph Convolution for Biomedical Image Segmentation

Generic Event Boundary Detection: A Benchmark for Event Segmentation

Code of paper "CDFI: Compression-Driven Network Design for Frame Interpolation", CVPR 2021

A really easy-to-use and powerful sudoku solver.

Unofficial implementation of MUSIQ (Multi-Scale Image Quality Transformer)

Official repo for BMVC2021 paper ASFormer: Transformer for Action Segmentation

This repository contains small projects related to Neural Networks and Deep Learning in general.

Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines

A collection of awesome resources image-to-image translation.

Toontown House CT Edition

Exponential Graph is Provably Efficient for Decentralized Deep Training

Apply AnimeGAN-v2 across frames of a video clip

Convnet transfer - Code for paper How transferable are features in deep neural networks?

Official Pytorch implementation of Meta Internal Learning

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

Deep Learning for Time Series Classification

A PyTorch implementation of PointRend: Image Segmentation as Rendering

Scalable machine learning based time series forecasting

Object Depth via Motion and Detection Dataset

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.