3D Avatar Lip Syncronization from speech (JALI based face-rigging)

Last update: Dec 20, 2022

Overview

visemenet-inference

Inference Demo of "VisemeNet-tensorflow"
- VisemeNet is an audio-driven animator centric speech animation driving a JALI or standard FACS-based face-rigging from input audio.
- The original repo is outdated and difficult to setup the environment for testing the pretrained model. This code is to provide a super-clean inference module based on the original author's repo.

How to freeze graph

This repo does not need bazel-build for "freeze-graph" function
Thanks to https://github.com/lighttransport/VisemeNet-infer for giving some examples.

Requirements

Python 3.6.x using "pyenv"
Tensorflow 1.1.0

Setup the envs and packages

# Install Virtualenv using pyenv
pyenv install 3.6.5
pyenv virtualenv 3.6.5 visemenet-freeze
pyenv activate visemenet-freeze

# Install packages
pip install tensorflow==1.1.0

Clone the repo

# Clone Visemenet repo and the pretrained model
git clone https://github.com/yzhou359/VisemeNet_tensorflow.git
curl -L https://www.dropbox.com/sh/7nbqgwv0zz8pbk9/AAAghy76GVYDLqPKdANcyDuba?dl=0 > pretrained_model.zip
unzip prtrained_model.zip -d VisemeNet_tensorflow/data/ckpt/pretrain_biwi/

Freeze Graph and Save as pb

# Freeze Graph
python freeze_graph.py

Model Inference

Colab Demo

This code provides the simple and clean inference code without any needless ones
It's compatible with TF 2.0 Version

Requirements

Tensorflow 2.x
numpy
scipy
python_speech_features

How to run inference

import numpy as np
from inference import VisemeRegressor

pb_filepath = "./visemenet_frozen.pb"
wav_file_path = "./test_audio.wav"
out_txt_path = "./maya_viseme_outputs.txt"

viseme_regressor = VisemeRegressor(pb_filepath=pb_filepath)

viseme_outputs = viseme_regressor.predict_outputs(wav_file_path=wav_file_path)

np.savetxt(out_txt_path, viseme_outputs, '%.4f')

3D Avatar Lip Syncronization from speech (JALI based face-rigging)

Related tags

Overview

visemenet-inference

How to freeze graph

Requirements

Model Inference

Requirements

How to run inference

Owner

Junhwan Jang

This repository is the code of the paper Accelerating Deep Reinforcement Learning for Digital Twin Network Optimization with Evolutionary Strategies

Drone Task1 - Drone Task1 With Python

Implementation and replication of ProGen, Language Modeling for Protein Generation, in Jax

GUI for TOAD-GAN, a PCG-ML algorithm for Token-based Super Mario Bros. Levels.

HIVE: Evaluating the Human Interpretability of Visual Explanations

Simple Python application to transform Serial data into OSC messages

pytorch implementation of "Contrastive Multiview Coding", "Momentum Contrast for Unsupervised Visual Representation Learning", and "Unsupervised Feature Learning via Non-Parametric Instance-level Discrimination"

Official implementation for the paper: Multi-label Classification with Partial Annotations using Class-aware Selective Loss

Image Captioning on google cloud platform based on iot

Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"

To propose and implement a multi-class classification approach to disaster assessment from the given data set of post-earthquake satellite imagery.

Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)

Awesome Remote Sensing Toolkit based on PaddlePaddle.

Forecasting with Gradient Boosted Time Series Decomposition

"Domain Adaptive Semantic Segmentation without Source Data" (ACM MM 2021)

UniFormer - official implementation of UniFormer

Implementation of hyperparameter optimization/tuning methods for machine learning & deep learning models

This is an official source code for implementation on Extensive Deep Temporal Point Process

This repository holds code and data for our PETS'22 article 'From "Onion Not Found" to Guard Discovery'.

Python scripts for performing road segemtnation and car detection using the HybridNets multitask model in ONNX.