Variational autoencoder for anime face reconstruction

Last update: Dec 11, 2021

VAE animeface

Introduction

This repository is an exploratory example of training a variational autoencoder (VAE) to extract meaningful feature representations from anime girl face images.

The code architecture is mostly borrowed and modified from Yann Dubois's disentangling-vae repository, which provides a nice summary and comparison of several recently proposed VAE models.

Dataset

The Anime Face Dataset contains 63,632 anime faces (all rescaled to 64x64 for training).

Model

The model used is the one proposed in the paper Understanding disentangling in β-VAE, which is summarized below:

I used a Laplace distribution as the target distribution to calculate the reconstruction loss. Yann's code suggests that Bernoulli would generally be a better choice, but it seemed to converge slowly in my case. (I did not run a fair comparison, so this is not conclusive.)
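To make the two options above concrete, here is a minimal sketch of both reconstruction terms written as negative log-likelihoods over a flat list of pixel values in [0, 1]. It is not the repository's code: the function names, the fixed Laplace `scale`, and the clamping constant `eps` are assumptions made for illustration.

```python
import math

def laplace_recon_loss(target, recon, scale=1.0):
    """Negative log-likelihood of `target` under Laplace(recon, scale), summed over pixels.

    With a fixed scale this is an L1 loss plus a constant.
    """
    return sum(math.log(2.0 * scale) + abs(t - r) / scale
               for t, r in zip(target, recon))

def bernoulli_recon_loss(target, recon, eps=1e-7):
    """Binary cross-entropy between `target` and `recon`, summed over pixels."""
    total = 0.0
    for t, r in zip(target, recon):
        r = min(max(r, eps), 1.0 - eps)  # clamp to avoid log(0)
        total -= t * math.log(r) + (1.0 - t) * math.log(1.0 - r)
    return total

target = [0.0, 0.5, 1.0]
recon = [0.1, 0.5, 0.8]
print(laplace_recon_loss(target, recon))
print(bernoulli_recon_loss(target, recon))
```

In a PyTorch training loop the same quantities would normally be computed on image tensors; the pure-Python version here only shows the arithmetic.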

The loss function used is β-VAE_H from β-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework.
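As a rough illustration of the β-VAE_H objective named above, the sketch below adds a β-weighted KL divergence (between a diagonal Gaussian posterior and a standard normal prior) to a reconstruction term. The function names and the default `beta=4.0` are assumptions; the value of β used in this repository is not stated here.

```python
import math

def kl_to_standard_normal(mean, logvar):
    """KL( N(mean, diag(exp(logvar))) || N(0, I) ), summed over latent dimensions."""
    return sum(0.5 * (m * m + math.exp(lv) - 1.0 - lv)
               for m, lv in zip(mean, logvar))

def beta_vae_h_loss(recon_loss, mean, logvar, beta=4.0):
    """Reconstruction term plus beta-weighted KL term (beta is a hypothetical default)."""
    return recon_loss + beta * kl_to_standard_normal(mean, logvar)

# A posterior equal to the prior contributes no KL penalty.
print(beta_vae_h_loss(1.5, mean=[0.0] * 10, logvar=[0.0] * 10))
# Shifting one mean away from zero is penalized in proportion to beta.
print(beta_vae_h_loss(1.5, mean=[1.0] + [0.0] * 9, logvar=[0.0] * 10))
```

Setting `beta=1.0` reduces this to the standard VAE objective; larger values put more pressure on the latent code to match the prior.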

Result

The number of latent features is set to 20 (10 Gaussian means and 10 Gaussian log-variances). The VAE model is trained for 100 epochs. All data is used for training; no validation or test split is applied.
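The 20 encoder outputs described above parameterize a 10-dimensional Gaussian posterior, from which a latent code is drawn with the reparameterization trick. The sketch below shows that step under one assumption: the first 10 outputs are treated as means and the last 10 as log-variances, which may not match the ordering used in the actual code. The helper names are hypothetical.

```python
import math
import random

LATENT_DIM = 10

def split_encoder_output(stats):
    """Split 20 encoder outputs into 10 means and 10 log-variances (assumed ordering)."""
    if len(stats) != 2 * LATENT_DIM:
        raise ValueError(f"expected {2 * LATENT_DIM} values, got {len(stats)}")
    return stats[:LATENT_DIM], stats[LATENT_DIM:]

def reparameterize(mean, logvar, rng=random):
    """Sample z = mean + std * eps with eps ~ N(0, 1), where std = exp(0.5 * logvar)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mean, logvar)]

stats = [0.0] * 20  # stand-in for a real encoder output
mean, logvar = split_encoder_output(stats)
z = reparameterize(mean, logvar, random.Random(0))
print(len(z))
```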

Face reconstruction

Prior space traversal

Based on the face reconstruction results while traversing the latent space, we may speculate on the generative property of each latent dimension as follows:

1. Hair shade
2. Hair length
3. Face orientation
4. Hair color
5. Face rotation
6. Bangs, face color
7. Hair glossiness
8. Unclear
9. Eye size & color
10. Bangs
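A prior space traversal like the one interpreted above can be sketched as follows: for each latent dimension, sweep its value across a range while holding the other dimensions at the prior mean (zero), then decode every resulting vector. The sweep range of ±2, the 7 steps, and the function name are assumptions for illustration, and the decoder itself is not shown.

```python
def traversal_grid(latent_dim=10, steps=7, max_abs=2.0):
    """Build one row of latent vectors per dimension.

    Row `d` sweeps dimension `d` linearly from -max_abs to +max_abs
    (requires steps >= 2) while all other dimensions stay at 0.
    """
    values = [-max_abs + 2.0 * max_abs * i / (steps - 1) for i in range(steps)]
    grid = []
    for dim in range(latent_dim):
        row = []
        for v in values:
            z = [0.0] * latent_dim
            z[dim] = v
            row.append(z)
        grid.append(row)
    return grid

grid = traversal_grid()
print(len(grid), len(grid[0]))  # rows = latent dimensions, columns = sweep steps
print(grid[3][0])               # first step of the sweep over the fourth dimension
```

Each vector in `grid` would then be passed through the trained decoder, and the decoded images arranged in the same row and column layout.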

Original faces clustering

Original anime faces are clustered based on latent features: for each selected feature, the faces shown have a value either below the 1st percentile (left 5) or above the 99th percentile (right 5) among all data points, while the remaining latent features are close to each other. Visualization of the original images mostly confirms the speculation above.
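The selection described above could be approximated as in the sketch below. It makes two simplifying assumptions that may differ from the actual procedure: percentiles are taken by nearest rank, and "remaining features close to each other" is approximated by preferring images whose remaining features are closest to zero. The function name and the synthetic latents are hypothetical.

```python
import random

def select_extremes(latents, feature, n_show=5, low_q=0.01, high_q=0.99):
    """Return (low_idx, high_idx): images extreme in `feature`, preferring
    those whose remaining latent features are closest to zero."""
    values = sorted(z[feature] for z in latents)
    lo_cut = values[int(low_q * (len(values) - 1))]
    hi_cut = values[int(high_q * (len(values) - 1))]

    def others_norm(i):
        # Squared distance from zero over all features except the selected one.
        return sum(v * v for k, v in enumerate(latents[i]) if k != feature)

    low = [i for i, z in enumerate(latents) if z[feature] <= lo_cut]
    high = [i for i, z in enumerate(latents) if z[feature] >= hi_cut]
    return sorted(low, key=others_norm)[:n_show], sorted(high, key=others_norm)[:n_show]

# Synthetic latents standing in for the encoder means of real images.
rng = random.Random(0)
latents = [[rng.gauss(0.0, 1.0) for _ in range(10)] for _ in range(1000)]
low_idx, high_idx = select_extremes(latents, feature=0)
print(low_idx, high_idx)
```

The returned indices point back into the original image list, so the corresponding faces can be displayed side by side.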

Latent feature diagnosis

The learned latent features are all close to a standard normal distribution and show minimal correlation.
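One simple way to check a claim like the one above is to compute the per-dimension mean and standard deviation of the latent features, together with the largest absolute pairwise Pearson correlation. The sketch below does this on synthetic data; the function names are hypothetical, and real encoder outputs would replace the random latents.

```python
import math
import random

def mean_std(xs):
    """Mean and population standard deviation of a list of numbers."""
    m = sum(xs) / len(xs)
    return m, math.sqrt(sum((x - m) ** 2 for x in xs) / len(xs))

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    mx, sx = mean_std(xs)
    my, sy = mean_std(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)
    return cov / (sx * sy)

def diagnose(latents):
    """Return per-dimension (mean, std) and the largest absolute off-diagonal correlation."""
    columns = list(zip(*latents))
    stats = [mean_std(col) for col in columns]
    max_corr = max(abs(pearson(columns[i], columns[j]))
                   for i in range(len(columns))
                   for j in range(i + 1, len(columns)))
    return stats, max_corr

# Synthetic latents standing in for the encoder means of real images.
rng = random.Random(0)
latents = [[rng.gauss(0.0, 1.0) for _ in range(3)] for _ in range(2000)]
stats, max_corr = diagnose(latents)
print(stats)
print(max_corr)
```

Means near 0, standard deviations near 1, and a small maximum correlation would be consistent with latents that follow the standard normal prior and are roughly uncorrelated.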


Owner

Minzhe Zhang
