a morph transfer UGATIT for image translation.

Last update: Nov 14, 2022

Related tags

Deep Learning Morph-UGATIT

Overview

Morph-UGATIT

a morph transfer UGATIT for image translation.

Introduction

中文技术文档

This is Pytorch implementation of UGATIT, paper "U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation".

Additionally, I DIY the model by adding two modules, a MLP module to learn a latent zone and an identity preserving loss. These two factors make UGATIT to achieve a progressive domain transfer for image translation. I call this method Morph UGATIT.

My work has two aspects:

Firstly, according to official TensorFlow code of UGATIT, I use PyTorch to reimplement it, very close to original TF model including network, training hyper parameters.
I add a MLP module, introducing a latent code for generator. And an identity preserving loss is used to learn more common feature for different domains.

I train model on two datasets, "adult2child" and "selfie2anime".

Requirements

python3.7
Pytorch >= 1.6
dlib. Before installing dlib, you should install Cmake and Boost

pip install Cmake
pip install Boost
pip install dlib

other common-used libraries.

How to Use

There are many models in my repo, but you just need two models and corresponding python script files.

UGATIT: "configs/cfgs_ugatit.py", "models/ugatit.py", "tool/train_ugatit.py", "tool/demo_ugatit.py"
Morph UGATIT: "configs/cfgs_s_ugatit_plus.py", "models/s_ugatit_plus.py", "tool/train_s_ugatit_plus.py", "tool/demo_morph_ugatit.py"

train step

getting dataset. The "adult2child" dataset comes from G-Lab, which is generated by StyleGAN. You can download here

The "selfie2anime" dataset comes from official UGATIT repo.

set configurations. configuration files can be found "configs" dir. You just focus on "cfgs_ugatit.py" and "cfgs_s_ugatit_plus.py". Please change:

dirA: domain A dataset path.
dirB: domain B dataset path.
anime: whether dataset is "selfie2anime".
tensorboard: tensorboard log path.
saved_dir: save model weight into "saved_dir".

start to train.

cd tool
python train_ugatit.py   # ugatit
python train_s_ugatit_plus.py   #  morph ugatit

you can also use tensorboard to check loss curves and some visualizations.

evaluation step

Since dlib is necessary, you should download dlib model weight here. change "alignment_loc" at "tool/demo_xxxx.py". "xxx" means "ugatit" or "morph_ugatit" to your dlib model weight path. Then put a test image into a dir.

cd tool
python demo_ugatit.py --type ugatit --resume ${ckpt path}$ --input ${image dir}$ --saved-dir ${result location}$ --align
python demo_morph_ugatit.py --resume ${ckpt path}$ --input ${image dir}$ --saved-dir ${result location}$ --align

Note: if you want to try "selfie2anime", please add a extra term "--anime".

Here I provide my pretrained model weights.

for "adult2child" dataset

ugatit

morph ugatit

for "selfie2anime" dataset

ugatit

More results can be seen here

References

official UGATIT repo
official CycleGAN repo
GLab, http://www.seeprettyface.com/
paper "Lifespan age transformation synthesis" and its' official code.

a morph transfer UGATIT for image translation.

Related tags

Overview

Morph-UGATIT

Introduction

Requirements

How to Use

train step

evaluation step

References

Owner

Code for "Steerable Pyramid Transform Enables Robust Left Ventricle Quantification"

Road Crack Detection Using Deep Learning Methods

Code repository for our paper regarding the L3D dataset.

SpeechNAS Better Trade off between Latency and Accuracy for Large Scale Speaker Verification

YolactEdge: Real-time Instance Segmentation on the Edge

Luminous is a framework for testing the performance of Embodied AI (EAI) models in indoor tasks.

clDice - a Novel Topology-Preserving Loss Function for Tubular Structure Segmentation

Qcover is an open source effort to help exploring combinatorial optimization problems in Noisy Intermediate-scale Quantum(NISQ) processor.

Honours project, on creating a depth estimation map from two stereo images of featureless regions

A Haskell kernel for IPython.

Seasonal Contrast: Unsupervised Pre-Training from Uncurated Remote Sensing Data

Official PyTorch implementation of the NeurIPS 2021 paper StyleGAN3

Agile SVG maker for python

Collection of sports betting AI tools.

An NVDA add-on to split screen reader and audio from other programs to different sound channels

Python version of the amazing Reaction Mechanism Generator (RMG).

a curated list of docker-compose files prepared for testing data engineering tools, databases and open source libraries.

Unet network with mean teacher for altrasound image segmentation