EqGAN - Improving GAN Equilibrium by Raising Spatial Awareness

Improving GAN Equilibrium by Raising Spatial Awareness
Jianyuan Wang, Ceyuan Yang, Yinghao Xu, Yujun Shen, Hongdong Li, Bolei Zhou
arXiv preprint

In Generative Adversarial Networks (GANs), a generator (G) and a discriminator (D) are expected to reach an equilibrium in which D cannot distinguish the generated images from the real ones. In practice, however, such an equilibrium is difficult to achieve in GAN training; instead, D almost always surpasses G. We attribute this phenomenon to an information asymmetry: D learns its own visual attention when determining whether an image is real or fake, but G has no explicit clue about which regions to focus on.

To alleviate the issue of D dominating the competition in GANs, we aim to raise the spatial awareness of G. We encode randomly sampled multi-level heatmaps into the intermediate layers of G as an inductive bias. We further propose to align the spatial awareness of G with the attention map induced from D. In this way, we effectively lessen the information gap between D and G. Extensive results show that our method pushes the two-player game in GANs closer to the equilibrium, leading to better synthesis performance. As a byproduct, the introduced spatial awareness facilitates interactive editing of the synthesized output.
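
The two ideas above (multi-level heatmaps as a spatial prior for G, and aligning that prior with an attention map from D) can be illustrated with a minimal, framework-free sketch. Everything in it is an assumption made for illustration, not the released implementation: the Gaussian heatmap shape, the resolutions `(4, 8, 16)`, the `sigma` value, the mean-squared-error alignment term, and the function names `gaussian_heatmap`, `sample_multilevel_heatmaps`, and `alignment_loss` are all hypothetical.

```python
import math
import random


def gaussian_heatmap(size, centers, sigma=0.2):
    """Render a size x size map as the max over Gaussians at normalized centers.

    Assumption: Gaussian bumps are one plausible heatmap shape; the abstract
    does not specify how the heatmaps are constructed.
    """
    heatmap = []
    for row in range(size):
        line = []
        for col in range(size):
            # Pixel centre in normalized [0, 1] coordinates.
            y = (row + 0.5) / size
            x = (col + 0.5) / size
            value = 0.0
            for cx, cy in centers:
                d2 = (x - cx) ** 2 + (y - cy) ** 2
                value = max(value, math.exp(-d2 / (2.0 * sigma ** 2)))
            line.append(value)
        heatmap.append(line)
    return heatmap


def sample_multilevel_heatmaps(resolutions=(4, 8, 16), num_centers=2,
                               sigma=0.2, rng=None):
    """Sample random centers once and render them at several resolutions.

    Each resolution stands in for one intermediate layer of G (hypothetical).
    """
    rng = rng or random.Random()
    centers = [(rng.random(), rng.random()) for _ in range(num_centers)]
    maps = {res: gaussian_heatmap(res, centers, sigma) for res in resolutions}
    return centers, maps


def alignment_loss(heatmap, attention):
    """Mean squared difference between two equally sized 2D maps.

    Assumption: a simple stand-in for aligning G's heatmap with D's attention.
    """
    count = len(heatmap) * len(heatmap[0])
    total = sum((h - a) ** 2
                for h_row, a_row in zip(heatmap, attention)
                for h, a in zip(h_row, a_row))
    return total / count


# Usage: sample heatmaps, then compare one level against a stand-in attention map.
centers, maps = sample_multilevel_heatmaps(rng=random.Random(0))
uniform_attention = [[0.5] * 8 for _ in range(8)]  # placeholder for D's attention
loss = alignment_loss(maps[8], uniform_attention)
```

Because the same centers are rendered at every resolution, the coarse and fine maps describe the same regions, which is the sense in which the heatmaps here are "multi-level". In a real training loop the placeholder attention map would come from D, and the loss would be added to G's objective.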

BibTeX

@article{wang2021eqgan,
  title   = {Improving GAN Equilibrium by Raising Spatial Awareness},
  author  = {Wang, Jianyuan and Yang, Ceyuan and Xu, Yinghao and Shen, Yujun and Li, Hongdong and Zhou, Bolei},
  journal = {arXiv preprint},
  year    = {2021}
}

Owner

GenForce: May Generative Force Be with You