Code for Paper "Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models"

Last update: Jun 06, 2022

Overview

Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

Abstract

Many applications of generative models rely on the marginalization of their high-dimensional output probability distributions. Normalization functions that yield sparse probability distributions can make exact marginalization more computationally tractable. However, sparse normalization functions usually require alternative loss functions for training because the log-likelihood can be undefined for sparse probability distributions. Furthermore, many sparse normalization functions collapse the multimodality of distributions. In this work, we present ev-softmax, a sparse normalization function that preserves the multimodality of probability distributions. We derive its properties, including its gradient in closed form, and introduce a continuous family of approximations to ev-softmax that have full support and can thus be trained with probabilistic loss functions such as negative log-likelihood and Kullback-Leibler divergence. We evaluate our method on a variety of generative models, including variational autoencoders and auto-regressive models. Our method outperforms existing dense and sparse normalization techniques in distributional accuracy and classification performance. We demonstrate that ev-softmax successfully reduces the dimensionality of output probability distributions while maintaining multimodality.

Setup

Required packages are listed in requirements.txt.

Running

The implementation for the ev-softmax function and its loss function can be found in evsoftmax.py.
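To make the abstract's point about sparse outputs and log-likelihood concrete, here is a minimal sketch in plain Python. It is not the code in evsoftmax.py. The function name `mean_threshold_softmax`, the mean-threshold rule used to zero out entries, and the `eps` smoothing parameter are all assumptions chosen for illustration, and the authors' definition may differ. The sketch shows that a sparse distribution assigns infinite negative log-likelihood to a zeroed class, while a smoothed variant with full support keeps the loss finite.

```python
import math


def mean_threshold_softmax(logits, eps=0.0):
    """Hypothetical sparse normalization (an assumption, not the repo's API).

    Each logit v is weighted by (1[v >= mean(logits)] + eps) * exp(v).
    With eps == 0 the entries below the mean get exactly zero probability;
    with eps > 0 every entry keeps a small positive probability.
    """
    mean = sum(logits) / len(logits)
    peak = max(logits)  # subtracted for numerical stability
    weights = [
        ((1.0 if v >= mean else 0.0) + eps) * math.exp(v - peak)
        for v in logits
    ]
    total = sum(weights)  # never zero: the largest logit is always >= the mean
    return [w / total for w in weights]


def nll(probs, target):
    """Negative log-likelihood of one target class; infinite if its probability is 0."""
    p = probs[target]
    return math.inf if p == 0.0 else -math.log(p)


# Two near-equal modes (indices 0 and 1) and two unlikely classes.
logits = [2.0, 1.9, -1.0, -3.0]

sparse = mean_threshold_softmax(logits)            # exact zeros below the mean
smooth = mean_threshold_softmax(logits, eps=1e-3)  # full support

sparse_loss = nll(sparse, 2)  # target class was zeroed out
smooth_loss = nll(smooth, 2)  # finite, so it can be used for training
```

In this sketch both high-scoring classes keep nonzero probability in `sparse`, which is the sense in which a sparse output can still be multimodal. The smoothed variant approaches the sparse one as `eps` shrinks.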

The MNIST CVAE and VQ-VAE experiments can be run using run_mnist_cvae.sh and run_vqvae.sh, respectively. Instructions for the SSVAE experiment can be found in mnist_ssvae/README.md, and scripts used for preprocessing, training, and evaluating can be found in mnist_ssvae/scripts. Instructions for the translation experiment can be found in translation/README.md, and scripts used for preprocessing, training, and evaluating can be found in translation/scripts/iwslt.

Owner

Stanford Intelligent Systems Laboratory
