PyTorch implementation of DCT fast weight RNNs

Last update: Dec 24, 2022

Overview

DCT based fast weights

This repository contains the official code for the paper: Training and Generating Neural Networks in Compressed Weight Space.

The main code includes:

DCT LSTM: LSTMs whose weights are encoded by discrete cosine transform (DCT).
DCT fast weight RNN: RNNs whose weights are encoded by DCT, and the DCT coefficients are parameterized by LSTMs.

The language modeling experiments reported in the paper were produced by porting code (with minor changes due to some clean-up) of this repository in a fork of this toolkit.

Requirements

torch_dct (can be installed via pip install torch_dct)
PyTorch with a version compatible with torch_dct.

Our experiments were conducted using PyTorch version 1.6.0 . More recent versions are apparently not compatible with torch_dct (at least at the time of writing this file). We recommend to run python custom_layer.py to check the compatibility.

References

If you make use of this toolkit for your experiments, please cite:

@inproceedings{irie2021training,
  title={Training and Generating Neural Networks in Compressed Weight Space},
  author={Kazuki Irie and J{\"u}rgen Schmidhuber},
  booktitle={Neural Compression: From Information Theory to Applications -- Workshop @ ICLR 2021},
  year={2021},
  address={Virtual only},
  month=may
}

PyTorch implementation of DCT fast weight RNNs

Related tags

Overview

DCT based fast weights

Requirements

References

Owner

Kazuki Irie

Text-to-Music Retrieval using Pre-defined/Data-driven Emotion Embeddings

Explaining in Style: Training a GAN to explain a classifier in StyleSpace

Survival analysis (SA) is a well-known statistical technique for the study of temporal events.

Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks

Probabilistic Gradient Boosting Machines

A repository for the paper "Improved Adversarial Systems for 3D Object Generation and Reconstruction".

STARCH compuets regional extreme storm physical characteristics and moisture balance based on spatiotemporal precipitation data from reanalysis or climate model data.

A community run, 5-day PyTorch Deep Learning Bootcamp

A fast Protein Chain / Ligand Extractor and organizer.

Delving into Localization Errors for Monocular 3D Object Detection, CVPR'2021

9th place solution

Evolving neural network parameters in JAX.

High-Resolution Image Synthesis with Latent Diffusion Models

Code for "Solving Graph-based Public Good Games with Tree Search and Imitation Learning"

Causal estimators for use with WhyNot

A collection of implementations of deep domain adaptation algorithms

Generative Exploration and Exploitation - This is an improved version of GENE.

Multi-Scale Aligned Distillation for Low-Resolution Detection (CVPR2021)

Using deep actor-critic model to learn best strategies in pair trading

[NeurIPS 2021] Large Scale Learning on Non-Homophilous Graphs: New Benchmarks and Strong Simple Methods