Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Last update: Jan 01, 2023

Overview

RAVE: Realtime Audio Variational autoEncoder

Official implementation of RAVE: A variational autoencoder for fast and high-quality neural audio synthesis (article link) by Antoine Caillon and Philippe Esling.

If you use RAVE as a part of a music performance or installation, be sure to cite either this repository or the article !

Installation

RAVE needs python 3.9. Install the dependencies using

pip install -r requirements.txt

Detailed instructions to setup a training station for this project are available here.

Preprocessing

RAVE comes with two command line utilities, resample and duration. resample allows to pre-process (silence removal, loudness normalization) and augment (compression) an entire directory of audio files (.mp3, .aiff, .opus, .wav, .aac). duration prints out the total duration of a .wav folder.

Training

Both RAVE and the prior model are available in this repo. For most users we recommand to use the cli_helper.py script, since it will generate a set of instructions allowing the training and export of both RAVE and the prior model on a specific dataset.

python cli_helper.py

However, if you want to customize even more your training, you can use the provided train_{rave, prior}.py and export_{rave, prior}.py scripts manually.

Reconstructing audio

Once trained, you can reconstruct an entire folder containing wav files using

python reconstruct.py --ckpt /path/to/checkpoint --wav-folder /path/to/wav/folder

You can also export RAVE to a torchscript file using export_rave.py and use the encode and decode methods on tensors.

Realtime usage

UPDATE

If you want to use the realtime mode, you should update your dependencies !

pip install -r requirements.txt

RAVE and the prior model can be used in realtime on live audio streams, allowing creative interactions with both models.

nn~

RAVE is compatible with the nn~ max/msp and PureData external.

An audio example of the prior sampling patch is available in the docs/ folder.

RAVE vst

You can also use RAVE as a VST audio plugin using the RAVE vst !

Discussion

If you have questions, want to share your experience with RAVE or share musical pieces done with the model, you can use the Discussion tab !

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Related tags

Overview

RAVE: Realtime Audio Variational autoEncoder

Installation

Preprocessing

Training

Reconstructing audio

Realtime usage

nn~

RAVE vst

Discussion

Owner

ACIDS

Learning View Priors for Single-view 3D Reconstruction (CVPR 2019)

Rethinking Nearest Neighbors for Visual Classification

The Pytorch implementation for "Video-Text Pre-training with Learned Regions"

PIKA: a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi

ICLR 2021, Fair Mixup: Fairness via Interpolation

Software & Hardware to do multi color printing with Sharpies

CAST: Character labeling in Animation using Self-supervision by Tracking

The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.

Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation

Tutorial to set up TensorFlow Object Detection API on the Raspberry Pi

Simple and Distributed Machine Learning

Predict and time series avocado hass

Personal implementation of paper "Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval"

Synthesizing and manipulating 2048x1024 images with conditional GANs

Simple object detection app with streamlit

Deep Video Matting via Spatio-Temporal Alignment and Aggregation [CVPR2021]

Iris prediction model is used to classify iris species created julia's DecisionTree, DataFrames, JLD2, PlotlyJS and Statistics packages.

Hyperbolic Hierarchical Clustering.

Code for the paper Open Sesame: Getting Inside BERT's Linguistic Knowledge.

Winners of the Facebook Image Similarity Challenge