Stochastic Positional Encoding (SPE)

This is the source code repository for the ICML 2021 paper Relative Positional Encoding for Transformers with Linear Complexity by Antoine Liutkus, Ondřej Cífka, Shih-Lun Wu, Umut Şimşekli, Yi-Hsuan Yang and Gaël Richard.

In this paper, we propose Stochastic Positional Encoding (SPE), which provably behaves like relative PE while being compatible with linear-complexity Transformers. We do this by drawing a connection between positional encoding and cross-covariance structures of correlated Gaussian processes.

Check out also the companion website with music examples.

Citation:

@inproceedings{pmlr-v139-liutkus21a,
  title = 	 {Relative Positional Encoding for {Transformers} with Linear Complexity},
  author =       {Liutkus, Antoine and C{\'i}fka, Ond{\v r}ej and Wu, Shih-Lun and {\c S}im{\c s}ekli, Umut and Yang, Yi-Hsuan and Richard, Ga{\"e}l},
  booktitle = 	 {Proceedings of the 38th International Conference on Machine Learning},
  pages = 	 {7067--7079},
  year = 	 {2021},
  editor = 	 {Meila, Marina and Zhang, Tong},
  volume = 	 {139},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {18--24 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v139/liutkus21a/liutkus21a.pdf},
  url = 	 {http://proceedings.mlr.press/v139/liutkus21a.html}
}

SPE implementation

We have implemented SPE in PyTorch and JAX/Flax. Each implementation is available as a separate Python package under src.

Experiments

Each of the 3 experiments (LRA, pop piano generation, groove continuation) has a dedicated directory under experiments. See the README files there for how to set up the environment and prepare the datasets. To make sure you have the custom dependencies for each experiment, clone this repository with --recurse-submodules or run git submodule init && git submodule update after cloning.

Relative Positional Encoding for Transformers with Linear Complexity

Related tags

Overview

Stochastic Positional Encoding (SPE)

SPE implementation

Experiments

Owner

Antoine Liutkus

Locally Most Powerful Bayesian Test for Out-of-Distribution Detection using Deep Generative Models

This repository contains the re-implementation of our paper deSpeckNet: Generalizing Deep Learning Based SAR Image Despeckling

Google Landmark Recogntion and Retrieval 2021 Solutions

RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x flax community week

Delving into Localization Errors for Monocular 3D Object Detection, CVPR'2021

LaBERT - A length-controllable and non-autoregressive image captioning model.

A PyTorch implementation of PointRend: Image Segmentation as Rendering

这是一个利用facenet和retinaface实现人脸识别的库，可以进行在线的人脸识别。

Final report with code for KAIST Course KSE 801.

Object DGCNN and DETR3D, Our implementations are built on top of MMdetection3D.

RL-GAN: Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation

ChainerRL is a deep reinforcement learning library built on top of Chainer.

Official Implementation of SWAD (NeurIPS 2021)

MiraiML: asynchronous, autonomous and continuous Machine Learning in Python

tf2onnx - Convert TensorFlow, Keras and Tflite models to ONNX.

MMFlow is an open source optical flow toolbox based on PyTorch

Code for CVPR2019 paper《Unequal Training for Deep Face Recognition with Long Tailed Noisy Data》

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Pytorch implementation of U-Net, R2U-Net, Attention U-Net, and Attention R2U-Net.