This Repostory contains the pretrained DTLN-aec model for real-time acoustic echo cancellation.

Last update: Jan 07, 2023

Related tags

Overview

DTLN-aec

This Repostory contains the pretrained DTLN-aec model for real-time acoustic echo cancellation in TF-lite format. This model was handed in to the acoustic echo cancellation challenge (AEC-Challenge) organized by Microsoft. The DTLN-aec model is among the top-five models of the challenge. The results of the AEC-Challenge can be found here.

The model was trained on data from the DNS-Challenge and the AEC-Challenge reposetories.

The arXiv preprint can be found here.

@article{westhausen2020acoustic,
  title={Acoustic echo cancellation with the dual-signal transformation LSTM network},
  author={Westhausen, Nils L. and Meyer, Bernd T.},
  journal={arXiv preprint arXiv:2010.14337},
  year={2020}
}

Author: Nils L. Westhausen (Communication Acoustics , Carl von Ossietzky University, Oldenburg, Germany)

This code is licensed under the terms of the MIT license.

Usage:

First install the depencies from requirements.txt

Afterwards the model can be tested with:

$ python run_aec.py -i /folder/with/input/files -o /target/folder/ -m ./pretrained_models/dtln_aec_512

Files for testing can be found in the AEC-Challenge respository. The convention for file names is *_mic.wav for the near-end microphone signals and *_lpb.wav for the far-end microphone or loopback signals. The folder audio_samples contains one audio sample for each condition. The *_processed.wav files are created by the dtln_aec_512 model.

This Repostory contains the pretrained DTLN-aec model for real-time acoustic echo cancellation.

Related tags

Overview

DTLN-aec

Contents:

Usage:

This repository is still under construction.

Owner

Nils L. Westhausen

COD-Rank-Localize-and-Segment (CVPR2021)

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

Uni-Fold: Training your own deep protein-folding models

Cossim - Sharpened Cosine Distance implementation in PyTorch

Finetuning Pipeline

Complex Answer Generation For Conversational Search Systems.

Active and Sample-Efficient Model Evaluation

Implementation of SegNet: A Deep Convolutional Encoder-Decoder Architecture for Semantic Pixel-Wise Labelling

Projecting interval uncertainty through the discrete Fourier transform

Official Python implementation of the 'Sparse deconvolution'-v0.3.0

Composing methods for ML training efficiency

Mixup for Supervision, Semi- and Self-Supervision Learning Toolbox and Benchmark

Simulation of Self Driving Car

Pyeventbus: a publish/subscribe event bus

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.

Data and code for the paper "Importance of Kernel Bandwidth in Quantum Machine Learning"

Using some basic methods to show linkages and transformations of robotic arms

NLU Dataset Diagnostics