PyTorch implementation of "A Two-Stage End-to-End System for Speech-in-Noise Hearing Aid Processing"

Last update: Aug 19, 2022

Overview

Implementation of the Sheffield entry for the first Clarity enhancement challenge (CEC1)

This repository contains the PyTorch implementation of "A Two-Stage End-to-End System for Speech-in-Noise Hearing Aid Processing", the Sheffield entry for the first Clarity Enhancement Challenge (CEC1). The system consists of a Conv-TasNet-based denoising module and a finite-impulse-response (FIR) filter-based amplification module. A differentiable approximation of the Cambridge MSBG hearing loss model released with CEC1 is used in the loss function.
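To make the idea of an FIR-based amplification module concrete, here is a minimal sketch of a learnable FIR filter written as a causal 1-D convolution, so that gradients from a loss can flow back into the filter taps. The class name `FIRAmplifier`, the default tap count, and the identity initialisation are assumptions made for illustration; they are not taken from the repository's code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class FIRAmplifier(nn.Module):
    """Hypothetical learnable FIR filter (illustration only, not the repo's module)."""

    def __init__(self, num_taps: int = 65):
        super().__init__()
        # Start from a unit impulse so the untrained filter passes audio unchanged.
        taps = torch.zeros(num_taps)
        taps[0] = 1.0
        self.taps = nn.Parameter(taps)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x has shape (batch, channels, time); the same taps are applied to every channel.
        batch, channels, time = x.shape
        # conv1d computes cross-correlation, so flip the taps to obtain true convolution.
        kernel = self.taps.flip(0).view(1, 1, -1)
        # Left-pad with zeros: the filter stays causal and the output keeps the input length.
        padded = F.pad(x.reshape(batch * channels, 1, time), (self.taps.numel() - 1, 0))
        y = F.conv1d(padded, kernel)
        return y.reshape(batch, channels, time)


amp = FIRAmplifier(num_taps=5)
signal = torch.randn(2, 2, 100)  # e.g. a batch of two binaural signals
output = amp(signal)
```

Because the filter is an ordinary `nn.Parameter` applied through `conv1d`, it can be optimised with any differentiable loss, which is what allows a differentiable hearing loss model to sit inside the training objective.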

Requirements

Training the amplification module requires the MSBG package and PyTorch STOI.

Training

The system is built in two stages. First, train the Conv-TasNet-based denoising module using the scripts in recipe_den_convtasnet. Second, train the FIR-based amplification module using the scripts in recipe_amp_fir. The MBSTOI folder contains the MBSTOI implementation from the CEC1 project, together with a DBSTOI implementation.
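The two-stage order can be sketched as follows. This is only an illustration under stated assumptions: the two `nn.Conv1d` layers are stand-ins for the Conv-TasNet denoiser and the FIR amplifier, `nn.MSELoss` is a placeholder for the hearing-loss-aware loss, the data are random tensors, and freezing the denoiser during the second stage is one plausible setup rather than a confirmed detail of the recipes.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Stand-ins only: the real system uses Conv-TasNet and an FIR amplification module.
denoiser = nn.Conv1d(1, 1, kernel_size=3, padding=1)
amplifier = nn.Conv1d(1, 1, kernel_size=5, padding=2, bias=False)

# Stage 1 is assumed to be finished, so the denoiser is frozen (an assumption).
denoiser.eval()
for param in denoiser.parameters():
    param.requires_grad_(False)

# Stage 2: only the amplifier's parameters are handed to the optimiser.
optimizer = torch.optim.Adam(amplifier.parameters(), lr=1e-2)
loss_fn = nn.MSELoss()  # placeholder for the MSBG/STOI-based objective

noisy = torch.randn(4, 1, 256)   # dummy noisy input
target = torch.randn(4, 1, 256)  # dummy training target

denoiser_before = [p.clone() for p in denoiser.parameters()]
amplifier_before = amplifier.weight.detach().clone()

for _ in range(20):
    optimizer.zero_grad()
    with torch.no_grad():
        denoised = denoiser(noisy)  # no gradients flow into stage 1
    loss = loss_fn(amplifier(denoised), target)
    loss.backward()
    optimizer.step()
```

Keeping the stages separate in this way means the amplification module is always fitted to the output of an already trained denoiser rather than to raw noisy input.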

References

[1] Luo Y, Mesgarani N. Conv-TasNet: Surpassing ideal time–frequency magnitude masking for speech separation[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2019, 27(8): 1256-1266.
[2] Andersen A H, de Haan J M, Tan Z H, et al. Refinement and validation of the binaural short time objective intelligibility measure for spatially diverse conditions[J]. Speech Communication, 2018, 102: 1-13.
[3] Taal C H, Hendriks R C, Heusdens R, et al. A short-time objective intelligibility measure for time-frequency weighted noisy speech[C]. ICASSP 2010, Dallas, Texas.

Citation

If you use this work, please cite:

@inproceedings{tutwo,
  title={A Two-Stage End-to-End System for Speech-in-Noise Hearing Aid Processing},
  author={Tu, Zehai and Zhang, Jisi and Ma, Ning and Barker, Jon},
  year={2021},
  booktitle={The Clarity Workshop on Machine Learning Challenges for Hearing Aids (Clarity-2021)},
}
