HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Last update: Dec 29, 2022

Overview

This is an unofficial implementation of the vocoder part of HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement.

This repo is a work in progress, but training can already be started without errors.

Training:

python train.py --config config_v2.json

Citations:

@misc{https://doi.org/10.48550/arxiv.2203.13086,
  doi = {10.48550/ARXIV.2203.13086},
  url = {https://arxiv.org/abs/2203.13086},
  author = {Andreev, Pavel and Alanov, Aibek and Ivanov, Oleg and Vetrov, Dmitry},
  keywords = {Sound (cs.SD), Machine Learning (cs.LG), Audio and Speech Processing (eess.AS), FOS: Computer and information sciences, FOS: Electrical engineering, electronic engineering, information engineering},
  title = {HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement},
  publisher = {arXiv},
  year = {2022},
  copyright = {arXiv.org perpetual, non-exclusive license}
}

References:

https://github.com/jik876/hifi-gan

Owner

Rishikesh (ऋषिकेश)
