ConferencingSpeech2022; Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge

Last update: Dec 02, 2022

Related tags

Text Data & NLP ConferencingSpeech2022

Overview

ConferencingSpeech 2022 challenge

This repository contains the datasets list and scripts required for the ConferencingSpeech 2022 challenge. For more details about the challenge, please see our website.

Details

baseline, this folder contains baseline system include inference model exported by inference scripts;
eval, this folder contains evaluation scripts to calculate PLCC, RMSE and SRCC;
data-sets, this folder contains training and development test data-sets provied to the participant;
- Tencent Corpus, this dataset includes about 14,000 speech chinese speech clips with simulated (e.g. codecs, packet-loss, background noise) and live conditions.
- NISQA Corpus, the NISQA Corpus includes more than 14,000 speech samples with simulated (e.g. codecs, packet-loss, background noise) and live (e.g. mobile phone, Zoom, Skype, WhatsApp) conditions.
- IU Bloomington Corpus, there are 10,000 speech signals extracted from COSINE and VOiCESdatasets, each truncated between 3 to 6 seconds long.
- PSTN Corpus, there are about 80,000 speech clips through classic public switched telephone networks, each truncated 10 seconds long.

Requirements

To install requirements install Anaconda and then use:

conda env create -f envs.yml

This will create a new environment with the name "conferencingSpeech". Activate this environment to go on:

conda activate conferencingSpeech

Code license

Apache 2.0

ConferencingSpeech2022; Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge

Related tags

Overview

ConferencingSpeech 2022 challenge

Details

Requirements

Code license

Owner

Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System

A Structured Self-attentive Sentence Embedding

An ActivityWatch watcher to pose questions to the user and record her answers.

Phrase-BERT: Improved Phrase Embeddings from BERT with an Application to Corpus Exploration

Weaviate demo with the text2vec-openai module

lightweight, fast and robust columnar dataframe for data analytics with online update

Code and datasets for our paper "PTR: Prompt Tuning with Rules for Text Classification"

A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.

Speech to text streamlit app

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together

Official code for Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

Command Line Text-To-Speech using Google TTS

Indobenchmark are collections of Natural Language Understanding (IndoNLU) and Natural Language Generation (IndoNLG)

TweebankNLP - Pre-trained Tweet NLP Pipeline (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Models + Tweebank-NER

Synthetic data for the people.

Code for "Generative adversarial networks for reconstructing natural images from brain activity".

Towards Nonlinear Disentanglement in Natural Data with Temporal Sparse Coding

Unofficial PyTorch implementation of Google AI's VoiceFilter system