STT for TorchScript (st3) is a PyTorch port of Coqui STT, which is based on DeepSpeech.

Last update: Oct 18, 2021

Overview

st3

Currently it supports converting pbmm models to TorchScript pt files with integrated beam search.

Check out the first pre-release: https://github.com/proger/st3/releases
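
To make "integrated beam search" more concrete, here is a minimal sketch of CTC prefix beam search in plain Python. DeepSpeech-style acoustic models are trained with CTC, so decoding has to merge repeated symbols and drop blanks. This is a generic textbook version, not the code shipped in st3: the function name ctc_beam_search, the symbols list, and the toy probabilities are all assumptions made for illustration.

```python
import math
from collections import defaultdict

NEG_INF = float("-inf")


def log_add(a, b):
    """Return log(exp(a) + exp(b)) in a numerically stable way."""
    if a == NEG_INF:
        return b
    if b == NEG_INF:
        return a
    hi, lo = max(a, b), min(a, b)
    return hi + math.log1p(math.exp(lo - hi))


def ctc_beam_search(log_probs, symbols, beam_width=10, blank=0):
    """Hypothetical helper: return up to `beam_width` (text, log_prob) pairs.

    log_probs: one list of log-probabilities per frame, indexed like `symbols`.
    symbols:   symbols[i] is the character for index i; symbols[blank] is unused.
    """
    # Each prefix keeps two scores: paths ending in blank, paths ending in a symbol.
    beams = {(): (0.0, NEG_INF)}
    for frame in log_probs:
        nxt = defaultdict(lambda: (NEG_INF, NEG_INF))
        for prefix, (p_b, p_nb) in beams.items():
            p_total = log_add(p_b, p_nb)
            for idx, lp in enumerate(frame):
                if idx == blank:
                    # A blank keeps the prefix unchanged.
                    b, nb = nxt[prefix]
                    nxt[prefix] = (log_add(b, p_total + lp), nb)
                elif prefix and prefix[-1] == idx:
                    # A repeated symbol collapses into the same prefix...
                    b, nb = nxt[prefix]
                    nxt[prefix] = (b, log_add(nb, p_nb + lp))
                    # ...unless a blank separated the two occurrences.
                    ext = prefix + (idx,)
                    b, nb = nxt[ext]
                    nxt[ext] = (b, log_add(nb, p_b + lp))
                else:
                    ext = prefix + (idx,)
                    b, nb = nxt[ext]
                    nxt[ext] = (b, log_add(nb, p_total + lp))
        # Keep only the best `beam_width` prefixes with non-zero probability.
        ranked = sorted(nxt.items(), key=lambda kv: log_add(*kv[1]), reverse=True)
        beams = {p: s for p, s in ranked[:beam_width] if log_add(*s) > NEG_INF}
    ranked = sorted(beams.items(), key=lambda kv: log_add(*kv[1]), reverse=True)
    return [("".join(symbols[i] for i in p), log_add(*s)) for p, s in ranked]


# Toy input: two frames over the alphabet {blank, "a"}.
toy_frames = [[math.log(0.4), math.log(0.6)]] * 2
hypotheses = ctc_beam_search(toy_frames, symbols=["_", "a"], beam_width=10)
for text, score in hypotheses:
    print(repr(text), math.exp(score))
```

With beam_width=10 the function returns at most ten hypotheses ordered from most to least likely, which mirrors the n-best output described for the release, though the real decoder may differ in pruning and scoring details.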

PyTorch implementations of BERT-based Spelling Error Correction Models

59 Jun 29, 2021

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

VAENAR-TTS - PyTorch Implementation PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

67 Nov 14, 2022

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

WaveGlow A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis Quick Start: Install requirements: pip install

204 Jul 14, 2022

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Deepvoice3_pytorch PyTorch implementation of convolutional networks-based text-to-speech synthesis models: arXiv:1710.07654: Deep Voice 3: Scaling Tex

1.8k Dec 30, 2022

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, available for both PyTorch and TensorFlow.

730 Jan 9, 2023

A Word Level Transformer layer based on PyTorch and 🤗 Transformers.

Transformer Embedder A Word Level Transformer layer based on PyTorch and 🤗 Transformers. How to use Install the library from PyPI: pip install transf

27 Nov 20, 2022

The PyTorch based implementation of continuous integrate-and-fire (CIF) module.

CIF-PyTorch This is a PyTorch based implementation of continuous integrate-and-fire (CIF) module for end-to-end (E2E) automatic speech recognition (AS

24 Dec 29, 2022

An example project using OpenPrompt under pytorch-lightning for prompt-based SST2 sentiment analysis model

pl_prompt_sst An example project using OpenPrompt under the framework of pytorch-lightning for a training prompt-based text classification model on SS

5 Oct 21, 2022

PyTorch implementation of the paper: Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding

Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding This repository contains the official PyTorch implementation of th

26 Dec 14, 2022

Releases (english1)

english1 (Sep 13, 2021)
This is a conversion of the Coqui English STT v0.9.3 model to TorchScript, which allows a speech recognizer to be deployed as a single file. The TorchScript bundle is self-contained: it runs the DeepSpeech frontend and beam search, and returns the 10 best results. The LM scorer is not supported at the moment.

To run it, download the pt file, install torchaudio with pip3 install torchaudio, and save the following code as recognize.py:

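import sys

import torch
import torchaudio

# Load the recording given on the command line as normalized float samples.
waveform, sr = torchaudio.load(sys.argv[1], normalize=True)
assert sr == 16000  # this script only accepts 16 kHz audio

# The TorchScript bundle returns (transcript, scores) pairs.
model = torch.jit.load('coqui-stt-0.9.3-models.pt')
for transcript, scores in model(waveform.squeeze()):
    print(transcript, scores)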

You can now run the model on English recordings as shown below. Any format supported by the torchaudio backend should work.

python3 recognize.py sample.wav
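
Because recognize.py asserts a 16 kHz sample rate, it can be handy to check a recording before running the model. The following is a minimal sketch using only the Python standard library. It handles PCM WAV files only, and the helper names check_wav and write_silence are hypothetical, not part of st3. Note also that squeeze() only drops the channel dimension for single-channel audio, so the sketch reports the channel count as well.

```python
import os
import tempfile
import wave

EXPECTED_RATE = 16000  # mirrors the `assert sr == 16000` in recognize.py


def check_wav(path, expected_rate=EXPECTED_RATE):
    """Hypothetical helper: return (rate_ok, sample_rate, channels) for a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        channels = wav.getnchannels()
    return rate == expected_rate, rate, channels


def write_silence(path, rate, n_frames=1600):
    """Hypothetical helper: write a short mono 16-bit silent WAV file for testing."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * n_frames)


# Demo on a generated file, so the sketch runs without any real recording.
demo_dir = tempfile.mkdtemp()
demo_path = os.path.join(demo_dir, "sample.wav")
write_silence(demo_path, EXPECTED_RATE)
rate_ok, rate, channels = check_wav(demo_path)
print(f"rate={rate} Hz, channels={channels}, ok={rate_ok}")
```

If the rate does not match, the recording would need to be resampled to 16 kHz before it is passed to recognize.py.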
coqui-stt-0.9.3-models.pt (180.26 MB)

Owner

Vlad Ki

Natural Language Processing Best Practices & Examples

NLP Best Practices In recent years, natural language processing (NLP) has seen quick growth in quality and usability, and this has helped to drive bus

6.1k Dec 31, 2022

TFPNER: Exploration on the Named Entity Recognition of Token Fused with Part-of-Speech

TFPNER TFPNER: Exploration on the Named Entity Recognition of Token Fused with Part-of-Speech Named entity recognition (NER), which aims at identifyin

1 Feb 07, 2022

Mkdocs + material + cool stuff

Modern-Python-Doc-Example mkdocs + material + cool stuff Doc is live here Features out of the box amazing good looking website thanks to mkdocs.org an

61 Oct 26, 2022

PyTorch version of BERT-flow: one can apply BERT-flow to any PLM within the PyTorch framework.

59 Dec 01, 2022

Unsupervised text tokenizer focused on computational efficiency

YouTokenToMe YouTokenToMe is an unsupervised text tokenizer focused on computational efficiency. It currently implements fast Byte Pair Encoding (BPE)

847 Dec 19, 2022

Natural language computational chemistry command line interface.

nlcc Install pip install nlcc Must have Open-AI Codex key: export OPENAI_API_KEY=your key here then nlcc key bindings ctrl-w copy to clipboard (Note

37 Dec 14, 2022

This repository has implementations of data augmentation for NLP for Japanese.

daaja This repository has implementations of data augmentation for NLP for Japanese: EDA: Easy Data Augmentation Techniques for Boosting Performance

60 Nov 11, 2022

Japanese synonym library

chikkarpy chikkarpy is a Python version of chikkar. chikkarpy is a library developed to add synonym expansion to the output of SudachiPy, using the Sudachi synonym dictionary.

48 Dec 14, 2022

Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP

Pretrain and Fine-tune a T5 model with Flax on GCP This tutorial details how to pretrain and fine-tune a FlaxT5 model from HuggingFace using a TPU VM ava

41 Nov 18, 2022

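SimpleChinese2 integrates many basic Chinese NLP functions, making Python-based Chinese text processing and information extraction simple and convenient.

SimpleChinese2 SimpleChinese2 integrates many basic Chinese NLP functions, making Python-based Chinese text processing and information extraction simple and convenient. Disclaimer: this project was created to make the author's own work easier, and only part of the code is original.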

30 Dec 02, 2022

nlabel is a library for generating, storing and retrieving tagging information and embedding vectors from various nlp libraries through a unified interface.

2 Jun 10, 2022

A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

The speech synthesis engine of VOICEVOX, a free, medium-quality text-to-speech software.

NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings

Codes for coreference-aware machine reading comprehension

Implementation of legal QA system based on SentenceKoBART

Implementations of several unbalanced losses, such as focal_loss, dice_loss, DSC Loss, and GHM Loss.

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

It analyzes the sentiment of the user, whether it is positive or negative.

Repository for the paper "Optimal Subarchitecture Extraction for BERT"