🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
-
Updated
Jun 13, 2024 - Python
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
An easy-to-use React.js component that leverages the Web Speech API to convert text to speech.
Pictalk is an open-source application designed to assist individuals with speech impediments communicate effectively using pictograms and pictures
[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
Tools for handling speech data in machine learning projects.
MARS5 speech model (TTS) from CAMB.AI
🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.
VITS-based Voice Conversion focused on simplicity, quality and performance.
Data manipulation and transformation for audio signal processing, powered by PyTorch
An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker commands
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Drift-Lens: an Unsupervised Drift Detection Framework for Deep Learning Classifiers on Unstructured Data
MATLAB implementation of the Speech Transmission Index for Public Address (STIPA) method for evaluating the speech transmission quality.
Add a description, image, and links to the speech topic page so that developers can more easily learn about it.
To associate your repository with the speech topic, visit your repo's landing page and select "manage topics."