asr

Star

Here are 1,026 public repositories matching this topic...

KevKibe / African-Whisper

Star

🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.

speech speech-recognition speech-to-text whisper asr speech-translation speech-transcription

Updated Jun 11, 2024
Python

mkiol / dsnote

Star

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

text-to-speech translator translation offline machine-translation sailfishos tts speech-synthesis speech-recognition speech-to-text nmt linux-desktop stt asr flatpak-applications

Updated Jun 11, 2024
C++

NVIDIA / NeMo

Star

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr multimodal speech-translation large-language-models speaker-diariazation generative-ai

Updated Jun 11, 2024
Python

ShoukoChan / Voice-to-Text

Star

Voice to Text Model using OpenAI's Whisper

python torch speech-recognition cuda-toolkit asr nlp-deep-learning openai-whisper

Updated Jun 11, 2024

speechbrain / speechbrain

Star

A PyTorch-based Speech Toolkit

Updated Jun 11, 2024
Python

deepgram / deepgram-python-sdk

Star

Official Python SDK for Deepgram's automated speech recognition APIs.

python speech-recognition hacktoberfest asr deepgram automated-speech-recognition

Updated Jun 11, 2024
Python

Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript

android windows macos linux raspberry-pi ios text-to-speech csharp cpp dotnet speech-to-text aarch64 mfc risc-v asr arm32 onnx vits openkylin

Updated Jun 11, 2024
C++

DmitryRyumin / ICASSP-2023-24-Papers

Star

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated Jun 11, 2024
Python

unnumsykar / knowledge-transfer-GenAI

Star

how to compress large knowledge base (.mp4, .mp3, .wav) and transfer it into readable, short, summarized form for effective knowledge transfer

asr gpt-4 genai-usecase

Updated Jun 11, 2024

PaddlePaddle / PaddleSpeech

Star

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated Jun 11, 2024
Python

doppeltilde / automatic_speech_recognition

Star

Containerized REST API for interacting with Hugging Face Faster Whisper models.

python docker machine-learning rest rest-api inference automatic-speech-recognition asr huggingface faster-whisper

Updated Jun 11, 2024
Python

ictnlp / StreamSpeech

Star

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Updated Jun 11, 2024
Python

deepgram-devs / deepgram-conversational-demo

Star

Deepgram Conversational AI demo

react nextjs tts stt asr deepgram vercel

Updated Jun 10, 2024
TypeScript

Macoron / whisper.unity

Star

Running speech to text model (whisper.cpp) in Unity3d on your local machine.

unity3d speech-recognition openai speech-to-text stt whisper asr

Updated Jun 10, 2024
Metal

AssemblyAI / assemblyai-java-sdk

Star

The AssemblyAI Java SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, audio intelligence models, as well as the latest LeMUR models.

java ai speech-to-text transcription stt asr assemblyai llm