#

speechrecognition

Here are 162 public repositories matching this topic...

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated Dec 28, 2024
Python

speechbrain / speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Updated Dec 9, 2024
HTML

revdotcom / reverb

Open source inference code for Rev's model

docker open-source opensource neural-network canary speech-recognition deeplearning speech-to-text whisper rev asr speaker-diarization speechrecognition asr-model diarization huggingface revai pyannote wenet

Updated Dec 19, 2024
Python

robmsmt / KerasDeepSpeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

machine-learning deep-learning neural-network keras nn speech neural-networks baidu deeplearning speech-to-text asr ctc speechrecognition coreml deepspeech

Updated Mar 17, 2018
Python

Azure-Samples / SpeechToText-WebSockets-Javascript

SDK & Sample to do speech recognition using websockets in Javascript

javascript microsoft typescript browser sdk recognition js websocket websockets speech ts speech-recognition cognitive-services speechtotext speechrecognition microsoft-speech-service

Updated Mar 25, 2019
TypeScript

SamirPaulb / real-time-voice-translator

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

Updated Jan 22, 2024
Tcl

roshan9419 / PersonalAssistantChatbot

It is a personal assistant chatbot, capable to perform many tasks same as Google Assistant plus more extra features...

opencv chatbot tkinter speechrecognition pyttsx3 assistatant

Updated Jan 9, 2023
Python

by2101 / OpenASR

A pytorch based end2end speech recognition system.

speech transformer speech-recognition las speech-to-text asr speech-recognizer speechrecognition end2end

Updated Jan 16, 2021
Python

shangeth / wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

pytorch voice-recognition speech-recognition semi-supervised-learning deeplearning representation-learning unsupervised-learning speaker-recognition hacktoberfest speech-processing audio-processing speechrecognition

Updated Jun 6, 2021
Python

Open-Speech-EkStep / vakyansh-wav2vec2-experimentation

Repository containing experimentation platform on how to train, infer on wav2vec2 models.

open-source speech pytorch speech-recognition asr indic-scripts indic-languages speechrecognition speechrecognition-python speech-recognition-model

Updated Sep 22, 2022
Python

goxr3plus / java-google-speech-api

🙊 Speech Recognition , Text To Speech , Google Translate

text-to-speech google-translate speechrecognition

Updated Sep 10, 2023
Java

WeBAD

solyarisoftware / WeBAD

Web Browser Audio Detection/Speech Recording Events API

audio javascript browser dom webrtc voice speech microphone voice-recognition recording volume push-to-talk volume-control audio-processing speechrecognition voice-interface recording-button audi-capture

Updated Jul 15, 2022
JavaScript

syntithenai / opensnips

Open source projects related to Snips https://snips.ai/.

docker nlu dialog speech kaldi audio-server rasa hotwords snowboy snips asr speechrecognition porcupine hark snips-skills

Updated Jan 12, 2023
JavaScript

botbahlul / autosrt

A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Google Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file

python ffmpeg captions voice-recognition speech-recognition subtitle speechrecognition voicerecognition google-translate-api subriptext auto-caption auto-subtitle srt-subtitle

Updated May 5, 2024
Python

jindongwang / EasyEspnet

Making Espnet easier to use

toolkit speech speech-recognition easy-to-use asr speechrecognition espnet

Updated Apr 9, 2021
Python

IS2AI / ISSAI_SAIDA_Kazakh_ASR

the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.

speech-synthesis speech-recognition speech-to-text speechrecognition

Updated Jul 30, 2021
Shell

rollingstarky / Python-Voice-Assistant

A Python based Voice Assistant like Siri

python ai chatbot tts stt speechrecognition

Updated Oct 1, 2020
Python

AppleHolic / PytorchSR

Pytorch based phoneme recognition (TIMIT phoneme classification)

paper pytorch timit speechrecognition minimalgru cbhg

Updated Apr 25, 2018
Python

speech

ng-web-apis / speech

A library for using Web Speech API with Angular

text-to-speech angular speech speech-synthesis speech-recognition speech-to-text speechrecognition speech-api

Updated May 29, 2023
TypeScript

botbahlul / pyvosklivesubtitle

PySimpleGUI based DESKTOP APP that can RECOGNIZE any live streaming in 23 languages that supported by VOSK then TRANSLATE (using unofficial online Google Translate API) and display it as LIVE CAPTION / LIVE SUBTITLE

python ffmpeg voice-recognition caption speech-recognition subtitle speechrecognition voicerecognition google-translate-api pysimplegui live-caption vosk auto-caption live-subtitle

Updated May 5, 2024
Python

Improve this page

Add a description, image, and links to the speechrecognition topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speechrecognition topic, visit your repo's landing page and select "manage topics."