Code Stars

KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Language:Python
Total stars: 2654
Stars trend:

15 Jan 2025
 2am ▎ +2
 3am ▍ +3
 4am  +0
 5am ▊ +6
 6am ▎ +2
 7am ▎ +2
 8am ▎ +2
 9am ▌ +4
10am ▎ +2
11am ██▏ +17
12pm ██▍ +19
 1pm ██▍ +19

#python
#python, #realtime, #speechtotext

85 views · 14:18

abus-aikorea/voice-pro
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation (UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
Language:Python
Total stars: 2643
Stars trend:

21 Jan 2025
 6am ██▍ +19
 7am ▎ +2
 8am ▌ +4
 9am ▍ +3
10am  +0
11am ▋ +5
12pm ▌ +4
 1pm ▊ +6
 2pm █▋ +13
 3pm ▉ +7
 4pm ▉ +7
 5pm ▊ +6

#python
#audiobook, #fasterwhisper, #gradio, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #webui, #whisper, #ytdlp

108 views · 18:17

amanvirparhar/chaplin
A real-time silent speech recognition tool.
Language:Python
Total stars: 84
Stars trend:

3 Feb 2025
12am ▏ +1
 1am █▍ +11
 2am ▊ +6
 3am ▌ +4
 4am ▊ +6
 5am █▏ +9
 6am ▌ +4
 7am ███▏ +25
 8am █▎ +10

#python
#autoavsr, #avsr, #llm, #ollama, #speechrecognition, #speechtotext, #vsr

212 views · 09:19

freddyaboulton/fastrtc
The Python library for real-time communication
Language:Python
Total stars: 1312
Stars trend:

28 Feb 2025
 6pm █▏ +9
 7pm █▍ +11
 8pm █▎ +10
 9pm ▉ +7
10pm █▍ +11
11pm █▏ +9
1 Mar 2025
12am █▍ +11
 1am █▍ +11
 2am █▍ +11
 3am █▋ +13
 4am █▊ +14
 5am ██ +16

#python
#artificialintelligence, #llm, #python, #realtime, #speechtotext, #texttospeech

67 views · 06:17

m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python
Total stars: 15449
Stars trend:

6 May 2025
 4am ▊ +6
 5am █▏ +9
 6am ▊ +6
 7am ▌ +4
 8am █▏ +9
 9am ▊ +6
10am ▋ +5
11am █▏ +9
12pm ▍ +3
 1pm ▍ +3
 2pm █▎ +10
 3pm ▉ +7

#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper

94 views · 16:17

Blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Language:Python
Total stars: 1009
Stars trend:

8 May 2025
 5am ▍ +3
 6am ▍ +3
 7am ▏ +1
 8am ▍ +3
 9am  +0
10am ▋ +5
11am █▍ +11
12pm █▍ +11
 1pm ▍ +3
 2pm █▊ +14
 3pm █▊ +14
 4pm █ +8

#python
#applesilicon, #audioprocessing, #mlx, #multimodal, #speechrecognition, #speechsynthesis, #speechtotext, #texttospeech, #transformers

93 views · 17:17

Capsize-Games/airunner
Offline inference engine for art, real-time voice conversations, LLM-powered chatbots, and automated workflows
Language:Python
Total stars: 948
Stars trend:

17 May 2025
 8am ▎ +2
 9am ▎ +2
10am ▍ +3
11am █▏ +9
12pm █▏ +9
 1pm █▏ +9
 2pm █▎ +10
 3pm █▏ +9
 4pm ▋ +5
 5pm ▋ +5
 6pm █ +8
 7pm ▊ +6

#python
#ai, #aiart, #art, #assetgenerator, #chatbot, #deeplearning, #desktopapp, #imagegeneration, #mistral, #multimodal, #privacy, #pygame, #pyside6, #python, #selfhosted, #speechtotext, #stablediffusion, #texttoimage, #texttospeech, #texttospeechapp

98 views · 20:18

alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Language:Jupyter Notebook
Total stars: 10057
Stars trend:

7 Jun 2025
 7pm ▍ +3
 8pm ▋ +5
 9pm ▎ +2
10pm ▊ +6
11pm ▉ +7
8 Jun 2025
12am ▉ +7
 1am ▉ +7
 2am ▉ +7
 3am █ +8
 4am █▎ +10
 5am ▋ +5
 6am █▏ +9

#jupyternotebook
#android, #asr, #deeplearning, #deepneuralnetworks, #deepspeech, #googlespeechtotext, #ios, #kaldi, #offline, #privacy, #python, #raspberrypi, #speakeridentification, #speakerverification, #speechrecognition, #speechtotext, #speechtotextandroid, #stt, #voicerecognition, #vosk

107 views · 07:17

abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Language:Python
Total stars: 3929
Stars trend:

27 Jun 2025
 8am ▍ +3
 9am █▊ +14
10am ▊ +6
11am █▉ +15
12pm █▉ +15
 1pm █▉ +15
 2pm █▊ +14
 3pm ▉ +7
 4pm █ +8
 5pm ▌ +4

#python
#audiobook, #fasterwhisper, #gradio, #karaoke, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #voiceconversion, #webui, #whisper, #whisperx, #ytdlp

89 views · 18:17
