Code Stars
1.87K subscribers
8.64K photos
8.93K links
Code Stars provides notifications about GitHub repositories that are gaining a significant number of stars in a short period of time. Be the first to find out about trending repositories that everybody will be talking about soon.
#AI #chatGPT #python
Download Telegram
jianchang512/stt
Voice Recognition to Text Tool / 一个离线运行的本地语音识别转文字服务,输出json、srt字幕带时间戳、纯文字格式
Language:Python
Total stars: 233
Stars trend:
5 Jan 2024
12am ▊ +6
1am ████▌ +36
2am ███▌ +28
3am ██▍ +19
4am █ +8
5am █▋ +13
6am ██▏ +17
7am █▌ +12
8am ▋ +5
9am █ +8

#python
#speech, #speechrecognition, #speechtotext, #stt
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift
Language:C++
Total stars: 1120
Stars trend:
6 Jun 2024
4pm ▏ +1
5pm ▏ +1
6pm +0
7pm +0
8pm +0
9pm +0
10pm +0
11pm +0
7 Jun 2024
12am ▍ +3
1am ████▌ +36
2am ███▎ +26
3am ██ +16

#cplusplus
#aarch64, #android, #arm32, #asr, #cpp, #csharp, #dotnet, #ios, #linux, #macos, #mfc, #onnx, #openkylin, #raspberrypi, #riscv, #speechtotext, #texttospeech, #vits, #windows
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python
Total stars: 389
Stars trend:
17 Jun 2024
9pm ▏ +1
10pm ▏ +1
11pm ▎ +2
18 Jun 2024
12am +0
1am ▋ +5
2am ▍ +3
3am █▍ +11
4am ███ +24
5am █▋ +13
6am █ +8
7am ▉ +7

#python
#allinone, #asr, #audioprocessing, #machinetranslation, #nonautoregressive, #seamless, #simultaneoustranslation, #speech, #speechenhancement, #speechprocessing, #speechrecognition, #speechsynthesis, #speechtotext, #speechtranslation, #streamingaudio, #texttoaudio, #texttospeech, #translation, #tts, #voice
mezbaul-h/june
Local voice assistant combining the power of Ollama, Hugging Face Transformers, and the Coqui TTS Toolkit
Language:Python
Total stars: 137
Stars trend:
21 Jun 2024
12am ▍ +3
1am ██ +16
2am █▊ +14
3am █ +8
4am █▊ +14
5am ██▎ +18
6am ██▎ +18
7am ██▍ +19

#python
#ai, #assistantchatbots, #chatbot, #cliapp, #commandlinetool, #coquitts, #huggingface, #largelanguagemodels, #llm, #python, #speechrecognition, #speechtotext, #texttospeech, #whisper
bugbakery/audapolis
an editor for spoken-word audio with automatic transcription
Language:TypeScript
Total stars: 835
Stars trend:
22 Jul 2024
5pm ███████▋ +61
6pm █████████▏ +73

#typescript
#audioediting, #speechtotext, #transcription, #videoediting
yeyupiaoling/MASR
Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。
Language:Python
Total stars: 580
Stars trend:
3 Aug 2024
2pm ███████████▎ +90
3pm +0
4pm +0
5pm +0
6pm +0
7pm +0
8pm +0
9pm +0
10pm +0
11pm +0
4 Aug 2024
12am +0
1am ▏ +1

#python
#asr, #conformer, #deeplearning, #deepspeech, #pytorch, #speech, #speechrecognition, #speechtotext, #squeezeformer
huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Language:Python
Total stars: 2735
Stars trend:
3 Sep 2024
1am ▍ +3
2am +0
3am +0
4am ▏ +1
5am +0
6am ▌ +4
7am █▏ +9
8am ██▎ +18
9am ██▍ +19
10am █▏ +9
11am ▊ +6
12pm █▍ +11

#python
#ai, #assistant, #languagemodel, #machinelearning, #python, #speech, #speechsynthesis, #speechtotext, #speechtranslation
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python
Total stars: 10840
Stars trend:
3 Sep 2024
9am ▏ +1
10am +0
11am +0
12pm ▏ +1
1pm ▏ +1
2pm ▌ +4
3pm ▋ +5
4pm ▌ +4
5pm █▎ +10
6pm ██▏ +17
7pm ██▎ +18
8pm ███▏ +25

#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper
ictnlp/LLaMA-Omni
Low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct.
Language:Python
Total stars: 109
Stars trend:
11 Sep 2024
4am ▍ +3
5am +0
6am █ +8
7am █▎ +10
8am █▎ +10
9am ▌ +4
10am ▊ +6
11am ▍ +3
12pm █▏ +9
1pm ▋ +5
2pm █▏ +9
3pm █▏ +9

#python
#largelanguagemodels, #multimodallargelanguagemodels, #speechinteraction, #speechlanguagemodel, #speechtospeech, #speechtotext
abus-aikorea/voice-pro
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech, and Translation.
Language:Python
Total stars: 385
Stars trend:
9 Nov 2024
10pm ▏ +1
11pm ▌ +4
10 Nov 2024
12am ▎ +2
1am █▏ +9
2am ██▏ +17
3am █▎ +10
4am ▉ +7
5am ▊ +6
6am ▍ +3
7am ▌ +4
8am ▌ +4
9am █ +8

#python
#asr, #demucs, #fasterwhisper, #gradio, #speechrecognition, #speechsynthesis, #speechtotext, #stt, #subtitles, #texttospeech, #transcription, #translate, #translation, #translator, #tts, #uvr5, #webui, #webui, #whisper, #ytdlp
KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Language:Python
Total stars: 2654
Stars trend:
15 Jan 2025
2am ▎ +2
3am ▍ +3
4am +0
5am ▊ +6
6am ▎ +2
7am ▎ +2
8am ▎ +2
9am ▌ +4
10am ▎ +2
11am ██▏ +17
12pm ██▍ +19
1pm ██▍ +19

#python
#python, #realtime, #speechtotext
abus-aikorea/voice-pro
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation(UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
Language:Python
Total stars: 2643
Stars trend:
21 Jan 2025
6am ██▍ +19
7am ▎ +2
8am ▌ +4
9am ▍ +3
10am +0
11am ▋ +5
12pm ▌ +4
1pm ▊ +6
2pm █▋ +13
3pm ▉ +7
4pm ▉ +7
5pm ▊ +6

#python
#audiobook, #fasterwhisper, #gradio, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #webui, #whisper, #ytdlp
amanvirparhar/chaplin
A real-time silent speech recognition tool.
Language:Python
Total stars: 84
Stars trend:
3 Feb 2025
12am ▏ +1
1am █▍ +11
2am ▊ +6
3am ▌ +4
4am ▊ +6
5am █▏ +9
6am ▌ +4
7am ███▏ +25
8am █▎ +10

#python
#autoavsr, #avsr, #llm, #ollama, #speechrecognition, #speechtotext, #vsr
freddyaboulton/fastrtc
The python library for real-time communication
Language:Python
Total stars: 1312
Stars trend:
28 Feb 2025
6pm █▏ +9
7pm █▍ +11
8pm █▎ +10
9pm ▉ +7
10pm █▍ +11
11pm █▏ +9
1 Mar 2025
12am █▍ +11
1am █▍ +11
2am █▍ +11
3am █▋ +13
4am █▊ +14
5am ██ +16

#python
#artificialintelligence, #llm, #python, #realtime, #speechtotext, #texttospeech
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python
Total stars: 15449
Stars trend:
6 May 2025
4am ▊ +6
5am █▏ +9
6am ▊ +6
7am ▌ +4
8am █▏ +9
9am ▊ +6
10am ▋ +5
11am █▏ +9
12pm ▍ +3
1pm ▍ +3
2pm █▎ +10
3pm ▉ +7

#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper
Blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Language:Python
Total stars: 1009
Stars trend:
8 May 2025
5am ▍ +3
6am ▍ +3
7am ▏ +1
8am ▍ +3
9am +0
10am ▋ +5
11am █▍ +11
12pm █▍ +11
1pm ▍ +3
2pm █▊ +14
3pm █▊ +14
4pm █ +8

#python
#applesilicon, #audioprocessing, #mlx, #multimodal, #speechrecognition, #speechsynthesis, #speechtotext, #texttospeech, #transformers
Capsize-Games/airunner
Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows
Language:Python
Total stars: 948
Stars trend:
17 May 2025
8am ▎ +2
9am ▎ +2
10am ▍ +3
11am █▏ +9
12pm █▏ +9
1pm █▏ +9
2pm █▎ +10
3pm █▏ +9
4pm ▋ +5
5pm ▋ +5
6pm █ +8
7pm ▊ +6

#python
#ai, #aiart, #art, #assetgenerator, #chatbot, #deeplearning, #desktopapp, #imagegeneration, #mistral, #multimodal, #privacy, #pygame, #pyside6, #python, #selfhosted, #speechtotext, #stablediffusion, #texttoimage, #texttospeech, #texttospeechapp
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Language:Jupyter Notebook
Total stars: 10057
Stars trend:
7 Jun 2025
7pm ▍ +3
8pm ▋ +5
9pm ▎ +2
10pm ▊ +6
11pm ▉ +7
8 Jun 2025
12am ▉ +7
1am ▉ +7
2am ▉ +7
3am █ +8
4am █▎ +10
5am ▋ +5
6am █▏ +9

#jupyternotebook
#android, #asr, #deeplearning, #deepneuralnetworks, #deepspeech, #googlespeechtotext, #ios, #kaldi, #offline, #privacy, #python, #raspberrypi, #speakeridentification, #speakerverification, #speechrecognition, #speechtotext, #speechtotextandroid, #stt, #voicerecognition, #vosk
abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Language:Python
Total stars: 3929
Stars trend:
27 Jun 2025
8am ▍ +3
9am █▊ +14
10am ▊ +6
11am █▉ +15
12pm █▉ +15
1pm █▉ +15
2pm █▊ +14
3pm ▉ +7
4pm █ +8
5pm ▌ +4

#python
#audiobook, #fasterwhisper, #gradio, #karaoke, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #voiceconversion, #webui, #whisper, #whisperx, #ytdlp