Code Stars
1.87K subscribers
8.63K photos
8.92K links
Code Stars provides notifications about GitHub repositories that are gaining a significant number of stars in a short period of time. Be the first to find out about trending repositories that everybody will be talking about soon.
#AI #chatGPT #python
Download Telegram
innovatorved/whisper.api
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
Language: Python
Total stars: 228
Stars trend:
22 Aug 2023
 5pm ▎ +2

 6pm ▏ +1

 7pm ██▊ +22

 8pm ███████ +56

 9pm █████▎ +42

10pm ████▏ +33

11pm ███▋ +29

23 Aug 2023
12am ██ +16

#python
#asr, #innovatorved, #transcribe, #whisper
huggingface/distil-whisper


Total stars: 170
Stars trend:
31 Oct 2023
 5pm ▋ +5

 6pm ████▋ +37

 7pm ███▍ +27

 8pm ███▍ +27

 9pm ██▍ +19

10pm ███ +24

11pm ██▌ +20


#audio, #speechrecognition, #whisper
transcriptionstream/transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
Language:Python
Total stars: 200
Stars trend:
26 May 2024
7pm ▊ +6
8pm █████▍ +43
9pm ███▉ +31
10pm ██▊ +22

#python
#automation, #diarization, #llm, #mistral7b, #ollama, #speakerdiarization, #speechrecognition, #transcription, #whisper, #whisperx
xenova/whisper-web
ML-powered speech recognition directly in your browser
Language:TypeScript
Total stars: 676
Stars trend:
9 Jun 2024
3pm ▏ +1
4pm +0
5pm ▏ +1
6pm █ +8
7pm █▋ +13
8pm █ +8
9pm █▏ +9
10pm ▉ +7
11pm █▎ +10
10 Jun 2024
12am █▋ +13
1am ▋ +5

#typescript
#javascript, #transformers, #whisper
mezbaul-h/june
Local voice assistant combining the power of Ollama, Hugging Face Transformers, and the Coqui TTS Toolkit
Language:Python
Total stars: 137
Stars trend:
21 Jun 2024
12am ▍ +3
1am ██ +16
2am █▊ +14
3am █ +8
4am █▊ +14
5am ██▎ +18
6am ██▎ +18
7am ██▍ +19

#python
#ai, #assistantchatbots, #chatbot, #cliapp, #commandlinetool, #coquitts, #huggingface, #largelanguagemodels, #llm, #python, #speechrecognition, #speechtotext, #texttospeech, #whisper
ai-ng/swift
Fast voice assistant powered by Groq, Cartesia, and Vercel.
Language:TypeScript
Total stars: 188
Stars trend:
8 Jul 2024
6am ▏ +1
7am ▍ +3
8am ▉ +7
9am ▌ +4
10am ▏ +1
11am █ +8
12pm █▍ +11
1pm ▋ +5
2pm ▊ +6
3pm ▎ +2
4pm ▊ +6
5pm ███▋ +29

#typescript
#artificialintelligence, #cartesia, #groq, #llama, #nextjs, #react, #vercel, #whisper
harry0703/AudioNotes
快速提取音视频内容,整理成一份结构化的markdown笔记
Language:Python
Total stars: 194
Stars trend:
22 Jul 2024
12am ▌ +4
1am ▎ +2
2am ▍ +3
3am ▎ +2
4am █ +8
5am ██ +16
6am █▉ +15
7am ██ +16
8am █▎ +10

#python
#ai, #asr, #funasr, #ollama, #python, #qwen2, #whisper
Woolverine94/biniou
a self-hosted webui for 30+ generative ai
Language:Python
Total stars: 369
Stars trend:
26 Aug 2024
11am █▏ +9
12pm █▎ +10
1pm █▏ +9
2pm █▎ +10
3pm █▏ +9
4pm █▋ +13
5pm █ +8
6pm █▎ +10
7pm █▍ +11
8pm █▋ +13
9pm ▋ +5
10pm ▉ +7

#python
#animatediff, #audiogen, #bark, #controlnet, #diffusers, #generativeai, #gfpgan, #gradio, #huggingface, #insightface, #ipadapter, #kandinsky, #llamacpppython, #musicgen, #photomaker, #realesrgan, #stablediffusion, #stablediffusion3, #webui, #whisper
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python
Total stars: 10840
Stars trend:
3 Sep 2024
9am ▏ +1
10am +0
11am +0
12pm ▏ +1
1pm ▏ +1
2pm ▌ +4
3pm ▋ +5
4pm ▌ +4
5pm █▎ +10
6pm ██▏ +17
7pm ██▎ +18
8pm ███▏ +25

#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper
NexaAI/nexa-sdk
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
Language:Python
Total stars: 1085
Stars trend:
22 Sep 2024
10pm █ +8
11pm ▊ +6
23 Sep 2024
12am ▍ +3
1am ▊ +6
2am █▎ +10
3am ▋ +5
4am █ +8
5am ▌ +4
6am █▏ +9
7am ▌ +4
8am ▌ +4
9am █▏ +9

#python
#asr, #edgecomputing, #languagemodel, #llm, #ondeviceai, #ondeviceml, #sdk, #sdkpython, #stablediffusion, #transformers, #tts, #vlm, #whisper
abus-aikorea/voice-pro
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech, and Translation.
Language:Python
Total stars: 385
Stars trend:
9 Nov 2024
10pm ▏ +1
11pm ▌ +4
10 Nov 2024
12am ▎ +2
1am █▏ +9
2am ██▏ +17
3am █▎ +10
4am ▉ +7
5am ▊ +6
6am ▍ +3
7am ▌ +4
8am ▌ +4
9am █ +8

#python
#asr, #demucs, #fasterwhisper, #gradio, #speechrecognition, #speechsynthesis, #speechtotext, #stt, #subtitles, #texttospeech, #transcription, #translate, #translation, #translator, #tts, #uvr5, #webui, #webui, #whisper, #ytdlp
microsoft/ai-dev-gallery
An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.
Language:C#
Total stars: 442
Stars trend:
1 Jan 2025
9pm ▏ +1
10pm ▏ +1
11pm +0
2 Jan 2025
12am ▋ +5
1am █▏ +9
2am ██▌ +20
3am █ +8
4am ▋ +5
5am ██▍ +19
6am █▍ +11

#csharp
#ai, #csharp, #developertools, #directml, #dotnet, #genai, #mistral, #npu, #onnx, #onnxruntime, #onnxruntimegenai, #phi3, #qnn, #stablediffusion, #visualstudio, #whisper, #winappsdk, #windows, #winui3, #wpf
abus-aikorea/voice-pro
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation(UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
Language:Python
Total stars: 2643
Stars trend:
21 Jan 2025
6am ██▍ +19
7am ▎ +2
8am ▌ +4
9am ▍ +3
10am +0
11am ▋ +5
12pm ▌ +4
1pm ▊ +6
2pm █▋ +13
3pm ▉ +7
4pm ▉ +7
5pm ▊ +6

#python
#audiobook, #fasterwhisper, #gradio, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #webui, #whisper, #ytdlp
Zackriya-Solutions/meeting-minutes
An open source Live Ai based meeting note taker and minutes generator that can completely run in your Local device (Mac OS support is added. Will add windows and linux support soon)
Language:C++
Total stars: 510
Stars trend:
14 Feb 2025
5pm ▏ +1
6pm +0
7pm █▏ +9
8pm ██ +16
9pm ██▍ +19
10pm █ +8
11pm ▋ +5
15 Feb 2025
12am ▉ +7
1am ▉ +7
2am █ +8
3am ▊ +6
4am ▊ +6

#cplusplus
#ai, #automation, #crossplatform, #linux, #live, #llm, #mac, #macosapp, #meetingminutes, #meetingnotes, #recorder, #rust, #whisper, #whispercpp, #windows
CodeUpdaterBot/ClickUi
The best way to use AI is on your own computer. Use local or paid API models, and ctrl+k to show/hide the chat UI. Experience the future of AI, and help build it too!
Language:Python
Total stars: 128
Stars trend:
2 Mar 2025
8am ▎ +2
9am ▌ +4
10am █ +8
11am ▉ +7
12pm ▉ +7
1pm ▊ +6
2pm █ +8
3pm █▍ +11
4pm ▉ +7
5pm █▊ +14
6pm █▎ +10
7pm █▏ +9

#python
#ai, #chatgpt, #claude, #deepseek, #hotkeys, #kokoro, #ollama, #opensource, #openai, #python, #sonos, #whisper
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python
Total stars: 9523
Stars trend:
8 Apr 2025
2pm ▉ +7
3pm █▏ +9
4pm ▋ +5
5pm ▋ +5
6pm ▌ +4
7pm ▊ +6
8pm █ +8
9pm ▎ +2
10pm ▍ +3
11pm ▊ +6
9 Apr 2025
12am ▊ +6
1am ██▏ +17

#python
#audiovisualspeechrecognition, #conformer, #dfsmn, #paraformer, #pretrainedmodel, #punctuation, #pytorch, #rnnt, #speakerdiarization, #speechrecognition, #speechgpt, #speechllm, #vad, #voiceactivitydetection, #whisper
umlx5h/LLPlayer
The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more!
Language:C#
Total stars: 838
Stars trend:
12 Apr 2025
3am ▎ +2
4am ▎ +2
5am ▍ +3
6am ▏ +1
7am ▌ +4
8am ▏ +1
9am ▏ +1
10am ▍ +3
11am █▎ +10
12pm ████▏ +33
1pm ▍ +3
2pm █▋ +13

#csharp
#asr, #csharp, #fasterwhisper, #flyleaf, #languagelearning, #llm, #mediaplayer, #ocr, #ollama, #player, #video, #videoplayer, #whisper, #wpf, #ytdlp
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python
Total stars: 15449
Stars trend:
6 May 2025
4am ▊ +6
5am █▏ +9
6am ▊ +6
7am ▌ +4
8am █▏ +9
9am ▊ +6
10am ▋ +5
11am █▏ +9
12pm ▍ +3
1pm ▍ +3
2pm █▎ +10
3pm ▉ +7

#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper
niedev/RTranslator
Open source real-time translation app for Android that runs locally
Language:C++
Total stars: 8132
Stars trend:
14 Jun 2025
7am ▏ +1
8am +0
9am +0
10am +0
11am +0
12pm █▌ +12
1pm ██▍ +19
2pm ██▋ +21
3pm █▍ +11
4pm █▏ +9
5pm ▊ +6

#cplusplus
#android, #androidapp, #bluetoothle, #mobileapp, #nllb, #offline, #onnx, #onnxruntime, #realtimetranslator, #sentencepiece, #transformers, #translation, #translator, #whisper
abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Language:Python
Total stars: 3929
Stars trend:
27 Jun 2025
8am ▍ +3
9am █▊ +14
10am ▊ +6
11am █▉ +15
12pm █▉ +15
1pm █▉ +15
2pm █▊ +14
3pm ▉ +7
4pm █ +8
5pm ▌ +4

#python
#audiobook, #fasterwhisper, #gradio, #karaoke, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #voiceconversion, #webui, #whisper, #whisperx, #ytdlp