Code Stars

innovatorved/whisper.api
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
Language: Python
Total stars: 228
Stars trend:
22 Aug 2023

 5pm ▎ +2

 6pm ▏ +1

 7pm ██▊ +22

 8pm ███████ +56

 9pm █████▎ +42

10pm ████▏ +33

11pm ███▋ +29

23 Aug 2023

12am ██ +16

#python
#asr, #innovatorved, #transcribe, #whisper

48 views01:16

Code Stars

huggingface/distil-whisper

Total stars: 170
Stars trend:
31 Oct 2023

 5pm ▋ +5

 6pm ████▋ +37

 7pm ███▍ +27

 8pm ███▍ +27

 9pm ██▍ +19

10pm ███ +24

11pm ██▌ +20

#audio, #speechrecognition, #whisper

59 views00:18

Code Stars

transcriptionstream/transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
Language:Python
Total stars: 200
Stars trend:

26 May 2024
 7pm ▊ +6
 8pm █████▍ +43
 9pm ███▉ +31
10pm ██▊ +22

#python
#automation, #diarization, #llm, #mistral7b, #ollama, #speakerdiarization, #speechrecognition, #transcription, #whisper, #whisperx

119 views23:17

Code Stars

xenova/whisper-web
ML-powered speech recognition directly in your browser
Language:TypeScript
Total stars: 676
Stars trend:

9 Jun 2024
 3pm ▏ +1
 4pm  +0
 5pm ▏ +1
 6pm █ +8
 7pm █▋ +13
 8pm █ +8
 9pm █▏ +9
10pm ▉ +7
11pm █▎ +10
10 Jun 2024
12am █▋ +13
 1am ▋ +5

#typescript
#javascript, #transformers, #whisper

137 views02:17

Code Stars

mezbaul-h/june
Local voice assistant combining the power of Ollama, Hugging Face Transformers, and the Coqui TTS Toolkit
Language:Python
Total stars: 137
Stars trend:

21 Jun 2024
12am ▍ +3
 1am ██ +16
 2am █▊ +14
 3am █ +8
 4am █▊ +14
 5am ██▎ +18
 6am ██▎ +18
 7am ██▍ +19

#python
#ai, #assistantchatbots, #chatbot, #cliapp, #commandlinetool, #coquitts, #huggingface, #largelanguagemodels, #llm, #python, #speechrecognition, #speechtotext, #texttospeech, #whisper

106 views08:17

Code Stars

ai-ng/swift
Fast voice assistant powered by Groq, Cartesia, and Vercel.
Language:TypeScript
Total stars: 188
Stars trend:

8 Jul 2024
 6am ▏ +1
 7am ▍ +3
 8am ▉ +7
 9am ▌ +4
10am ▏ +1
11am █ +8
12pm █▍ +11
 1pm ▋ +5
 2pm ▊ +6
 3pm ▎ +2
 4pm ▊ +6
 5pm ███▋ +29

#typescript
#artificialintelligence, #cartesia, #groq, #llama, #nextjs, #react, #vercel, #whisper

107 views18:18

Code Stars

harry0703/AudioNotes
快速提取音视频内容，整理成一份结构化的markdown笔记
Language:Python
Total stars: 194
Stars trend:

22 Jul 2024
12am ▌ +4
 1am ▎ +2
 2am ▍ +3
 3am ▎ +2
 4am █ +8
 5am ██ +16
 6am █▉ +15
 7am ██ +16
 8am █▎ +10

#python
#ai, #asr, #funasr, #ollama, #python, #qwen2, #whisper

108 views09:23

Code Stars

Woolverine94/biniou
a self-hosted webui for 30+ generative ai
Language:Python
Total stars: 369
Stars trend:

26 Aug 2024
11am █▏ +9
12pm █▎ +10
 1pm █▏ +9
 2pm █▎ +10
 3pm █▏ +9
 4pm █▋ +13
 5pm █ +8
 6pm █▎ +10
 7pm █▍ +11
 8pm █▋ +13
 9pm ▋ +5
10pm ▉ +7

#python
#animatediff, #audiogen, #bark, #controlnet, #diffusers, #generativeai, #gfpgan, #gradio, #huggingface, #insightface, #ipadapter, #kandinsky, #llamacpppython, #musicgen, #photomaker, #realesrgan, #stablediffusion, #stablediffusion3, #webui, #whisper

80 views02:41

Code Stars

m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python
Total stars: 10840
Stars trend:

3 Sep 2024
 9am ▏ +1
10am  +0
11am  +0
12pm ▏ +1
 1pm ▏ +1
 2pm ▌ +4
 3pm ▋ +5
 4pm ▌ +4
 5pm █▎ +10
 6pm ██▏ +17
 7pm ██▎ +18
 8pm ███▏ +25

#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper

96 views21:19

Code Stars

NexaAI/nexa-sdk
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
Language:Python
Total stars: 1085
Stars trend:

22 Sep 2024
10pm █ +8
11pm ▊ +6
23 Sep 2024
12am ▍ +3
 1am ▊ +6
 2am █▎ +10
 3am ▋ +5
 4am █ +8
 5am ▌ +4
 6am █▏ +9
 7am ▌ +4
 8am ▌ +4
 9am █▏ +9

#python
#asr, #edgecomputing, #languagemodel, #llm, #ondeviceai, #ondeviceml, #sdk, #sdkpython, #stablediffusion, #transformers, #tts, #vlm, #whisper

232 views10:18

Code Stars

abus-aikorea/voice-pro
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech, and Translation.
Language:Python
Total stars: 385
Stars trend:

9 Nov 2024
10pm ▏ +1
11pm ▌ +4
10 Nov 2024
12am ▎ +2
 1am █▏ +9
 2am ██▏ +17
 3am █▎ +10
 4am ▉ +7
 5am ▊ +6
 6am ▍ +3
 7am ▌ +4
 8am ▌ +4
 9am █ +8

#python
#asr, #demucs, #fasterwhisper, #gradio, #speechrecognition, #speechsynthesis, #speechtotext, #stt, #subtitles, #texttospeech, #transcription, #translate, #translation, #translator, #tts, #uvr5, #webui, #webui, #whisper, #ytdlp

102 views10:17

Code Stars

microsoft/ai-dev-gallery
An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.
Language:C#
Total stars: 442
Stars trend:

1 Jan 2025
 9pm ▏ +1
10pm ▏ +1
11pm  +0
2 Jan 2025
12am ▋ +5
 1am █▏ +9
 2am ██▌ +20
 3am █ +8
 4am ▋ +5
 5am ██▍ +19
 6am █▍ +11

#csharp
#ai, #csharp, #developertools, #directml, #dotnet, #genai, #mistral, #npu, #onnx, #onnxruntime, #onnxruntimegenai, #phi3, #qnn, #stablediffusion, #visualstudio, #whisper, #winappsdk, #windows, #winui3, #wpf

125 views07:17

Code Stars

abus-aikorea/voice-pro
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation(UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
Language:Python
Total stars: 2643
Stars trend:

21 Jan 2025
 6am ██▍ +19
 7am ▎ +2
 8am ▌ +4
 9am ▍ +3
10am  +0
11am ▋ +5
12pm ▌ +4
 1pm ▊ +6
 2pm █▋ +13
 3pm ▉ +7
 4pm ▉ +7
 5pm ▊ +6

#python
#audiobook, #fasterwhisper, #gradio, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #webui, #whisper, #ytdlp

108 views18:17

Code Stars

Zackriya-Solutions/meeting-minutes
An open source Live Ai based meeting note taker and minutes generator that can completely run in your Local device (Mac OS support is added. Will add windows and linux support soon)
Language:C++
Total stars: 510
Stars trend:

14 Feb 2025
 5pm ▏ +1
 6pm  +0
 7pm █▏ +9
 8pm ██ +16
 9pm ██▍ +19
10pm █ +8
11pm ▋ +5
15 Feb 2025
12am ▉ +7
 1am ▉ +7
 2am █ +8
 3am ▊ +6
 4am ▊ +6

#cplusplus
#ai, #automation, #crossplatform, #linux, #live, #llm, #mac, #macosapp, #meetingminutes, #meetingnotes, #recorder, #rust, #whisper, #whispercpp, #windows

50 views05:17

Code Stars

CodeUpdaterBot/ClickUi
The best way to use AI is on your own computer. Use local or paid API models, and ctrl+k to show/hide the chat UI. Experience the future of AI, and help build it too!
Language:Python
Total stars: 128
Stars trend:

2 Mar 2025
 8am ▎ +2
 9am ▌ +4
10am █ +8
11am ▉ +7
12pm ▉ +7
 1pm ▊ +6
 2pm █ +8
 3pm █▍ +11
 4pm ▉ +7
 5pm █▊ +14
 6pm █▎ +10
 7pm █▏ +9

#python
#ai, #chatgpt, #claude, #deepseek, #hotkeys, #kokoro, #ollama, #opensource, #openai, #python, #sonos, #whisper

65 views20:17

Code Stars

modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python
Total stars: 9523
Stars trend:

8 Apr 2025
 2pm ▉ +7
 3pm █▏ +9
 4pm ▋ +5
 5pm ▋ +5
 6pm ▌ +4
 7pm ▊ +6
 8pm █ +8
 9pm ▎ +2
10pm ▍ +3
11pm ▊ +6
9 Apr 2025
12am ▊ +6
 1am ██▏ +17

#python
#audiovisualspeechrecognition, #conformer, #dfsmn, #paraformer, #pretrainedmodel, #punctuation, #pytorch, #rnnt, #speakerdiarization, #speechrecognition, #speechgpt, #speechllm, #vad, #voiceactivitydetection, #whisper

80 views02:18

Code Stars

umlx5h/LLPlayer
The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more!
Language:C#
Total stars: 838
Stars trend:

12 Apr 2025
 3am ▎ +2
 4am ▎ +2
 5am ▍ +3
 6am ▏ +1
 7am ▌ +4
 8am ▏ +1
 9am ▏ +1
10am ▍ +3
11am █▎ +10
12pm ████▏ +33
 1pm ▍ +3
 2pm █▋ +13

#csharp
#asr, #csharp, #fasterwhisper, #flyleaf, #languagelearning, #llm, #mediaplayer, #ocr, #ollama, #player, #video, #videoplayer, #whisper, #wpf, #ytdlp

86 views15:19

Code Stars

m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python
Total stars: 15449
Stars trend:

6 May 2025
 4am ▊ +6
 5am █▏ +9
 6am ▊ +6
 7am ▌ +4
 8am █▏ +9
 9am ▊ +6
10am ▋ +5
11am █▏ +9
12pm ▍ +3
 1pm ▍ +3
 2pm █▎ +10
 3pm ▉ +7

#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper

94 views16:17

Code Stars

niedev/RTranslator
Open source real-time translation app for Android that runs locally
Language:C++
Total stars: 8132
Stars trend:

14 Jun 2025
 7am ▏ +1
 8am  +0
 9am  +0
10am  +0
11am  +0
12pm █▌ +12
 1pm ██▍ +19
 2pm ██▋ +21
 3pm █▍ +11
 4pm █▏ +9
 5pm ▊ +6

#cplusplus
#android, #androidapp, #bluetoothle, #mobileapp, #nllb, #offline, #onnx, #onnxruntime, #realtimetranslator, #sentencepiece, #transformers, #translation, #translator, #whisper

103 views18:16

Code Stars

abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Language:Python
Total stars: 3929
Stars trend:

27 Jun 2025
 8am ▍ +3
 9am █▊ +14
10am ▊ +6
11am █▉ +15
12pm █▉ +15
 1pm █▉ +15
 2pm █▊ +14
 3pm ▉ +7
 4pm █ +8
 5pm ▌ +4

#python
#audiobook, #fasterwhisper, #gradio, #karaoke, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #voiceconversion, #webui, #whisper, #whisperx, #ytdlp

88 views18:17

About

Blog

Apps

Platform