toverainc/willow
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
Language: C
Total stars: 255
Stars trend:
15 May 2023
8am ▏ +1
9am +0
10am +0
11am +0
12pm +0
1pm +0
2pm █████▋ +45
3pm ███████████▋ +93
4pm █████ +40
#c
#alexa, #deeplearning, #echo, #espadf, #espidf, #esp32, #googlehome, #homeassistant, #homeautomation, #privacy, #speechrecognition, #speechtotext, #whisper
StanGirard/quivr
Dump all your files and thoughts into your GenerativeAI brain and chat with it
Language: Python
Total stars: 435
Stars trend:
15 May 2023
12pm ▏ +1
1pm █ +8
2pm █▉ +15
3pm █▋ +13
4pm █▌ +12
5pm █▋ +13
6pm █▊ +14
7pm ██▊ +22
8pm ███▏ +25
9pm ▊ +6
10pm █▎ +10
11pm ██▏ +17
#python
#audio, #chat, #chatgpt, #csv, #embeddings, #generativeai, #obsidian, #pdf, #secondbrain, #vectorstore, #whisper
guillaumekln/faster-whisper
Faster Whisper transcription with CTranslate2
Language: Python
Total stars: 3284
Stars trend:
19 Jul 2023
1am ▍ +3
2am ▍ +3
3am ▎ +2
4am ██▌ +20
5am ██████▊ +54
6am █████▏ +41
7am ██▏ +17
8am ██▉ +23
9am █▍ +11
10am █ +8
11am ▉ +7
12pm █▍ +11
#python
#deeplearning, #inference, #openai, #quantization, #speechrecognition, #speechtotext, #transformer, #whisper
innovatorved/whisper.api
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
Language: Python
Total stars: 228
Stars trend:
22 Aug 2023
5pm ▎ +2
6pm ▏ +1
7pm ██▊ +22
8pm ███████ +56
9pm █████▎ +42
10pm ████▏ +33
11pm ███▋ +29
23 Aug 2023
12am ██ +16
#python
#asr, #innovatorved, #transcribe, #whisper
huggingface/distil-whisper
Total stars: 170
Stars trend:
31 Oct 2023
5pm ▋ +5
6pm ████▋ +37
7pm ███▍ +27
8pm ███▍ +27
9pm ██▍ +19
10pm ███ +24
11pm ██▌ +20
#audio, #speechrecognition, #whisper
transcriptionstream/transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
Language: Python
Total stars: 200
Stars trend:
26 May 2024
7pm ▊ +6
8pm █████▍ +43
9pm ███▉ +31
10pm ██▊ +22
#python
#automation, #diarization, #llm, #mistral7b, #ollama, #speakerdiarization, #speechrecognition, #transcription, #whisper, #whisperx
xenova/whisper-web
ML-powered speech recognition directly in your browser
Language: TypeScript
Total stars: 676
Stars trend:
9 Jun 2024
3pm ▏ +1
4pm +0
5pm ▏ +1
6pm █ +8
7pm █▋ +13
8pm █ +8
9pm █▏ +9
10pm ▉ +7
11pm █▎ +10
10 Jun 2024
12am █▋ +13
1am ▋ +5
#typescript
#javascript, #transformers, #whisper
mezbaul-h/june
Local voice assistant combining the power of Ollama, Hugging Face Transformers, and the Coqui TTS Toolkit
Language: Python
Total stars: 137
Stars trend:
21 Jun 2024
12am ▍ +3
1am ██ +16
2am █▊ +14
3am █ +8
4am █▊ +14
5am ██▎ +18
6am ██▎ +18
7am ██▍ +19
#python
#ai, #assistantchatbots, #chatbot, #cliapp, #commandlinetool, #coquitts, #huggingface, #largelanguagemodels, #llm, #python, #speechrecognition, #speechtotext, #texttospeech, #whisper
ai-ng/swift
Fast voice assistant powered by Groq, Cartesia, and Vercel.
Language: TypeScript
Total stars: 188
Stars trend:
8 Jul 2024
6am ▏ +1
7am ▍ +3
8am ▉ +7
9am ▌ +4
10am ▏ +1
11am █ +8
12pm █▍ +11
1pm ▋ +5
2pm ▊ +6
3pm ▎ +2
4pm ▊ +6
5pm ███▋ +29
#typescript
#artificialintelligence, #cartesia, #groq, #llama, #nextjs, #react, #vercel, #whisper
Woolverine94/biniou
a self-hosted webui for 30+ generative ai
Language: Python
Total stars: 369
Stars trend:
26 Aug 2024
11am █▏ +9
12pm █▎ +10
1pm █▏ +9
2pm █▎ +10
3pm █▏ +9
4pm █▋ +13
5pm █ +8
6pm █▎ +10
7pm █▍ +11
8pm █▋ +13
9pm ▋ +5
10pm ▉ +7
#python
#animatediff, #audiogen, #bark, #controlnet, #diffusers, #generativeai, #gfpgan, #gradio, #huggingface, #insightface, #ipadapter, #kandinsky, #llamacpppython, #musicgen, #photomaker, #realesrgan, #stablediffusion, #stablediffusion3, #webui, #whisper
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language: Python
Total stars: 10840
Stars trend:
3 Sep 2024
9am ▏ +1
10am +0
11am +0
12pm ▏ +1
1pm ▏ +1
2pm ▌ +4
3pm ▋ +5
4pm ▌ +4
5pm █▎ +10
6pm ██▏ +17
7pm ██▎ +18
8pm ███▏ +25
#python
#asr, #speech, #speechrecognition, #speechtotext, #whisper
NexaAI/nexa-sdk
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
Language: Python
Total stars: 1085
Stars trend:
22 Sep 2024
10pm █ +8
11pm ▊ +6
23 Sep 2024
12am ▍ +3
1am ▊ +6
2am █▎ +10
3am ▋ +5
4am █ +8
5am ▌ +4
6am █▏ +9
7am ▌ +4
8am ▌ +4
9am █▏ +9
#python
#asr, #edgecomputing, #languagemodel, #llm, #ondeviceai, #ondeviceml, #sdk, #sdkpython, #stablediffusion, #transformers, #tts, #vlm, #whisper
abus-aikorea/voice-pro
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech, and Translation.
Language: Python
Total stars: 385
Stars trend:
9 Nov 2024
10pm ▏ +1
11pm ▌ +4
10 Nov 2024
12am ▎ +2
1am █▏ +9
2am ██▏ +17
3am █▎ +10
4am ▉ +7
5am ▊ +6
6am ▍ +3
7am ▌ +4
8am ▌ +4
9am █ +8
#python
#asr, #demucs, #fasterwhisper, #gradio, #speechrecognition, #speechsynthesis, #speechtotext, #stt, #subtitles, #texttospeech, #transcription, #translate, #translation, #translator, #tts, #uvr5, #webui, #whisper, #ytdlp
microsoft/ai-dev-gallery
An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.
Language: C#
Total stars: 442
Stars trend:
1 Jan 2025
9pm ▏ +1
10pm ▏ +1
11pm +0
2 Jan 2025
12am ▋ +5
1am █▏ +9
2am ██▌ +20
3am █ +8
4am ▋ +5
5am ██▍ +19
6am █▍ +11
#csharp
#ai, #csharp, #developertools, #directml, #dotnet, #genai, #mistral, #npu, #onnx, #onnxruntime, #onnxruntimegenai, #phi3, #qnn, #stablediffusion, #visualstudio, #whisper, #winappsdk, #windows, #winui3, #wpf