TEN-framework/TEN-Agent
TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API and RTC, with weather checks, web search, vision, and RAG capabilities.
Language:Python
Total stars: 1252
Stars trend:
25 Oct 2024
10pm ▏ +1
11pm ▏ +1
26 Oct 2024
12am ▎ +2
1am ▊ +6
2am ██ +16
3am ██ +16
4am █▍ +11
5am ▊ +6
6am █▍ +11
7am ▏ +1
8am █▌ +12
#python
#agent, #ai, #asr, #cpp, #gemini, #golang, #gpt4, #gpt4o, #llm, #lowlatency, #multimodal, #nextjs14, #openai, #python, #rag, #realtime, #tts, #vision, #voiceassistant
abus-aikorea/voice-pro
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech, and Translation.
Language:Python
Total stars: 385
Stars trend:
9 Nov 2024
10pm ▏ +1
11pm ▌ +4
10 Nov 2024
12am ▎ +2
1am █▏ +9
2am ██▏ +17
3am █▎ +10
4am ▉ +7
5am ▊ +6
6am ▍ +3
7am ▌ +4
8am ▌ +4
9am █ +8
#python
#asr, #demucs, #fasterwhisper, #gradio, #speechrecognition, #speechsynthesis, #speechtotext, #stt, #subtitles, #texttospeech, #transcription, #translate, #translation, #translator, #tts, #uvr5, #webui, #whisper, #ytdlp
DrewThomasson/ebook2audiobook
Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!
Language:Python
Total stars: 1325
Stars trend:
27 Dec 2024
1am ▏ +1
2am ▏ +1
3am ██▏ +17
4am █▋ +13
5am ██▏ +17
6am █████▎ +42
#python
#audiobooks, #chinese, #docker, #english, #epub, #gradio, #linux, #mac, #multilingual, #tts, #voicecloning, #windows, #xtts
DrewThomasson/ebook2audiobook
Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!
Language:Python
Total stars: 6513
Stars trend:
13 Jan 2025
1am ██▊ +22
2am █████▏ +41
3am ███▉ +31
4am ██▊ +22
5am ██▌ +20
6am ███▊ +30
7am ███▋ +29
8am ███▏ +25
9am ██▍ +19
10am █▉ +15
11am █ +8
12pm █▍ +11
#python
#audiobooks, #chinese, #docker, #english, #epub, #gradio, #linux, #mac, #multilingual, #tts, #voicecloning, #windows, #xtts
thewh1teagle/kokoro-onnx
TTS with Kokoro and ONNX Runtime
Language:Python
Total stars: 293
Stars trend:
14 Jan 2025
6pm ▌ +4
7pm ▏ +1
8pm +0
9pm +0
10pm +0
11pm ▏ +1
15 Jan 2025
12am ▎ +2
1am █▏ +9
2am ██▍ +19
3am ██▍ +19
4am ██▎ +18
5am ██ +16
#python
#kokoro, #onnxruntime, #python, #tts
mudler/LocalAI
🤖 The free, open-source alternative to OpenAI, Claude, and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers, and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference
Language:Go
Total stars: 28664
Stars trend:
19 Jan 2025
7am ▊ +6
8am ▉ +7
9am ▎ +2
10am ▍ +3
11am ▎ +2
12pm ▍ +3
1pm ██▍ +19
2pm █▋ +13
3pm █▋ +13
4pm █ +8
#go
#ai, #api, #audiogeneration, #distributed, #gemma, #gpt4all, #imagegeneration, #kubernetes, #libp2p, #llama, #llama3, #llm, #mamba, #mistral, #musicgen, #rerank, #rwkv, #stablediffusion, #textgeneration, #tts
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge management / RAG), and Multi-Modals (Vision / TTS / Plugins / Artifacts). One-click FREE deployment of your private ChatGPT/Claude application.
Language:TypeScript
Total stars: 51908
Stars trend:
20 Jan 2025
9pm ▏ +1
10pm ▏ +1
11pm ▍ +3
21 Jan 2025
12am ▌ +4
1am █▋ +13
2am █▉ +15
3am ██ +16
4am ▌ +4
5am ▊ +6
6am █ +8
7am █▍ +11
#typescript
#ai, #artifacts, #azureopenaiapi, #chat, #chatglm, #chatgpt, #claude, #dalle3, #functioncalling, #gemini, #gpt, #gpt4, #gpt4vision, #knowledgebase, #nextjs, #ollama, #openai, #qwen2, #rag, #tts
abus-aikorea/voice-pro
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation (UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
Language:Python
Total stars: 2643
Stars trend:
21 Jan 2025
6am ██▍ +19
7am ▎ +2
8am ▌ +4
9am ▍ +3
10am +0
11am ▋ +5
12pm ▌ +4
1pm ▊ +6
2pm █▋ +13
3pm ▉ +7
4pm ▉ +7
5pm ▊ +6
#python
#audiobook, #fasterwhisper, #gradio, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #webui, #whisper, #ytdlp
sauravpanda/BrowserAI
Run local LLMs inside your browser
Language:TypeScript
Total stars: 196
Stars trend:
22 Jan 2025
3pm ▏ +1
4pm +0
5pm █ +8
6pm ████▎ +34
7pm ▊ +6
8pm ▉ +7
9pm ▎ +2
10pm ▌ +4
11pm ▊ +6
23 Jan 2025
12am ▌ +4
1am ▍ +3
#typescript
#agents, #ai, #llm, #llminference, #localllm, #tts, #webgpu
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Language:Python
Total stars: 6978
Stars trend:
23 Jan 2025
2am ▏ +1
3am ▊ +6
4am █▊ +14
5am █▉ +15
6am █▍ +11
7am ██ +16
8am █▏ +9
9am █▎ +10
#python
#speechsynthesis, #texttospeech, #tts