readest/readest
Readest is a modern, feature-rich ebook reader designed for avid readers offering seamless cross-platform access, powerful tools, and an intuitive interface to elevate your reading experience.
Language:TypeScript
Total stars: 2432
Stars trend:
#typescript
#ebook, #ebookreader, #epub, #foliate, #nextjs, #reader, #tauri, #tauri2, #tts
Readest is a modern, feature-rich ebook reader designed for avid readers offering seamless cross-platform access, powerful tools, and an intuitive interface to elevate your reading experience.
Language:TypeScript
Total stars: 2432
Stars trend:
27 Jan 2025
12am ▌ +4
1am █▌ +12
2am █▏ +9
3am ▋ +5
4am █ +8
5am █▎ +10
6am ▍ +3
7am ▋ +5
8am █ +8
9am ▋ +5
10am ▍ +3
11am ▌ +4
#typescript
#ebook, #ebookreader, #epub, #foliate, #nextjs, #reader, #tauri, #tauri2, #tts
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python
Total stars: 39404
Stars trend:
#python
#texttospeech, #tts, #vits, #voiceclone, #voicecloneai, #voicecloning
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python
Total stars: 39404
Stars trend:
28 Jan 2025
12am █▌ +12
1am ▉ +7
2am ▋ +5
3am ▊ +6
4am ▋ +5
5am ▋ +5
6am ▉ +7
7am ▌ +4
8am ▉ +7
9am ▉ +7
10am █ +8
11am ▌ +4
#python
#texttospeech, #tts, #vits, #voiceclone, #voicecloneai, #voicecloning
TEN-framework/TEN-Agent
TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compatible with popular workflow platforms like Dify and Coze.
Language:Python
Total stars: 4121
Stars trend:
#python
#agent, #ai, #asr, #cpp, #gemini, #golang, #gpt4, #gpt4o, #llm, #lowlatency, #multimodal, #nextjs14, #openai, #python, #rag, #realtime, #realtime, #tts, #vision, #voiceassistant
TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compatible with popular workflow platforms like Dify and Coze.
Language:Python
Total stars: 4121
Stars trend:
28 Jan 2025
10am ▎ +2
11am ▉ +7
12pm ▉ +7
1pm ▉ +7
2pm █▏ +9
3pm █▌ +12
4pm █▍ +11
5pm ▋ +5
6pm █▌ +12
7pm ▌ +4
#python
#agent, #ai, #asr, #cpp, #gemini, #golang, #gpt4, #gpt4o, #llm, #lowlatency, #multimodal, #nextjs14, #openai, #python, #rag, #realtime, #realtime, #tts, #vision, #voiceassistant
mastra-ai/mastra
The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.
Language:TypeScript
Total stars: 8282
Stars trend:
#typescript
#agents, #ai, #chatbots, #evals, #javascript, #llm, #mcp, #nextjs, #nodejs, #reactjs, #tts, #typescript, #workflows
The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.
Language:TypeScript
Total stars: 8282
Stars trend:
28 Feb 2025
4pm █▊ +14
5pm ███ +24
6pm █ +8
7pm ██▏ +17
8pm █▍ +11
9pm █▌ +12
10pm ▍ +3
11pm ▉ +7
1 Mar 2025
12am █▉ +15
1am █▊ +14
2am █▌ +12
3am █▉ +15
#typescript
#agents, #ai, #chatbots, #evals, #javascript, #llm, #mcp, #nextjs, #nodejs, #reactjs, #tts, #typescript, #workflows
wangzongming/esp-ai
The simplest and lowest-cost AI integration solution. If you like this project, please give it a Star~ | 最简单、最低成本的AI接入方案。喜欢本项目的话点个 Star 吧~
Language:HTML
Total stars: 1575
Stars trend:
#html
#aiot, #arduino, #arduinollm, #espai, #esp32, #esp32ai, #esp32idf, #esp32llm, #esp8266, #iat, #llm, #rag, #tts
The simplest and lowest-cost AI integration solution. If you like this project, please give it a Star~ | 最简单、最低成本的AI接入方案。喜欢本项目的话点个 Star 吧~
Language:HTML
Total stars: 1575
Stars trend:
4 Mar 2025
5am ▎ +8
6am ████████████▋ +361
7am █ +30
8am +0
9am +0
10am ██████████████████▊ +536
#html
#aiot, #arduino, #arduinollm, #espai, #esp32, #esp32ai, #esp32idf, #esp32llm, #esp8266, #iat, #llm, #rag, #tts
neural-maze/ava-whatsapp-agent-course
Meet Ava, the WhatsApp Agent
Language:Python
Total stars: 842
Stars trend:
#python
#agent, #agentbased, #agenticworkflow, #agents, #stt, #tts, #vectordatabase
Meet Ava, the WhatsApp Agent
Language:Python
Total stars: 842
Stars trend:
6 Apr 2025
5pm ██████▏ +49
6pm ████▎ +34
7pm ██▊ +22
8pm ██▌ +20
9pm ██▏ +17
#python
#agent, #agentbased, #agenticworkflow, #agents, #stt, #tts, #vectordatabase
krillinai/KrillinAI
A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube,TikTok, and Shorts. 基于AI大模型的视频翻译和配音工具,专业级翻译,一键部署全流程,可以生成适配抖音,小红书,哔哩哔哩,视频号,TikTok,Youtube Shorts等形态的内容
Language:Go
Total stars: 741
Stars trend:
#go
#dubbing, #localization, #tts, #videotranscription, #videotranslation
A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube,TikTok, and Shorts. 基于AI大模型的视频翻译和配音工具,专业级翻译,一键部署全流程,可以生成适配抖音,小红书,哔哩哔哩,视频号,TikTok,Youtube Shorts等形态的内容
Language:Go
Total stars: 741
Stars trend:
6 Apr 2025
10pm ▏ +1
11pm ▎ +2
7 Apr 2025
12am █▏ +9
1am █▌ +12
2am ██ +16
3am █▊ +14
4am █▍ +11
5am █▎ +10
#go
#dubbing, #localization, #tts, #videotranscription, #videotranslation
xming521/WeClone
欢迎star⭐。使用微信聊天记录微调大语言模型,使用微信语音消息大模➕0.5B大模型实现高质量声音克隆,并绑定到微信机器人,实现自己的数字分身。 数字克隆/数字分身/声音克隆/LLM/大语言模型/微信聊天机器人/LoRA
Language:Python
Total stars: 680
Stars trend:
#python
#chatglm3, #llm, #tts, #wechat
欢迎star⭐。使用微信聊天记录微调大语言模型,使用微信语音消息大模➕0.5B大模型实现高质量声音克隆,并绑定到微信机器人,实现自己的数字分身。 数字克隆/数字分身/声音克隆/LLM/大语言模型/微信聊天机器人/LoRA
Language:Python
Total stars: 680
Stars trend:
9 Apr 2025
12am ▍ +3
1am ▉ +7
2am ▋ +5
3am █▋ +13
4am █▎ +10
5am █▍ +11
6am ▊ +6
7am ▋ +5
8am ▍ +3
9am █▍ +11
10am ▍ +3
#python
#chatglm3, #llm, #tts, #wechat
cosin2077/easyVoice
开源文本转语音工具
Language:TypeScript
Total stars: 157
Stars trend:
#typescript
#edgetts, #tts, #ttsengines
开源文本转语音工具
Language:TypeScript
Total stars: 157
Stars trend:
11 Apr 2025
12am █▋ +13
1am ████▊ +38
2am ████▋ +37
3am ███ +24
4am █ +8
5am █▏ +9
#typescript
#edgetts, #tts, #ttsengines
canopyai/Orpheus-TTS
Towards Human-Sounding Speech
Language:Python
Total stars: 3833
Stars trend:
#python
#llm, #realtime, #tts
Towards Human-Sounding Speech
Language:Python
Total stars: 3833
Stars trend:
11 Apr 2025
3am █▍ +11
4am █▌ +12
5am █ +8
6am ▉ +7
7am ▊ +6
8am ▎ +2
9am ▌ +4
10am ▎ +2
11am ▎ +2
12pm ▊ +6
1pm ▊ +6
2pm █▎ +10
#python
#llm, #realtime, #tts
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python
Total stars: 13977
Stars trend:
#python
#asr, #deeplearning, #generativeai, #largelanguagemodels, #machinetranslation, #multimodal, #neuralnetworks, #speakerdiariazation, #speakerrecognition, #speechsynthesis, #speechtranslation, #tts
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python
Total stars: 13977
Stars trend:
8 May 2025
11am ▉ +7
12pm █▉ +15
1pm ▉ +7
2pm █▏ +9
3pm ▉ +7
4pm ▉ +7
5pm ▊ +6
6pm ▋ +5
7pm ▍ +3
8pm ▌ +4
9pm ▍ +3
10pm ▉ +7
#python
#asr, #deeplearning, #generativeai, #largelanguagemodels, #machinetranslation, #multimodal, #neuralnetworks, #speakerdiariazation, #speakerrecognition, #speechsynthesis, #speechtranslation, #tts
pot-app/pot-desktop
🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.
Language:JavaScript
Total stars: 13020
Stars trend:
#javascript
#linux, #macos, #ocr, #pot, #potapp, #recognize, #tauri, #translate, #translation, #tts, #windows
🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.
Language:JavaScript
Total stars: 13020
Stars trend:
26 Jun 2025
10pm ▊ +6
11pm ▍ +3
27 Jun 2025
12am ▊ +6
1am █▎ +10
2am █▍ +11
3am ▉ +7
4am █▋ +13
5am ▊ +6
6am ▊ +6
7am ▍ +3
8am ▋ +5
9am █ +8
#javascript
#linux, #macos, #ocr, #pot, #potapp, #recognize, #tauri, #translate, #translation, #tts, #windows
abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Language:Python
Total stars: 3929
Stars trend:
#python
#audiobook, #fasterwhisper, #gradio, #karaoke, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #voiceconversion, #webui, #whisper, #whisperx, #ytdlp
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Language:Python
Total stars: 3929
Stars trend:
27 Jun 2025
8am ▍ +3
9am █▊ +14
10am ▊ +6
11am █▉ +15
12pm █▉ +15
1pm █▉ +15
2pm █▊ +14
3pm ▉ +7
4pm █ +8
5pm ▌ +4
#python
#audiobook, #fasterwhisper, #gradio, #karaoke, #podcasts, #speechrecognition, #speechsynthesis, #speechtotext, #subtitles, #texttospeech, #transcription, #translator, #tts, #voicecloning, #voiceconversion, #webui, #whisper, #whisperx, #ytdlp