open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python
Total stars: 903
Stars trend:
#python
#audiogeneration, #audiosynthesis, #audioldm, #audit, #fastspeech2, #hifigan, #musicgeneration, #naturalspeech2, #singingvoiceconversion, #speechsynthesis, #texttoaudio, #texttospeech, #valle, #vits, #voiceconversion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python
Total stars: 903
Stars trend:
19 Dec 2023
9am █▍ +11
10am ▋ +5
11am ▉ +7
12pm ██ +16
1pm █▋ +13
2pm █▋ +13
3pm ██▍ +19
4pm █▌ +12
5pm ██ +16
6pm █▍ +11
7pm ██▎ +18
8pm █▌ +12
#python
#audiogeneration, #audiosynthesis, #audioldm, #audit, #fastspeech2, #hifigan, #musicgeneration, #naturalspeech2, #singingvoiceconversion, #speechsynthesis, #texttoaudio, #texttospeech, #valle, #vits, #voiceconversion
myshell-ai/OpenVoice
Instant voice cloning by MyShell
Language:Python
Total stars: 685
Stars trend:
#python
#texttospeech, #tts, #voiceclone, #zeroshottts
Instant voice cloning by MyShell
Language:Python
Total stars: 685
Stars trend:
1 Jan 2024
9am ▏ +1
10am +0
11am ▎ +2
12pm ▏ +1
1pm ▎ +2
2pm ▍ +3
3pm ▎ +2
4pm ███████ +56
5pm ███████▋ +61
6pm █████▎ +42
#python
#texttospeech, #tts, #voiceclone, #zeroshottts
myshell-ai/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Language:Python
Total stars: 127
Stars trend:
#python
#chinese, #english, #french, #japanese, #korean, #multilingual, #spanish, #texttospeech, #tts
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Language:Python
Total stars: 127
Stars trend:
26 Feb 2024
2pm ▊ +6
3pm ▎ +2
4pm +0
5pm +0
6pm +0
7pm +0
8pm ▏ +1
9pm +0
10pm ▌ +4
11pm ▋ +5
27 Feb 2024
12am █▌ +12
1am ███▌ +28
#python
#chinese, #english, #french, #japanese, #korean, #multilingual, #spanish, #texttospeech, #tts
espeak-ng/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Language:C
Total stars: 2997
Stars trend:
#c
#android, #espeak, #espeakng, #speechsynthesis, #texttospeech
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Language:C
Total stars: 2997
Stars trend:
2 May 2024
12am ▏ +1
1am +0
2am ██ +16
3am █▌ +12
4am ██▋ +21
5am ███▍ +27
#c
#android, #espeak, #espeakng, #speechsynthesis, #texttospeech
6drf21e/ChatTTS_colab
🚀 One-click deployment (including offline integration package)! Based on ChatTTS, it supports timbre drawing, long audio generation and role-based reading. Simple and easy to use, no complicated installation required.
Language:Python
Total stars: 306
Stars trend:
#python
#chattts, #colabnotebook, #texttospeech
🚀 One-click deployment (including offline integration package)! Based on ChatTTS, it supports timbre drawing, long audio generation and role-based reading. Simple and easy to use, no complicated installation required.
Language:Python
Total stars: 306
Stars trend:
4 Jun 2024
6pm ▍ +3
7pm ▏ +1
8pm ▏ +1
9pm ▏ +1
10pm ▏ +1
11pm █▏ +9
5 Jun 2024
12am ▉ +7
1am ██▎ +18
2am █ +8
3am █▌ +12
4am █▏ +9
5am █▏ +9
#python
#chattts, #colabnotebook, #texttospeech
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift
Language:C++
Total stars: 1120
Stars trend:
#cplusplus
#aarch64, #android, #arm32, #asr, #cpp, #csharp, #dotnet, #ios, #linux, #macos, #mfc, #onnx, #openkylin, #raspberrypi, #riscv, #speechtotext, #texttospeech, #vits, #windows
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift
Language:C++
Total stars: 1120
Stars trend:
6 Jun 2024
4pm ▏ +1
5pm ▏ +1
6pm +0
7pm +0
8pm +0
9pm +0
10pm +0
11pm +0
7 Jun 2024
12am ▍ +3
1am ████▌ +36
2am ███▎ +26
3am ██ +16
#cplusplus
#aarch64, #android, #arm32, #asr, #cpp, #csharp, #dotnet, #ios, #linux, #macos, #mfc, #onnx, #openkylin, #raspberrypi, #riscv, #speechtotext, #texttospeech, #vits, #windows
lenML/ChatTTS-Forge
🍦 ChatTTS-Forge is a project developed around the TTS generation model ChatTTS, implementing an API Server and a Gradio-based WebUI.
Language:Python
Total stars: 249
Stars trend:
#python
#agent, #chattts, #chatttsforge, #colab, #gpt, #llm, #ssml, #texttospeech, #tts
🍦 ChatTTS-Forge is a project developed around the TTS generation model ChatTTS, implementing an API Server and a Gradio-based WebUI.
Language:Python
Total stars: 249
Stars trend:
11 Jun 2024
5am ▎ +2
6am ▋ +5
7am ▍ +3
8am ▋ +5
9am ██▏ +17
10am █▌ +12
11am ▋ +5
12pm ▉ +7
1pm ▋ +5
2pm ▊ +6
3pm ▊ +6
4pm ▍ +3
#python
#agent, #chattts, #chatttsforge, #colab, #gpt, #llm, #ssml, #texttospeech, #tts
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python
Total stars: 389
Stars trend:
#python
#allinone, #asr, #audioprocessing, #machinetranslation, #nonautoregressive, #seamless, #simultaneoustranslation, #speech, #speechenhancement, #speechprocessing, #speechrecognition, #speechsynthesis, #speechtotext, #speechtranslation, #streamingaudio, #texttoaudio, #texttospeech, #translation, #tts, #voice
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Language:Python
Total stars: 389
Stars trend:
17 Jun 2024
9pm ▏ +1
10pm ▏ +1
11pm ▎ +2
18 Jun 2024
12am +0
1am ▋ +5
2am ▍ +3
3am █▍ +11
4am ███ +24
5am █▋ +13
6am █ +8
7am ▉ +7
#python
#allinone, #asr, #audioprocessing, #machinetranslation, #nonautoregressive, #seamless, #simultaneoustranslation, #speech, #speechenhancement, #speechprocessing, #speechrecognition, #speechsynthesis, #speechtotext, #speechtranslation, #streamingaudio, #texttoaudio, #texttospeech, #translation, #tts, #voice
DigitalPhonetics/IMS-Toucan
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
Language:Python
Total stars: 556
Stars trend:
#python
#deeplearning, #pytorch, #speech, #speechprocessing, #speechsynthesis, #texttospeech, #toolkit, #tts
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
Language:Python
Total stars: 556
Stars trend:
19 Jun 2024
8pm ▍ +3
9pm ██▌ +20
10pm ██▊ +22
11pm ▊ +6
20 Jun 2024
12am █▋ +13
1am █▋ +13
#python
#deeplearning, #pytorch, #speech, #speechprocessing, #speechsynthesis, #texttospeech, #toolkit, #tts
mezbaul-h/june
Local voice assistant combining the power of Ollama, Hugging Face Transformers, and the Coqui TTS Toolkit
Language:Python
Total stars: 137
Stars trend:
#python
#ai, #assistantchatbots, #chatbot, #cliapp, #commandlinetool, #coquitts, #huggingface, #largelanguagemodels, #llm, #python, #speechrecognition, #speechtotext, #texttospeech, #whisper
Local voice assistant combining the power of Ollama, Hugging Face Transformers, and the Coqui TTS Toolkit
Language:Python
Total stars: 137
Stars trend:
21 Jun 2024
12am ▍ +3
1am ██ +16
2am █▊ +14
3am █ +8
4am █▊ +14
5am ██▎ +18
6am ██▎ +18
7am ██▍ +19
#python
#ai, #assistantchatbots, #chatbot, #cliapp, #commandlinetool, #coquitts, #huggingface, #largelanguagemodels, #llm, #python, #speechrecognition, #speechtotext, #texttospeech, #whisper
Camb-ai/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
Language:Python
Total stars: 738
Stars trend:
#python
#prosody, #speech, #speechsynthesis, #texttospeech, #voicecloneai, #voicecloning
MARS5 speech model (TTS) from CAMB.AI
Language:Python
Total stars: 738
Stars trend:
24 Jun 2024
7pm ▏ +1
8pm ██▎ +18
9pm ████▉ +39
10pm ████▌ +36
#python
#prosody, #speech, #speechsynthesis, #texttospeech, #voicecloneai, #voicecloning
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Language:Python
Total stars: 4321
Stars trend:
#python
#speechsynthesis, #texttospeech, #tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Language:Python
Total stars: 4321
Stars trend:
29 Jun 2024
12pm ▍ +3
1pm █▉ +15
2pm █▋ +13
3pm █▌ +12
4pm ██▏ +17
5pm ▊ +6
6pm ▍ +3
7pm ▍ +3
8pm ▏ +1
9pm +0
10pm ▌ +4
#python
#speechsynthesis, #texttospeech, #tts