GitHub Trends

#python #text_to_speech #tts #vits #voice_clone #voice_cloneai #voice_cloning

GPT-SoVITS-WebUI is a tool for text-to-speech and voice conversion. Here’s what it offers:
- **Zero-shot TTS:** You can convert text to speech instantly with just a 5-second vocal sample.
- **Few-shot TTS:** About 1 minute of voice data is enough to train a good TTS model.
- **Cross-lingual support:** It works in several languages, including English, Japanese, Korean, Cantonese, and Chinese.
- **WebUI Tools:** It includes voice separation, automatic training set segmentation, and text labeling, which make it easier to build training sets and use the models.

The result is quick, high-quality voice conversion and text-to-speech from very little reference audio.

https://github.com/RVC-Boss/GPT-SoVITS

GitHub

GitHub - RVC-Boss/GPT-SoVITS: 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)


433 views · 12:30


#python #text_to_speech #tts #voice_clone #zero_shot_tts

OpenVoice is a free, open-source tool that clones a voice from a short audio sample and then generates speech in that voice across many languages and accents. You can adjust how the voice sounds, including emotion, accent, rhythm, pauses, and intonation. A major benefit is “zero-shot” cross-lingual cloning: the cloned voice can speak languages it was never trained on. The latest version, OpenVoice V2, offers better sound quality, supports six major languages natively, and is free for both personal and commercial use. This makes realistic, customizable voice content affordable without expensive software.

https://github.com/myshell-ai/OpenVoice

GitHub

GitHub - myshell-ai/OpenVoice: Instant voice cloning by MIT and MyShell. Audio foundation model.


605 views · 13:00


#typescript #ai #cuda #mlx #qwen3_tts #qwen3_tts_ui #voice_ai #voice_clone #whisper

Voicebox is a free, open-source voice synthesis studio that lets you clone voices, generate speech in 23 languages, and apply audio effects—all running privately on your computer. You can create realistic voice clones from just seconds of audio, use five different text-to-speech engines for different needs, add effects like reverb and pitch shift, and build multi-voice projects with a timeline editor. The key benefit is complete privacy: your voice data and AI models never leave your machine, unlike cloud-based alternatives. It also includes an API for building voice-powered applications and works across Mac, Windows, and Linux with GPU acceleration support.

https://github.com/jamiepine/voicebox

GitHub

GitHub - jamiepine/voicebox: The open-source voice synthesis studio


309 views · 11:30
