#python #audio #audio_tokenizer #llm #multimodal #text_to_speech #voice_cloning
MOSS-TTS is an open-source family of speech and sound models for natural, high-quality audio generation, covering voice cloning, multi-speaker dialogue, real-time speech, and sound effects. It supports 31 languages as of v1.5, along with better voice stability and pause control, and it offers a lightweight Nano version that can run on 4 CPU cores. In practice, you can generate realistic speech or sound for apps, demos, or products, with strong quality, flexible control, and several ways to run the models.
https://github.com/OpenMOSS/MOSS-TTS
MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario...