GitHub Trends

#python #audio #audio_tokenizer #llm #multimodal #text_to_speech #voice_cloning

MOSS-TTS is an open-source family of speech and sound models for natural, high-quality audio generation, including voice cloning, multi-speaker dialogue, real-time speech, and sound effects. It supports 31 languages in v1.5, better voice stability, and pause control, and it also offers a lightweight Nano version that can run on 4 CPU cores. The benefit to you is simple: you can create realistic speech or sound for apps, demos, or products with strong quality, flexible control, and multiple ways to run it.

https://github.com/OpenMOSS/MOSS-TTS

GitHub

GitHub - OpenMOSS/MOSS-TTS: MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS…

MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario...

440 views11:30

About

Blog

Apps

Platform