npuichigo/waveglow
A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis
Language: Python
#neural_vocoder #text_to_speech #waveglow
Stars: 112 Issues: 1 Forks: 12
https://github.com/npuichigo/waveglow
  
  A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis
Language: Python
#neural_vocoder #text_to_speech #waveglow
Stars: 112 Issues: 1 Forks: 12
https://github.com/npuichigo/waveglow
GitHub
  
  GitHub - npuichigo/waveglow: A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis
  A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis - npuichigo/waveglow
  leon-ai/leon
π§ Leon is your open-source personal assistant.
Language: JavaScript
#ai #artificial_intelligence #leon #nodejs #personal_assistant #python #speech_recognition #speech_synthesis #speech_to_text #text_to_speech
Stars: 165 Issues: 6 Forks: 2
https://github.com/leon-ai/leon
  
  π§ Leon is your open-source personal assistant.
Language: JavaScript
#ai #artificial_intelligence #leon #nodejs #personal_assistant #python #speech_recognition #speech_synthesis #speech_to_text #text_to_speech
Stars: 165 Issues: 6 Forks: 2
https://github.com/leon-ai/leon
GitHub
  
  GitHub - leon-ai/leon: π§  Leon is your open-source personal assistant.
  π§  Leon is your open-source personal assistant. Contribute to leon-ai/leon development by creating an account on GitHub.
  kxxt/aspeak
This program uses trial auth token of Azure Cognitive Services to do speech synthesis for you.
Language: Python
#aspeak #azure_cognitive_services #cli #python #speech_synthesis #text_to_speech #tts #tts_engine
Stars: 121 Issues: 3 Forks: 8
https://github.com/kxxt/aspeak
  
  This program uses trial auth token of Azure Cognitive Services to do speech synthesis for you.
Language: Python
#aspeak #azure_cognitive_services #cli #python #speech_synthesis #text_to_speech #tts #tts_engine
Stars: 121 Issues: 3 Forks: 8
https://github.com/kxxt/aspeak
GitHub
  
  GitHub - kxxt/aspeak: A simple text-to-speech client for Azure TTS API.
  A simple text-to-speech client for Azure TTS API.  - GitHub - kxxt/aspeak: A simple text-to-speech client for Azure TTS API.
π5
  lucidrains/natural-speech-pytorch
Implementation of the neural network proposed in Natural Speech, a text-to-speech generator that is indistinguishable from human recordings for the first time, from Microsoft Research
Language: Python
#artificial_intelligence #deep_learning #text_to_speech
Stars: 115 Issues: 0 Forks: 0
https://github.com/lucidrains/natural-speech-pytorch
  
  Implementation of the neural network proposed in Natural Speech, a text-to-speech generator that is indistinguishable from human recordings for the first time, from Microsoft Research
Language: Python
#artificial_intelligence #deep_learning #text_to_speech
Stars: 115 Issues: 0 Forks: 0
https://github.com/lucidrains/natural-speech-pytorch
GitHub
  
  GitHub - lucidrains/natural-speech-pytorch: Implementation of the neural network proposed in Natural Speech, a text-to-speech generatorβ¦
  Implementation of the neural network proposed in Natural Speech, a text-to-speech generator that is indistinguishable from human recordings for the first time, from Microsoft Research - GitHub - lu...
π₯2π1
  jaketae/storyteller
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
Language: Python
#gpt #image_generation #pytorch #stable_diffusion #text_to_image #text_to_speech #text_to_video #video_generation
Stars: 119 Issues: 1 Forks: 6
https://github.com/jaketae/storyteller
  
  Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
Language: Python
#gpt #image_generation #pytorch #stable_diffusion #text_to_image #text_to_speech #text_to_video #video_generation
Stars: 119 Issues: 1 Forks: 6
https://github.com/jaketae/storyteller
GitHub
  
  GitHub - jaketae/storyteller: Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
  Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech - jaketae/storyteller
π1π€―1
  enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E, WIP
Language: Python
#audio_lm #pytorch #text_to_speech #tts #vall_e #valle
Stars: 212 Issues: 2 Forks: 32
https://github.com/enhuiz/vall-e
  
  An unofficial PyTorch implementation of the audio LM VALL-E, WIP
Language: Python
#audio_lm #pytorch #text_to_speech #tts #vall_e #valle
Stars: 212 Issues: 2 Forks: 32
https://github.com/enhuiz/vall-e
GitHub
  
  GitHub - enhuiz/vall-e: An unofficial PyTorch implementation of the audio LM VALL-E
  An unofficial PyTorch implementation of the audio LM VALL-E  - GitHub - enhuiz/vall-e: An unofficial PyTorch implementation of the audio LM VALL-E
π5π1
  netease-youdao/EmotiVoice
EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python
#ai #deep_learning #emotion #emotivoice #multi_speaker #prompt #python #pytorch #speech #speech_synthesis #style #text_to_speech #tts
Stars: 432 Issues: 3 Forks: 38
https://github.com/netease-youdao/EmotiVoice
  
  EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python
#ai #deep_learning #emotion #emotivoice #multi_speaker #prompt #python #pytorch #speech #speech_synthesis #style #text_to_speech #tts
Stars: 432 Issues: 3 Forks: 38
https://github.com/netease-youdao/EmotiVoice
GitHub
  
  GitHub - netease-youdao/EmotiVoice: EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engine
  EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engine - netease-youdao/EmotiVoice
π1
  jishengpeng/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Language: Python
#acoustic #audio_representation #codec #dac #encodec #gpt4o #music_representation_learning #semantic #soundstream #speech_language_model #speech_representation #text_to_speech
Stars: 332 Issues: 6 Forks: 20
https://github.com/jishengpeng/WavTokenizer
  
  SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Language: Python
#acoustic #audio_representation #codec #dac #encodec #gpt4o #music_representation_learning #semantic #soundstream #speech_language_model #speech_representation #text_to_speech
Stars: 332 Issues: 6 Forks: 20
https://github.com/jishengpeng/WavTokenizer
GitHub
  
  GitHub - jishengpeng/WavTokenizer: [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio languageβ¦
  [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling  - GitHub - jishengpeng/WavTokenizer: [ICLR 2025] SOTA discrete acoustic codec models with 4...
  lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
Language: Python
#diffusion_transformer #flow_matching #mlx #text_to_speech #tts
Stars: 193 Issues: 2 Forks: 17
https://github.com/lucasnewman/f5-tts-mlx
  
  Implementation of F5-TTS in MLX
Language: Python
#diffusion_transformer #flow_matching #mlx #text_to_speech #tts
Stars: 193 Issues: 2 Forks: 17
https://github.com/lucasnewman/f5-tts-mlx
GitHub
  
  GitHub - lucasnewman/f5-tts-mlx: Implementation of F5-TTS in MLX
  Implementation of F5-TTS in MLX. Contribute to lucasnewman/f5-tts-mlx development by creating an account on GitHub.
  edwko/OuteTTS
Interface for OuteTTS models.
Language: Python
#gguf #llama #text_to_speech #transformers #tts
Stars: 278 Issues: 6 Forks: 13
https://github.com/edwko/OuteTTS
  
  Interface for OuteTTS models.
Language: Python
#gguf #llama #text_to_speech #transformers #tts
Stars: 278 Issues: 6 Forks: 13
https://github.com/edwko/OuteTTS
GitHub
  
  GitHub - edwko/OuteTTS: Interface for OuteTTS models.
  Interface for OuteTTS models. Contribute to edwko/OuteTTS development by creating an account on GitHub.
  