baaivision/GeoDream
GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation
Language: Python
#3d #3d_aigc #3d_generation #text_to_3d
Stars: 244 Issues: 1 Forks: 4
https://github.com/baaivision/GeoDream
GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation
Language: Python
#3d #3d_aigc #3d_generation #text_to_3d
Stars: 244 Issues: 1 Forks: 4
https://github.com/baaivision/GeoDream
GitHub
GitHub - baaivision/GeoDream: GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation
GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation - baaivision/GeoDream
TianxingWu/FreeInit
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Language: Python
#aigc #text_to_video #video_diffusion_model #video_generation
Stars: 162 Issues: 4 Forks: 7
https://github.com/TianxingWu/FreeInit
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Language: Python
#aigc #text_to_video #video_diffusion_model #video_generation
Stars: 162 Issues: 4 Forks: 7
https://github.com/TianxingWu/FreeInit
GitHub
GitHub - TianxingWu/FreeInit: [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models - TianxingWu/FreeInit
YangLing0818/RPG-DiffusionMaster
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Language: Python
#image_editing #large_language_models #multimodal_large_language_models #text_to_image_diffusion
Stars: 272 Issues: 5 Forks: 14
https://github.com/YangLing0818/RPG-DiffusionMaster
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Language: Python
#image_editing #large_language_models #multimodal_large_language_models #text_to_image_diffusion
Stars: 272 Issues: 5 Forks: 14
https://github.com/YangLing0818/RPG-DiffusionMaster
GitHub
GitHub - YangLing0818/RPG-DiffusionMaster: [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating…
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) - YangLing0818/RPG-DiffusionMaster
reqable/re-editor
Re-Editor is a powerful lightweight text and code editor widget.
Language: Dart
#code_editor #flutter #syntax_highlighting #text_editor
Stars: 315 Issues: 0 Forks: 18
https://github.com/reqable/re-editor
Re-Editor is a powerful lightweight text and code editor widget.
Language: Dart
#code_editor #flutter #syntax_highlighting #text_editor
Stars: 315 Issues: 0 Forks: 18
https://github.com/reqable/re-editor
GitHub
GitHub - reqable/re-editor: Re-Editor is a powerful lightweight text and code editor widget.
Re-Editor is a powerful lightweight text and code editor widget. - reqable/re-editor
3DTopia/LGM
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
Language: Python
#gaussian_splatting #image_to_3d #text_to_3d
Stars: 308 Issues: 7 Forks: 15
https://github.com/3DTopia/LGM
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
Language: Python
#gaussian_splatting #image_to_3d #text_to_3d
Stars: 308 Issues: 7 Forks: 15
https://github.com/3DTopia/LGM
GitHub
GitHub - 3DTopia/LGM: [ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation. - 3DTopia/LGM
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language: Python
#diffusion_models #long_video_generation #metamorphic_video_generation #open_sora_plan #text_to_video #time_lapse #time_lapse_dataset #video_generation
Stars: 281 Issues: 4 Forks: 16
https://github.com/PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language: Python
#diffusion_models #long_video_generation #metamorphic_video_generation #open_sora_plan #text_to_video #time_lapse #time_lapse_dataset #video_generation
Stars: 281 Issues: 4 Forks: 16
https://github.com/PKU-YuanGroup/MagicTime
GitHub
GitHub - PKU-YuanGroup/MagicTime: [TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators - PKU-YuanGroup/MagicTime
AdityaNG/kan-gpt
The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling
Language: Python
#gpt #kanformers #kolmogorov_arnold_networks #kolmogorov_arnold_representation #llm #text_generation #transformers
Stars: 217 Issues: 2 Forks: 11
https://github.com/AdityaNG/kan-gpt
The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling
Language: Python
#gpt #kanformers #kolmogorov_arnold_networks #kolmogorov_arnold_representation #llm #text_generation #transformers
Stars: 217 Issues: 2 Forks: 11
https://github.com/AdityaNG/kan-gpt
GitHub
GitHub - AdityaNG/kan-gpt: The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks…
The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling - AdityaNG/kan-gpt
jishengpeng/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Language: Python
#acoustic #audio_representation #codec #dac #encodec #gpt4o #music_representation_learning #semantic #soundstream #speech_language_model #speech_representation #text_to_speech
Stars: 332 Issues: 6 Forks: 20
https://github.com/jishengpeng/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Language: Python
#acoustic #audio_representation #codec #dac #encodec #gpt4o #music_representation_learning #semantic #soundstream #speech_language_model #speech_representation #text_to_speech
Stars: 332 Issues: 6 Forks: 20
https://github.com/jishengpeng/WavTokenizer
GitHub
GitHub - jishengpeng/WavTokenizer: [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language…
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling - GitHub - jishengpeng/WavTokenizer: [ICLR 2025] SOTA discrete acoustic codec models with 4...
lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
Language: Python
#diffusion_transformer #flow_matching #mlx #text_to_speech #tts
Stars: 193 Issues: 2 Forks: 17
https://github.com/lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
Language: Python
#diffusion_transformer #flow_matching #mlx #text_to_speech #tts
Stars: 193 Issues: 2 Forks: 17
https://github.com/lucasnewman/f5-tts-mlx
GitHub
GitHub - lucasnewman/f5-tts-mlx: Implementation of F5-TTS in MLX
Implementation of F5-TTS in MLX. Contribute to lucasnewman/f5-tts-mlx development by creating an account on GitHub.
edwko/OuteTTS
Interface for OuteTTS models.
Language: Python
#gguf #llama #text_to_speech #transformers #tts
Stars: 278 Issues: 6 Forks: 13
https://github.com/edwko/OuteTTS
Interface for OuteTTS models.
Language: Python
#gguf #llama #text_to_speech #transformers #tts
Stars: 278 Issues: 6 Forks: 13
https://github.com/edwko/OuteTTS
GitHub
GitHub - edwko/OuteTTS: Interface for OuteTTS models.
Interface for OuteTTS models. Contribute to edwko/OuteTTS development by creating an account on GitHub.
Lightricks/LTX-Video
Official repository for LTX-Video
Language: Python
#diffusion_models #dit #image_to_video #image_to_video_generation #text_to_video #text_to_video_generation
Stars: 241 Issues: 8 Forks: 9
https://github.com/Lightricks/LTX-Video
Official repository for LTX-Video
Language: Python
#diffusion_models #dit #image_to_video #image_to_video_generation #text_to_video #text_to_video_generation
Stars: 241 Issues: 8 Forks: 9
https://github.com/Lightricks/LTX-Video
GitHub
GitHub - Lightricks/LTX-Video: Official repository for LTX-Video
Official repository for LTX-Video. Contribute to Lightricks/LTX-Video development by creating an account on GitHub.
Lightricks/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python
#comfyui #diffusion_models #dit #image_to_video #image_to_video_generation #text_to_image #text_to_image_generation
Stars: 217 Issues: 18 Forks: 9
https://github.com/Lightricks/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python
#comfyui #diffusion_models #dit #image_to_video #image_to_video_generation #text_to_image #text_to_image_generation
Stars: 217 Issues: 18 Forks: 9
https://github.com/Lightricks/ComfyUI-LTXVideo
GitHub
GitHub - Lightricks/ComfyUI-LTXVideo: LTX-Video Support for ComfyUI
LTX-Video Support for ComfyUI. Contribute to Lightricks/ComfyUI-LTXVideo development by creating an account on GitHub.
declare-lab/TangoFlux
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
Language: Jupyter Notebook
#flow_matching #generative_ai #text_to_audio #text_to_audio_ai #tta
Stars: 152 Issues: 2 Forks: 13
https://github.com/declare-lab/TangoFlux
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
Language: Jupyter Notebook
#flow_matching #generative_ai #text_to_audio #text_to_audio_ai #tta
Stars: 152 Issues: 2 Forks: 13
https://github.com/declare-lab/TangoFlux
GitHub
GitHub - declare-lab/TangoFlux: TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching - declare-lab/TangoFlux
FoundationVision/FlashVideo
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
Language: Python
#efficient_generative_model #text_to_video #video_generation
Stars: 195 Issues: 5 Forks: 3
https://github.com/FoundationVision/FlashVideo
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
Language: Python
#efficient_generative_model #text_to_video #video_generation
Stars: 195 Issues: 5 Forks: 3
https://github.com/FoundationVision/FlashVideo
GitHub
GitHub - FoundationVision/FlashVideo: FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation - FoundationVision/FlashVideo
isaiahbjork/orpheus-tts-local
Run Orpheus 3B Locally With LM Studio
Language: Python
#ai #python #text_to_speech #tts
Stars: 263 Issues: 15 Forks: 50
https://github.com/isaiahbjork/orpheus-tts-local
Run Orpheus 3B Locally With LM Studio
Language: Python
#ai #python #text_to_speech #tts
Stars: 263 Issues: 15 Forks: 50
https://github.com/isaiahbjork/orpheus-tts-local
GitHub
GitHub - isaiahbjork/orpheus-tts-local: Run Orpheus 3B Locally With LM Studio
Run Orpheus 3B Locally With LM Studio. Contribute to isaiahbjork/orpheus-tts-local development by creating an account on GitHub.
zobweyt/textcase
A feature-rich Python text case conversion library
Language: Python
#camel_case #case #constant_case #conversion #foss #just #kebab_case #lower_case #mypy #nix #pascal_case #pypi #pytest #python #ruff #sentence_case #snake_case #text #title_case #upper_case
Stars: 165 Issues: 2 Forks: 0
https://github.com/zobweyt/textcase
A feature-rich Python text case conversion library
Language: Python
#camel_case #case #constant_case #conversion #foss #just #kebab_case #lower_case #mypy #nix #pascal_case #pypi #pytest #python #ruff #sentence_case #snake_case #text #title_case #upper_case
Stars: 165 Issues: 2 Forks: 0
https://github.com/zobweyt/textcase
GitHub
GitHub - zobweyt/textcase: Python library for text case conversions
Python library for text case conversions. Contribute to zobweyt/textcase development by creating an account on GitHub.
mirth/chonky
Fully neural approach for text chunking
Language: Python
#ai #chunking #llms #ml #rag #semantic_chunking #text_splitter
Stars: 232 Issues: 0 Forks: 6
https://github.com/mirth/chonky
Fully neural approach for text chunking
Language: Python
#ai #chunking #llms #ml #rag #semantic_chunking #text_splitter
Stars: 232 Issues: 0 Forks: 6
https://github.com/mirth/chonky
GitHub
GitHub - mirth/chonky: Fully neural approach for text chunking
Fully neural approach for text chunking. Contribute to mirth/chonky development by creating an account on GitHub.
nari-labs/dia
A TTS model capable of generating ultra-realistic dialogue in one pass.
Language: Python
#ai #open_weight #text_to_speech
Stars: 2047 Issues: 8 Forks: 92
https://github.com/nari-labs/dia
A TTS model capable of generating ultra-realistic dialogue in one pass.
Language: Python
#ai #open_weight #text_to_speech
Stars: 2047 Issues: 8 Forks: 92
https://github.com/nari-labs/dia
GitHub
GitHub - nari-labs/dia: A TTS model capable of generating ultra-realistic dialogue in one pass.
A TTS model capable of generating ultra-realistic dialogue in one pass. - nari-labs/dia
JAMESYJL/ShapeLLM-Omni
A Native Multimodal LLM for 3D Generation and Understanding
Language: Python
#3d_captioning #3d_editing #image_to_3d #llm #text_to_3d
Stars: 223 Issues: 3 Forks: 6
https://github.com/JAMESYJL/ShapeLLM-Omni
A Native Multimodal LLM for 3D Generation and Understanding
Language: Python
#3d_captioning #3d_editing #image_to_3d #llm #text_to_3d
Stars: 223 Issues: 3 Forks: 6
https://github.com/JAMESYJL/ShapeLLM-Omni
GitHub
GitHub - JAMESYJL/ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding
A Native Multimodal LLM for 3D Generation and Understanding - JAMESYJL/ShapeLLM-Omni
Tencent-Hunyuan/Hunyuan3D-2.1
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
Language: Python
#3d #3d_aigc #3d_generation #hunyuan3d #image_to_3d #shape #shape_generation #text_to_3d #texture_genertaion
Stars: 427 Issues: 13 Forks: 28
https://github.com/Tencent-Hunyuan/Hunyuan3D-2.1
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
Language: Python
#3d #3d_aigc #3d_generation #hunyuan3d #image_to_3d #shape #shape_generation #text_to_3d #texture_genertaion
Stars: 427 Issues: 13 Forks: 28
https://github.com/Tencent-Hunyuan/Hunyuan3D-2.1
GitHub
GitHub - Tencent-Hunyuan/Hunyuan3D-2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material - Tencent-Hunyuan/Hunyuan3D-2.1