lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
Language: Python
#diffusion_transformer #flow_matching #mlx #text_to_speech #tts
Stars: 193 Issues: 2 Forks: 17
https://github.com/lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
Language: Python
#diffusion_transformer #flow_matching #mlx #text_to_speech #tts
Stars: 193 Issues: 2 Forks: 17
https://github.com/lucasnewman/f5-tts-mlx
GitHub
GitHub - lucasnewman/f5-tts-mlx: Implementation of F5-TTS in MLX
Implementation of F5-TTS in MLX. Contribute to lucasnewman/f5-tts-mlx development by creating an account on GitHub.
shallowdream204/DreamClear
[NeurIPS 2024🔥] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Language: Python
#diffusion_transformer #pixelart #restoration #super_resolution
Stars: 307 Issues: 5 Forks: 12
https://github.com/shallowdream204/DreamClear
[NeurIPS 2024🔥] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Language: Python
#diffusion_transformer #pixelart #restoration #super_resolution
Stars: 307 Issues: 5 Forks: 12
https://github.com/shallowdream204/DreamClear
GitHub
GitHub - shallowdream204/DreamClear: [NeurIPS 2024] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset…
[NeurIPS 2024] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation - shallowdream204/DreamClear
mit-han-lab/nunchaku
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Language: Cuda
#diffusion_models #flux #genai #lora #mlsys #quantization
Stars: 249 Issues: 10 Forks: 13
https://github.com/mit-han-lab/nunchaku
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Language: Cuda
#diffusion_models #flux #genai #lora #mlsys #quantization
Stars: 249 Issues: 10 Forks: 13
https://github.com/mit-han-lab/nunchaku
GitHub
GitHub - nunchaku-tech/nunchaku: [ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models - nunchaku-tech/nunchaku
lucidrains/MIMO-pytorch
Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group
Language: Python
#artificial_intelligence #character_video_synthesis #deep_learning #diffusion
Stars: 112 Issues: 0 Forks: 4
https://github.com/lucidrains/MIMO-pytorch
Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group
Language: Python
#artificial_intelligence #character_video_synthesis #deep_learning #diffusion
Stars: 112 Issues: 0 Forks: 4
https://github.com/lucidrains/MIMO-pytorch
GitHub
GitHub - lucidrains/MIMO-pytorch: Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed…
Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group - lucidrains/MIMO-pytorch
Lightricks/LTX-Video
Official repository for LTX-Video
Language: Python
#diffusion_models #dit #image_to_video #image_to_video_generation #text_to_video #text_to_video_generation
Stars: 241 Issues: 8 Forks: 9
https://github.com/Lightricks/LTX-Video
Official repository for LTX-Video
Language: Python
#diffusion_models #dit #image_to_video #image_to_video_generation #text_to_video #text_to_video_generation
Stars: 241 Issues: 8 Forks: 9
https://github.com/Lightricks/LTX-Video
GitHub
GitHub - Lightricks/LTX-Video: Official repository for LTX-Video
Official repository for LTX-Video. Contribute to Lightricks/LTX-Video development by creating an account on GitHub.
Lightricks/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python
#comfyui #diffusion_models #dit #image_to_video #image_to_video_generation #text_to_image #text_to_image_generation
Stars: 217 Issues: 18 Forks: 9
https://github.com/Lightricks/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python
#comfyui #diffusion_models #dit #image_to_video #image_to_video_generation #text_to_image #text_to_image_generation
Stars: 217 Issues: 18 Forks: 9
https://github.com/Lightricks/ComfyUI-LTXVideo
GitHub
GitHub - Lightricks/ComfyUI-LTXVideo: LTX-Video Support for ComfyUI
LTX-Video Support for ComfyUI. Contribute to Lightricks/ComfyUI-LTXVideo development by creating an account on GitHub.
TencentARC/BrushEdit
The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
Language: Python
#diffusion_models #image_editing #image_inpainting
Stars: 262 Issues: 4 Forks: 12
https://github.com/TencentARC/BrushEdit
The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
Language: Python
#diffusion_models #image_editing #image_inpainting
Stars: 262 Issues: 4 Forks: 12
https://github.com/TencentARC/BrushEdit
GitHub
GitHub - TencentARC/BrushEdit: [TPAMI under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting…
[TPAMI under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing" - TencentARC/BrushEdit
baaivision/NOVA
NOVA: Autoregressive Video Generation without Vector Quantization
Language: Python
#autoregressive_models #diffusion_models #image_generation #video_generation
Stars: 145 Issues: 1 Forks: 2
https://github.com/baaivision/NOVA
NOVA: Autoregressive Video Generation without Vector Quantization
Language: Python
#autoregressive_models #diffusion_models #image_generation #video_generation
Stars: 145 Issues: 1 Forks: 2
https://github.com/baaivision/NOVA
GitHub
GitHub - baaivision/NOVA: [ICLR 2025] Autoregressive Video Generation without Vector Quantization
[ICLR 2025] Autoregressive Video Generation without Vector Quantization - baaivision/NOVA
❤1
Tencent/HunyuanVideo-I2V
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
Language: Python
#diffusion_models #image_to_video #image_to_video_generation #videogeneration
Stars: 646 Issues: 12 Forks: 32
https://github.com/Tencent/HunyuanVideo-I2V
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
Language: Python
#diffusion_models #image_to_video #image_to_video_generation #videogeneration
Stars: 646 Issues: 12 Forks: 32
https://github.com/Tencent/HunyuanVideo-I2V
GitHub
GitHub - Tencent-Hunyuan/HunyuanVideo-I2V: HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo - Tencent-Hunyuan/HunyuanVideo-I2V
SandAI-org/MAGI-1
MAGI-1: Autoregressive Video Generation at Scale
Language: Python
#autoregressive #diffusion_models #video_generation
Stars: 911 Issues: 7 Forks: 32
https://github.com/SandAI-org/MAGI-1
MAGI-1: Autoregressive Video Generation at Scale
Language: Python
#autoregressive #diffusion_models #video_generation
Stars: 911 Issues: 7 Forks: 32
https://github.com/SandAI-org/MAGI-1
GitHub
GitHub - SandAI-org/MAGI-1: MAGI-1: Autoregressive Video Generation at Scale
MAGI-1: Autoregressive Video Generation at Scale. Contribute to SandAI-org/MAGI-1 development by creating an account on GitHub.
👍1
River-Zhang/ICEdit
Repository for paper "In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer"
Language: Python
#diffusion #diffusion_models #diffusion_transformer #dit #editing_image #image_editing #in_context
Stars: 136 Issues: 1 Forks: 3
https://github.com/River-Zhang/ICEdit
Repository for paper "In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer"
Language: Python
#diffusion #diffusion_models #diffusion_transformer #dit #editing_image #image_editing #in_context
Stars: 136 Issues: 1 Forks: 3
https://github.com/River-Zhang/ICEdit
GitHub
GitHub - River-Zhang/ICEdit: Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released!…
Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enou...
👍1
Tencent/HunyuanCustom
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
Language: Python
#audio_driven #diffusion_models #image_to_video #image_to_video_generation #video_editing #video_generation
Stars: 360 Issues: 4 Forks: 14
https://github.com/Tencent/HunyuanCustom
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
Language: Python
#audio_driven #diffusion_models #image_to_video #image_to_video_generation #video_editing #video_generation
Stars: 360 Issues: 4 Forks: 14
https://github.com/Tencent/HunyuanCustom
GitHub
GitHub - Tencent/HunyuanCustom: HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation - Tencent/HunyuanCustom
❤1
Gen-Verse/MMaDA
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models
Language: Python
#diffusion_models #llm_reasoning #unified_multimodal_understanding_and_generation
Stars: 494 Issues: 4 Forks: 13
https://github.com/Gen-Verse/MMaDA
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models
Language: Python
#diffusion_models #llm_reasoning #unified_multimodal_understanding_and_generation
Stars: 494 Issues: 4 Forks: 13
https://github.com/Gen-Verse/MMaDA
GitHub
GitHub - Gen-Verse/MMaDA: [NeurIPS 2025] MMaDA - Open-Sourced Multimodal Large Diffusion Language Models
[NeurIPS 2025] MMaDA - Open-Sourced Multimodal Large Diffusion Language Models - Gen-Verse/MMaDA
haidog-yaqub/MeanFlow
Pytorch Implementation (unofficial) of the paper "Mean Flows for One-step Generative Modeling" by Geng et al.
Language: Python
#diffusion_models #flow_matching #generative_model
Stars: 185 Issues: 4 Forks: 7
https://github.com/haidog-yaqub/MeanFlow
Pytorch Implementation (unofficial) of the paper "Mean Flows for One-step Generative Modeling" by Geng et al.
Language: Python
#diffusion_models #flow_matching #generative_model
Stars: 185 Issues: 4 Forks: 7
https://github.com/haidog-yaqub/MeanFlow
GitHub
GitHub - haidog-yaqub/MeanFlow: Pytorch Implementation (unofficial) of the paper "Mean Flows for One-step Generative Modeling"…
Pytorch Implementation (unofficial) of the paper "Mean Flows for One-step Generative Modeling" by Geng et al. - haidog-yaqub/MeanFlow
liuff19/LangScene-X
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
Language: Python
#3d_reconstruction #diffusion #unified_model #video_generation
Stars: 197 Issues: 1 Forks: 12
https://github.com/liuff19/LangScene-X
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
Language: Python
#3d_reconstruction #diffusion #unified_model #video_generation
Stars: 197 Issues: 1 Forks: 12
https://github.com/liuff19/LangScene-X
GitHub
GitHub - liuff19/LangScene-X: [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video…
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion - liuff19/LangScene-X
krea-ai/flux-krea
Official GitHub repository for FLUX.1 Krea [dev].
Language: Python
#diffusion_models #flux #machine_learning #text_to_image
Stars: 199 Issues: 3 Forks: 7
https://github.com/krea-ai/flux-krea
Official GitHub repository for FLUX.1 Krea [dev].
Language: Python
#diffusion_models #flux #machine_learning #text_to_image
Stars: 199 Issues: 3 Forks: 7
https://github.com/krea-ai/flux-krea
GitHub
GitHub - krea-ai/flux-krea: Official GitHub repository for FLUX.1 Krea [dev].
Official GitHub repository for FLUX.1 Krea [dev]. Contribute to krea-ai/flux-krea development by creating an account on GitHub.
pengzhangzhi/Open-dLLM
The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
Language: Python
#diffusion_models #large_language_models
Stars: 159 Issues: 3 Forks: 5
https://github.com/pengzhangzhi/Open-dLLM
The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
Language: Python
#diffusion_models #large_language_models
Stars: 159 Issues: 3 Forks: 5
https://github.com/pengzhangzhi/Open-dLLM
GitHub
GitHub - pengzhangzhi/Open-dLLM: The most open diffusion language model for code generation — releasing pretraining, evaluation…
The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints. - pengzhangzhi/Open-dLLM
Tencent-Hunyuan/HunyuanImage-2.1
HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation​
Language: Python
#aigc #diffusion_models #diffusion_transformer #image_generation #text_to_image
Stars: 255 Issues: 7 Forks: 16
https://github.com/Tencent-Hunyuan/HunyuanImage-2.1
HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation​
Language: Python
#aigc #diffusion_models #diffusion_transformer #image_generation #text_to_image
Stars: 255 Issues: 7 Forks: 16
https://github.com/Tencent-Hunyuan/HunyuanImage-2.1
GitHub
GitHub - Tencent-Hunyuan/HunyuanImage-2.1: HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image…
HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation​ - Tencent-Hunyuan/HunyuanImage-2.1
❤1
Alpha-VLLM/Lumina-DiMOO
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
Language: Python
#diffusion_large_language_model #discrete_diffusion_models #unified_multimodal_understanding_and_generation
Stars: 221 Issues: 1 Forks: 6
https://github.com/Alpha-VLLM/Lumina-DiMOO
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
Language: Python
#diffusion_large_language_model #discrete_diffusion_models #unified_multimodal_understanding_and_generation
Stars: 221 Issues: 1 Forks: 6
https://github.com/Alpha-VLLM/Lumina-DiMOO
GitHub
GitHub - Alpha-VLLM/Lumina-DiMOO: Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model - Alpha-VLLM/Lumina-DiMOO
OpenImagingLab/FlashVSR
Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional decoder.
Language: Python
#diffusion_models #video_super_resolution
Stars: 218 Issues: 5 Forks: 4
https://github.com/OpenImagingLab/FlashVSR
Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional decoder.
Language: Python
#diffusion_models #video_super_resolution
Stars: 218 Issues: 5 Forks: 4
https://github.com/OpenImagingLab/FlashVSR
GitHub
GitHub - OpenImagingLab/FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion…
Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional de...