mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
#image_animation #image_to_video_generation #video_generation
Stars: 445 Issues: 0 Forks: 10
https://github.com/mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
#image_animation #image_to_video_generation #video_generation
Stars: 445 Issues: 0 Forks: 10
https://github.com/mayuelala/FollowYourClick
GitHub
GitHub - mayuelala/FollowYourClick: [AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click:…
[AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts" - GitHub - mayuelala/Foll...
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
Language: Python
#auto_regressive_model #diffusion_models #image_generation #transformers
Stars: 440 Issues: 6 Forks: 10
https://github.com/FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
Language: Python
#auto_regressive_model #diffusion_models #image_generation #transformers
Stars: 440 Issues: 6 Forks: 10
https://github.com/FoundationVision/VAR
GitHub
GitHub - FoundationVision/VAR: [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official…
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Predi...
AiuniAI/Unique3D
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Language: Python
#3d_aigc #aigc #image_to_3d
Stars: 262 Issues: 4 Forks: 12
https://github.com/AiuniAI/Unique3D
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Language: Python
#3d_aigc #aigc #image_to_3d
Stars: 262 Issues: 4 Forks: 12
https://github.com/AiuniAI/Unique3D
GitHub
GitHub - AiuniAI/Unique3D: [NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image - AiuniAI/Unique3D
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python
#face_animation #image_animation #video_animation
Stars: 653 Issues: 5 Forks: 102
https://github.com/fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python
#face_animation #image_animation #video_animation
Stars: 653 Issues: 5 Forks: 102
https://github.com/fudan-generative-vision/hallo
GitHub
GitHub - fudan-generative-vision/hallo: Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation - fudan-generative-vision/hallo
gcui-art/album-ai
AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery/.
Language: TypeScript
#ai #album #gpt_4o_mini #haiku #image #llm #rag
Stars: 272 Issues: 1 Forks: 23
https://github.com/gcui-art/album-ai
AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery/.
Language: TypeScript
#ai #album #gpt_4o_mini #haiku #image #llm #rag
Stars: 272 Issues: 1 Forks: 23
https://github.com/gcui-art/album-ai
GitHub
GitHub - gcui-art/album-ai: AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery.
AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery. - gcui-art/album-ai
C-Naoki/image-stitcher
This is a python implementation for stitching images.
Language: Jupyter Notebook
#image_analysis #images #python
Stars: 190 Issues: 0 Forks: 4
https://github.com/C-Naoki/image-stitcher
This is a python implementation for stitching images.
Language: Jupyter Notebook
#image_analysis #images #python
Stars: 190 Issues: 0 Forks: 4
https://github.com/C-Naoki/image-stitcher
GitHub
GitHub - C-Naoki/image-stitcher: This is a python implementation for stitching images.
This is a python implementation for stitching images. - C-Naoki/image-stitcher
facebookresearch/watermark-anything
Official implementation of the paper "Watermark Anything with Localized Messages"
Language: Jupyter Notebook
#image #watermarking
Stars: 450 Issues: 0 Forks: 6
https://github.com/facebookresearch/watermark-anything
Official implementation of the paper "Watermark Anything with Localized Messages"
Language: Jupyter Notebook
#image #watermarking
Stars: 450 Issues: 0 Forks: 6
https://github.com/facebookresearch/watermark-anything
GitHub
GitHub - facebookresearch/watermark-anything: Official implementation of the paper "Watermark Anything with Localized Messages"
Official implementation of the paper "Watermark Anything with Localized Messages" - facebookresearch/watermark-anything
magic-quill/MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Language: Python
#aigc #image_editing #mllm
Stars: 531 Issues: 7 Forks: 32
https://github.com/magic-quill/MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Language: Python
#aigc #image_editing #mllm
Stars: 531 Issues: 7 Forks: 32
https://github.com/magic-quill/MagicQuill
GitHub
GitHub - ant-research/MagicQuill: [CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing…
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System - ant-research/MagicQuill
Lightricks/LTX-Video
Official repository for LTX-Video
Language: Python
#diffusion_models #dit #image_to_video #image_to_video_generation #text_to_video #text_to_video_generation
Stars: 241 Issues: 8 Forks: 9
https://github.com/Lightricks/LTX-Video
Official repository for LTX-Video
Language: Python
#diffusion_models #dit #image_to_video #image_to_video_generation #text_to_video #text_to_video_generation
Stars: 241 Issues: 8 Forks: 9
https://github.com/Lightricks/LTX-Video
GitHub
GitHub - Lightricks/LTX-Video: Official repository for LTX-Video
Official repository for LTX-Video. Contribute to Lightricks/LTX-Video development by creating an account on GitHub.
Lightricks/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python
#comfyui #diffusion_models #dit #image_to_video #image_to_video_generation #text_to_image #text_to_image_generation
Stars: 217 Issues: 18 Forks: 9
https://github.com/Lightricks/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python
#comfyui #diffusion_models #dit #image_to_video #image_to_video_generation #text_to_image #text_to_image_generation
Stars: 217 Issues: 18 Forks: 9
https://github.com/Lightricks/ComfyUI-LTXVideo
GitHub
GitHub - Lightricks/ComfyUI-LTXVideo: LTX-Video Support for ComfyUI
LTX-Video Support for ComfyUI. Contribute to Lightricks/ComfyUI-LTXVideo development by creating an account on GitHub.
TencentARC/BrushEdit
The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
Language: Python
#diffusion_models #image_editing #image_inpainting
Stars: 262 Issues: 4 Forks: 12
https://github.com/TencentARC/BrushEdit
The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
Language: Python
#diffusion_models #image_editing #image_inpainting
Stars: 262 Issues: 4 Forks: 12
https://github.com/TencentARC/BrushEdit
GitHub
GitHub - TencentARC/BrushEdit: [TPAMI under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting…
[TPAMI under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing" - TencentARC/BrushEdit
baaivision/NOVA
NOVA: Autoregressive Video Generation without Vector Quantization
Language: Python
#autoregressive_models #diffusion_models #image_generation #video_generation
Stars: 145 Issues: 1 Forks: 2
https://github.com/baaivision/NOVA
NOVA: Autoregressive Video Generation without Vector Quantization
Language: Python
#autoregressive_models #diffusion_models #image_generation #video_generation
Stars: 145 Issues: 1 Forks: 2
https://github.com/baaivision/NOVA
GitHub
GitHub - baaivision/NOVA: [ICLR 2025] Autoregressive Video Generation without Vector Quantization
[ICLR 2025] Autoregressive Video Generation without Vector Quantization - baaivision/NOVA
Tencent/HunyuanVideo-I2V
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
Language: Python
#diffusion_models #image_to_video #image_to_video_generation #videogeneration
Stars: 646 Issues: 12 Forks: 32
https://github.com/Tencent/HunyuanVideo-I2V
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
Language: Python
#diffusion_models #image_to_video #image_to_video_generation #videogeneration
Stars: 646 Issues: 12 Forks: 32
https://github.com/Tencent/HunyuanVideo-I2V
GitHub
GitHub - Tencent-Hunyuan/HunyuanVideo-I2V: HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo - Tencent-Hunyuan/HunyuanVideo-I2V
VAST-AI-Research/TripoSG
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
Language: Python
#3d_genai #3d_generation #3d_reconstruction #image_to_3d
Stars: 249 Issues: 7 Forks: 15
https://github.com/VAST-AI-Research/TripoSG
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
Language: Python
#3d_genai #3d_generation #3d_reconstruction #image_to_3d
Stars: 249 Issues: 7 Forks: 15
https://github.com/VAST-AI-Research/TripoSG
GitHub
GitHub - VAST-AI-Research/TripoSG: TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models - VAST-AI-Research/TripoSG
VAST-AI-Research/TripoSF
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling
Language: Python
#3d_generation #3d_reconstruction #flexicubes #image_to_3d
Stars: 237 Issues: 2 Forks: 6
https://github.com/VAST-AI-Research/TripoSF
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling
Language: Python
#3d_generation #3d_reconstruction #flexicubes #image_to_3d
Stars: 237 Issues: 2 Forks: 6
https://github.com/VAST-AI-Research/TripoSF
GitHub
GitHub - VAST-AI-Research/TripoSF: SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling - VAST-AI-Research/TripoSF
lum3on/comfyui_HiDream-Sampler
ComfyUI Wrapper for HiDream
Language: Python
#ai_art #comfy_nodes #comfyui #custom_node #diffusers #image_generation
Stars: 222 Issues: 32 Forks: 18
https://github.com/lum3on/comfyui_HiDream-Sampler
ComfyUI Wrapper for HiDream
Language: Python
#ai_art #comfy_nodes #comfyui #custom_node #diffusers #image_generation
Stars: 222 Issues: 32 Forks: 18
https://github.com/lum3on/comfyui_HiDream-Sampler
GitHub
GitHub - lum3on/comfyui_HiDream-Sampler: ComfyUI Wrapper for HiDream
ComfyUI Wrapper for HiDream. Contribute to lum3on/comfyui_HiDream-Sampler development by creating an account on GitHub.
River-Zhang/ICEdit
Repository for paper "In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer"
Language: Python
#diffusion #diffusion_models #diffusion_transformer #dit #editing_image #image_editing #in_context
Stars: 136 Issues: 1 Forks: 3
https://github.com/River-Zhang/ICEdit
Repository for paper "In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer"
Language: Python
#diffusion #diffusion_models #diffusion_transformer #dit #editing_image #image_editing #in_context
Stars: 136 Issues: 1 Forks: 3
https://github.com/River-Zhang/ICEdit
GitHub
GitHub - River-Zhang/ICEdit: Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released!…
Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enou...
Tencent/HunyuanCustom
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
Language: Python
#audio_driven #diffusion_models #image_to_video #image_to_video_generation #video_editing #video_generation
Stars: 360 Issues: 4 Forks: 14
https://github.com/Tencent/HunyuanCustom
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
Language: Python
#audio_driven #diffusion_models #image_to_video #image_to_video_generation #video_editing #video_generation
Stars: 360 Issues: 4 Forks: 14
https://github.com/Tencent/HunyuanCustom
GitHub
GitHub - Tencent/HunyuanCustom: HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation - Tencent/HunyuanCustom
JAMESYJL/ShapeLLM-Omni
A Native Multimodal LLM for 3D Generation and Understanding
Language: Python
#3d_captioning #3d_editing #image_to_3d #llm #text_to_3d
Stars: 223 Issues: 3 Forks: 6
https://github.com/JAMESYJL/ShapeLLM-Omni
A Native Multimodal LLM for 3D Generation and Understanding
Language: Python
#3d_captioning #3d_editing #image_to_3d #llm #text_to_3d
Stars: 223 Issues: 3 Forks: 6
https://github.com/JAMESYJL/ShapeLLM-Omni
GitHub
GitHub - JAMESYJL/ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding
A Native Multimodal LLM for 3D Generation and Understanding - JAMESYJL/ShapeLLM-Omni
Tencent-Hunyuan/Hunyuan3D-2.1
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
Language: Python
#3d #3d_aigc #3d_generation #hunyuan3d #image_to_3d #shape #shape_generation #text_to_3d #texture_genertaion
Stars: 427 Issues: 13 Forks: 28
https://github.com/Tencent-Hunyuan/Hunyuan3D-2.1
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
Language: Python
#3d #3d_aigc #3d_generation #hunyuan3d #image_to_3d #shape #shape_generation #text_to_3d #texture_genertaion
Stars: 427 Issues: 13 Forks: 28
https://github.com/Tencent-Hunyuan/Hunyuan3D-2.1
GitHub
GitHub - Tencent-Hunyuan/Hunyuan3D-2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material - Tencent-Hunyuan/Hunyuan3D-2.1