GitHub repos

mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
#image_animation #image_to_video_generation #video_generation
Stars: 445 Issues: 0 Forks: 10
https://github.com/mayuelala/FollowYourClick

GitHub

GitHub - mayuelala/FollowYourClick: [AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click:…

[AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts" - GitHub - mayuelala/Foll...

3.2K views16:28

GitHub repos

FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
Language: Python
#auto_regressive_model #diffusion_models #image_generation #transformers
Stars: 440 Issues: 6 Forks: 10
https://github.com/FoundationVision/VAR

GitHub

GitHub - FoundationVision/VAR: [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official…

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Predi...

2.4K views16:29

GitHub repos

AiuniAI/Unique3D
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Language: Python
#3d_aigc #aigc #image_to_3d
Stars: 262 Issues: 4 Forks: 12
https://github.com/AiuniAI/Unique3D

GitHub

GitHub - AiuniAI/Unique3D: [NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image - AiuniAI/Unique3D

2.0K views04:00

GitHub repos

fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python
#face_animation #image_animation #video_animation
Stars: 653 Issues: 5 Forks: 102
https://github.com/fudan-generative-vision/hallo

GitHub

GitHub - fudan-generative-vision/hallo: Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation - fudan-generative-vision/hallo

3.2K views10:00

GitHub repos

gcui-art/album-ai
AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery/.
Language: TypeScript
#ai #album #gpt_4o_mini #haiku #image #llm #rag
Stars: 272 Issues: 1 Forks: 23
https://github.com/gcui-art/album-ai

GitHub

GitHub - gcui-art/album-ai: AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery.

AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery. - gcui-art/album-ai

3.1K views22:00

GitHub repos

C-Naoki/image-stitcher
This is a python implementation for stitching images.
Language: Jupyter Notebook
#image_analysis #images #python
Stars: 190 Issues: 0 Forks: 4
https://github.com/C-Naoki/image-stitcher

GitHub

GitHub - C-Naoki/image-stitcher: This is a python implementation for stitching images.

This is a python implementation for stitching images. - C-Naoki/image-stitcher

1.9K views22:00

GitHub repos

facebookresearch/watermark-anything
Official implementation of the paper "Watermark Anything with Localized Messages"
Language: Jupyter Notebook
#image #watermarking
Stars: 450 Issues: 0 Forks: 6
https://github.com/facebookresearch/watermark-anything

GitHub

GitHub - facebookresearch/watermark-anything: Official implementation of the paper "Watermark Anything with Localized Messages"

Official implementation of the paper "Watermark Anything with Localized Messages" - facebookresearch/watermark-anything

1.8K views05:00

GitHub repos

magic-quill/MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Language: Python
#aigc #image_editing #mllm
Stars: 531 Issues: 7 Forks: 32
https://github.com/magic-quill/MagicQuill

GitHub

GitHub - ant-research/MagicQuill: [CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing…

[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System - ant-research/MagicQuill

1.8K views11:00

GitHub repos

Lightricks/LTX-Video
Official repository for LTX-Video
Language: Python
#diffusion_models #dit #image_to_video #image_to_video_generation #text_to_video #text_to_video_generation
Stars: 241 Issues: 8 Forks: 9
https://github.com/Lightricks/LTX-Video

GitHub

GitHub - Lightricks/LTX-Video: Official repository for LTX-Video

Official repository for LTX-Video. Contribute to Lightricks/LTX-Video development by creating an account on GitHub.

1.7K views17:00

GitHub repos

Lightricks/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python
#comfyui #diffusion_models #dit #image_to_video #image_to_video_generation #text_to_image #text_to_image_generation
Stars: 217 Issues: 18 Forks: 9
https://github.com/Lightricks/ComfyUI-LTXVideo

GitHub

GitHub - Lightricks/ComfyUI-LTXVideo: LTX-Video Support for ComfyUI

LTX-Video Support for ComfyUI. Contribute to Lightricks/ComfyUI-LTXVideo development by creating an account on GitHub.

1.7K views23:00

GitHub repos

TencentARC/BrushEdit
The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
Language: Python
#diffusion_models #image_editing #image_inpainting
Stars: 262 Issues: 4 Forks: 12
https://github.com/TencentARC/BrushEdit

GitHub

GitHub - TencentARC/BrushEdit: [TPAMI under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting…

[TPAMI under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing" - TencentARC/BrushEdit

1.9K views11:00

GitHub repos

baaivision/NOVA
NOVA: Autoregressive Video Generation without Vector Quantization
Language: Python
#autoregressive_models #diffusion_models #image_generation #video_generation
Stars: 145 Issues: 1 Forks: 2
https://github.com/baaivision/NOVA

GitHub

GitHub - baaivision/NOVA: [ICLR 2025] Autoregressive Video Generation without Vector Quantization

[ICLR 2025] Autoregressive Video Generation without Vector Quantization - baaivision/NOVA

1.8K views23:00

GitHub repos

Tencent/HunyuanVideo-I2V
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
Language: Python
#diffusion_models #image_to_video #image_to_video_generation #videogeneration
Stars: 646 Issues: 12 Forks: 32
https://github.com/Tencent/HunyuanVideo-I2V

GitHub

GitHub - Tencent-Hunyuan/HunyuanVideo-I2V: HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo - Tencent-Hunyuan/HunyuanVideo-I2V

1.7K views23:00

GitHub repos

VAST-AI-Research/TripoSG
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
Language: Python
#3d_genai #3d_generation #3d_reconstruction #image_to_3d
Stars: 249 Issues: 7 Forks: 15
https://github.com/VAST-AI-Research/TripoSG

GitHub

GitHub - VAST-AI-Research/TripoSG: TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models - VAST-AI-Research/TripoSG

1.7K views22:00

GitHub repos

VAST-AI-Research/TripoSF
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling
Language: Python
#3d_generation #3d_reconstruction #flexicubes #image_to_3d
Stars: 237 Issues: 2 Forks: 6
https://github.com/VAST-AI-Research/TripoSF

GitHub

GitHub - VAST-AI-Research/TripoSF: SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling

SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling - VAST-AI-Research/TripoSF

1.7K views16:00

GitHub repos

lum3on/comfyui_HiDream-Sampler
ComfyUI Wrapper for HiDream
Language: Python
#ai_art #comfy_nodes #comfyui #custom_node #diffusers #image_generation
Stars: 222 Issues: 32 Forks: 18
https://github.com/lum3on/comfyui_HiDream-Sampler

GitHub

GitHub - lum3on/comfyui_HiDream-Sampler: ComfyUI Wrapper for HiDream

ComfyUI Wrapper for HiDream. Contribute to lum3on/comfyui_HiDream-Sampler development by creating an account on GitHub.

1.6K views10:00

GitHub repos

River-Zhang/ICEdit
Repository for paper "In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer"
Language: Python
#diffusion #diffusion_models #diffusion_transformer #dit #editing_image #image_editing #in_context
Stars: 136 Issues: 1 Forks: 3
https://github.com/River-Zhang/ICEdit

GitHub

GitHub - River-Zhang/ICEdit: Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released!…

Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enou...

1.7K views22:00

GitHub repos

Tencent/HunyuanCustom
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
Language: Python
#audio_driven #diffusion_models #image_to_video #image_to_video_generation #video_editing #video_generation
Stars: 360 Issues: 4 Forks: 14
https://github.com/Tencent/HunyuanCustom

GitHub

GitHub - Tencent/HunyuanCustom: HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation - Tencent/HunyuanCustom

1.6K views16:00

GitHub repos

JAMESYJL/ShapeLLM-Omni
A Native Multimodal LLM for 3D Generation and Understanding
Language: Python
#3d_captioning #3d_editing #image_to_3d #llm #text_to_3d
Stars: 223 Issues: 3 Forks: 6
https://github.com/JAMESYJL/ShapeLLM-Omni

GitHub

GitHub - JAMESYJL/ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding

A Native Multimodal LLM for 3D Generation and Understanding - JAMESYJL/ShapeLLM-Omni

1.5K views22:00

GitHub repos

Tencent-Hunyuan/Hunyuan3D-2.1
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
Language: Python
#3d #3d_aigc #3d_generation #hunyuan3d #image_to_3d #shape #shape_generation #text_to_3d #texture_genertaion
Stars: 427 Issues: 13 Forks: 28
https://github.com/Tencent-Hunyuan/Hunyuan3D-2.1

GitHub

GitHub - Tencent-Hunyuan/Hunyuan3D-2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

From Images to High-Fidelity 3D Assets with Production-Ready PBR Material - Tencent-Hunyuan/Hunyuan3D-2.1

1.5K views10:00

About

Blog

Apps

Platform