OpenGVLab/InternChat
InternChat allows you to interact with ChatGPT by clicking, dragging and drawing using a pointing device.
Language: Python
#chatgpt #click #foundation_model #gpt #gpt_4 #gradio #husky #image_captioning #internimage #langchain #llama #llm #multimodal #ocr #sam #segment_anything #vicuna #video #video_generation #vqa
Stars: 231 Issues: 1 Forks: 10
https://github.com/OpenGVLab/InternChat
InternChat allows you to interact with ChatGPT by clicking, dragging and drawing using a pointing device.
Language: Python
#chatgpt #click #foundation_model #gpt #gpt_4 #gradio #husky #image_captioning #internimage #langchain #llama #llm #multimodal #ocr #sam #segment_anything #vicuna #video #video_generation #vqa
Stars: 231 Issues: 1 Forks: 10
https://github.com/OpenGVLab/InternChat
GitHub
GitHub - OpenGVLab/InternGPT: InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now…
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin...
Zeqiang-Lai/DragGAN
Unofficial implementation of "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold"
Language: Python
#draggan #image_editing #image_generation
Stars: 179 Issues: 4 Forks: 21
https://github.com/Zeqiang-Lai/DragGAN
Unofficial implementation of "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold"
Language: Python
#draggan #image_editing #image_generation
Stars: 179 Issues: 4 Forks: 21
https://github.com/Zeqiang-Lai/DragGAN
GitHub
GitHub - OpenGVLab/DragGAN: Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the…
Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, ma...
axodox/axodox-machinelearning
This repository contains a C++ ONNX implementation of StableDiffusion.
Language: C++
#cpp #image_generation #mit_license #native #nuget #onnx #stable_diffusion
Stars: 241 Issues: 1 Forks: 8
https://github.com/axodox/axodox-machinelearning
This repository contains a C++ ONNX implementation of StableDiffusion.
Language: C++
#cpp #image_generation #mit_license #native #nuget #onnx #stable_diffusion
Stars: 241 Issues: 1 Forks: 8
https://github.com/axodox/axodox-machinelearning
GitHub
GitHub - axodox/axodox-machinelearning: This repository contains a pure C++ ONNX implementation of multiple offline AI models,…
This repository contains a pure C++ ONNX implementation of multiple offline AI models, such as StableDiffusion (1.5 and XL), ControlNet, Midas, HED and OpenPose. - axodox/axodox-machinelearning
Yujun-Shi/DragDiffusion
Official code for DragDiffusion
Language: Python
#artificial_intelligence #diffusion_models #dragdiffusion #draggan #image_editing
Stars: 288 Issues: 3 Forks: 23
https://github.com/Yujun-Shi/DragDiffusion
Official code for DragDiffusion
Language: Python
#artificial_intelligence #diffusion_models #dragdiffusion #draggan #image_editing
Stars: 288 Issues: 3 Forks: 23
https://github.com/Yujun-Shi/DragDiffusion
GitHub
GitHub - Yujun-Shi/DragDiffusion: [CVPR2024, Highlight] Official code for DragDiffusion
[CVPR2024, Highlight] Official code for DragDiffusion - Yujun-Shi/DragDiffusion
leejet/stable-diffusion.cpp
Stable Diffusion in pure C/C++
Language: C
#ai #cplusplus #diffusion #ggml #image_generation #latent_diffusion #stable_diffusion #text2image #txt2img
Stars: 238 Issues: 5 Forks: 12
https://github.com/leejet/stable-diffusion.cpp
Stable Diffusion in pure C/C++
Language: C
#ai #cplusplus #diffusion #ggml #image_generation #latent_diffusion #stable_diffusion #text2image #txt2img
Stars: 238 Issues: 5 Forks: 12
https://github.com/leejet/stable-diffusion.cpp
GitHub
GitHub - leejet/stable-diffusion.cpp: Stable Diffusion and Flux in pure C/C++
Stable Diffusion and Flux in pure C/C++. Contribute to leejet/stable-diffusion.cpp development by creating an account on GitHub.
dreamgaussian/dreamgaussian
Generative Gaussian Splatting for Efficient 3D Content Creation
Language: Python
#image_to_3d #text_to_3d
Stars: 307 Issues: 2 Forks: 17
https://github.com/dreamgaussian/dreamgaussian
Generative Gaussian Splatting for Efficient 3D Content Creation
Language: Python
#image_to_3d #text_to_3d
Stars: 307 Issues: 2 Forks: 17
https://github.com/dreamgaussian/dreamgaussian
GitHub
GitHub - dreamgaussian/dreamgaussian: [ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation
[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation - dreamgaussian/dreamgaussian
cvg/glue-factory
Training library for local feature detection and matching
Language: Python
#computer_vision #deep_learning #iccv2023 #image_matching
Stars: 200 Issues: 1 Forks: 13
https://github.com/cvg/glue-factory
Training library for local feature detection and matching
Language: Python
#computer_vision #deep_learning #iccv2023 #image_matching
Stars: 200 Issues: 1 Forks: 13
https://github.com/cvg/glue-factory
GitHub
GitHub - cvg/glue-factory: Training library for local feature detection and matching
Training library for local feature detection and matching - cvg/glue-factory
deepseek-ai/DreamCraft3D
Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
#3d_creation #3d_generation #aigc #diffusion_models #generative_model #image_to_3d
Stars: 304 Issues: 3 Forks: 4
https://github.com/deepseek-ai/DreamCraft3D
Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
#3d_creation #3d_generation #aigc #diffusion_models #generative_model #image_to_3d
Stars: 304 Issues: 3 Forks: 4
https://github.com/deepseek-ai/DreamCraft3D
GitHub
GitHub - deepseek-ai/DreamCraft3D: [ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped…
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior - deepseek-ai/DreamCraft3D
jiawei-ren/dreamgaussian4d
[arXiv 2023] DreamGaussian4D: Generative 4D Gaussian Splatting
Language: Python
#image_to_4d
Stars: 138 Issues: 0 Forks: 5
https://github.com/jiawei-ren/dreamgaussian4d
[arXiv 2023] DreamGaussian4D: Generative 4D Gaussian Splatting
Language: Python
#image_to_4d
Stars: 138 Issues: 0 Forks: 5
https://github.com/jiawei-ren/dreamgaussian4d
GitHub
GitHub - jiawei-ren/dreamgaussian4d: [arXiv 2023] DreamGaussian4D: Generative 4D Gaussian Splatting
[arXiv 2023] DreamGaussian4D: Generative 4D Gaussian Splatting - jiawei-ren/dreamgaussian4d
LiheYoung/Depth-Anything
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Language: Python
#depth_estimation #image_synthesis #metric_depth_estimation #monocular_depth_estimation
Stars: 1116 Issues: 11 Forks: 62
https://github.com/LiheYoung/Depth-Anything
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Language: Python
#depth_estimation #image_synthesis #metric_depth_estimation #monocular_depth_estimation
Stars: 1116 Issues: 11 Forks: 62
https://github.com/LiheYoung/Depth-Anything
GitHub
GitHub - LiheYoung/Depth-Anything: [CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model…
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation - LiheYoung/Depth-Anything
YangLing0818/RPG-DiffusionMaster
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Language: Python
#image_editing #large_language_models #multimodal_large_language_models #text_to_image_diffusion
Stars: 272 Issues: 5 Forks: 14
https://github.com/YangLing0818/RPG-DiffusionMaster
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Language: Python
#image_editing #large_language_models #multimodal_large_language_models #text_to_image_diffusion
Stars: 272 Issues: 5 Forks: 14
https://github.com/YangLing0818/RPG-DiffusionMaster
GitHub
GitHub - YangLing0818/RPG-DiffusionMaster: [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating…
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) - YangLing0818/RPG-DiffusionMaster
3DTopia/LGM
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
Language: Python
#gaussian_splatting #image_to_3d #text_to_3d
Stars: 308 Issues: 7 Forks: 15
https://github.com/3DTopia/LGM
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
Language: Python
#gaussian_splatting #image_to_3d #text_to_3d
Stars: 308 Issues: 7 Forks: 15
https://github.com/3DTopia/LGM
GitHub
GitHub - 3DTopia/LGM: [ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation. - 3DTopia/LGM
mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
#image_animation #image_to_video_generation #video_generation
Stars: 445 Issues: 0 Forks: 10
https://github.com/mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
#image_animation #image_to_video_generation #video_generation
Stars: 445 Issues: 0 Forks: 10
https://github.com/mayuelala/FollowYourClick
GitHub
GitHub - mayuelala/FollowYourClick: [arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click:…
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts" - GitHub - mayuelala/Fol...
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
Language: Python
#auto_regressive_model #diffusion_models #image_generation #transformers
Stars: 440 Issues: 6 Forks: 10
https://github.com/FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
Language: Python
#auto_regressive_model #diffusion_models #image_generation #transformers
Stars: 440 Issues: 6 Forks: 10
https://github.com/FoundationVision/VAR
GitHub
GitHub - FoundationVision/VAR: [NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of…
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction&qu...
AiuniAI/Unique3D
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Language: Python
#3d_aigc #aigc #image_to_3d
Stars: 262 Issues: 4 Forks: 12
https://github.com/AiuniAI/Unique3D
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Language: Python
#3d_aigc #aigc #image_to_3d
Stars: 262 Issues: 4 Forks: 12
https://github.com/AiuniAI/Unique3D
GitHub
GitHub - AiuniAI/Unique3D: [NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image - AiuniAI/Unique3D
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python
#face_animation #image_animation #video_animation
Stars: 653 Issues: 5 Forks: 102
https://github.com/fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python
#face_animation #image_animation #video_animation
Stars: 653 Issues: 5 Forks: 102
https://github.com/fudan-generative-vision/hallo
GitHub
GitHub - fudan-generative-vision/hallo: Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation - fudan-generative-vision/hallo
gcui-art/album-ai
AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery/.
Language: TypeScript
#ai #album #gpt_4o_mini #haiku #image #llm #rag
Stars: 272 Issues: 1 Forks: 23
https://github.com/gcui-art/album-ai
AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery/.
Language: TypeScript
#ai #album #gpt_4o_mini #haiku #image #llm #rag
Stars: 272 Issues: 1 Forks: 23
https://github.com/gcui-art/album-ai
GitHub
GitHub - gcui-art/album-ai: AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery.
AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery. - gcui-art/album-ai
C-Naoki/image-stitcher
This is a python implementation for stitching images.
Language: Jupyter Notebook
#image_analysis #images #python
Stars: 190 Issues: 0 Forks: 4
https://github.com/C-Naoki/image-stitcher
This is a python implementation for stitching images.
Language: Jupyter Notebook
#image_analysis #images #python
Stars: 190 Issues: 0 Forks: 4
https://github.com/C-Naoki/image-stitcher
GitHub
GitHub - C-Naoki/image-stitcher: This is a python implementation for stitching images.
This is a python implementation for stitching images. - C-Naoki/image-stitcher
facebookresearch/watermark-anything
Official implementation of the paper "Watermark Anything with Localized Messages"
Language: Jupyter Notebook
#image #watermarking
Stars: 450 Issues: 0 Forks: 6
https://github.com/facebookresearch/watermark-anything
Official implementation of the paper "Watermark Anything with Localized Messages"
Language: Jupyter Notebook
#image #watermarking
Stars: 450 Issues: 0 Forks: 6
https://github.com/facebookresearch/watermark-anything
GitHub
GitHub - facebookresearch/watermark-anything: Official implementation of the paper "Watermark Anything with Localized Messages"
Official implementation of the paper "Watermark Anything with Localized Messages" - facebookresearch/watermark-anything
magic-quill/MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Language: Python
#aigc #image_editing #mllm
Stars: 531 Issues: 7 Forks: 32
https://github.com/magic-quill/MagicQuill
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Language: Python
#aigc #image_editing #mllm
Stars: 531 Issues: 7 Forks: 32
https://github.com/magic-quill/MagicQuill
GitHub
GitHub - magic-quill/MagicQuill: Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System - magic-quill/MagicQuill