lucidrains/video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
Language: Python
#artificial_intelligence #ddpm #deep_learning #text_to_video #video_generation
Stars: 81 Issues: 1 Forks: 2
https://github.com/lucidrains/video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
Language: Python
#artificial_intelligence #ddpm #deep_learning #text_to_video #video_generation
Stars: 81 Issues: 1 Forks: 2
https://github.com/lucidrains/video-diffusion-pytorch
GitHub
GitHub - lucidrains/video-diffusion-pytorch: Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs…
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch - lucidrains/video-diffusion-pytorch
jaketae/storyteller
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
Language: Python
#gpt #image_generation #pytorch #stable_diffusion #text_to_image #text_to_speech #text_to_video #video_generation
Stars: 119 Issues: 1 Forks: 6
https://github.com/jaketae/storyteller
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
Language: Python
#gpt #image_generation #pytorch #stable_diffusion #text_to_image #text_to_speech #text_to_video #video_generation
Stars: 119 Issues: 1 Forks: 6
https://github.com/jaketae/storyteller
GitHub
GitHub - jaketae/storyteller: Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech - jaketae/storyteller
OpenGVLab/InternChat
InternChat allows you to interact with ChatGPT by clicking, dragging and drawing using a pointing device.
Language: Python
#chatgpt #click #foundation_model #gpt #gpt_4 #gradio #husky #image_captioning #internimage #langchain #llama #llm #multimodal #ocr #sam #segment_anything #vicuna #video #video_generation #vqa
Stars: 231 Issues: 1 Forks: 10
https://github.com/OpenGVLab/InternChat
InternChat allows you to interact with ChatGPT by clicking, dragging and drawing using a pointing device.
Language: Python
#chatgpt #click #foundation_model #gpt #gpt_4 #gradio #husky #image_captioning #internimage #langchain #llama #llm #multimodal #ocr #sam #segment_anything #vicuna #video #video_generation #vqa
Stars: 231 Issues: 1 Forks: 10
https://github.com/OpenGVLab/InternChat
GitHub
GitHub - OpenGVLab/InternGPT: InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now…
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin...
TianxingWu/FreeInit
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Language: Python
#aigc #text_to_video #video_diffusion_model #video_generation
Stars: 162 Issues: 4 Forks: 7
https://github.com/TianxingWu/FreeInit
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Language: Python
#aigc #text_to_video #video_diffusion_model #video_generation
Stars: 162 Issues: 4 Forks: 7
https://github.com/TianxingWu/FreeInit
GitHub
GitHub - TianxingWu/FreeInit: [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models - TianxingWu/FreeInit
ali-vilab/dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Language: Python
#audio_visual_learning #face_animation #talking_head #video_generation
Stars: 217 Issues: 7 Forks: 20
https://github.com/ali-vilab/dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Language: Python
#audio_visual_learning #face_animation #talking_head #video_generation
Stars: 217 Issues: 7 Forks: 20
https://github.com/ali-vilab/dreamtalk
GitHub
GitHub - ali-vilab/dreamtalk: Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion…
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models - ali-vilab/dreamtalk
mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
#image_animation #image_to_video_generation #video_generation
Stars: 445 Issues: 0 Forks: 10
https://github.com/mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
#image_animation #image_to_video_generation #video_generation
Stars: 445 Issues: 0 Forks: 10
https://github.com/mayuelala/FollowYourClick
GitHub
GitHub - mayuelala/FollowYourClick: [AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click:…
[AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts" - GitHub - mayuelala/Foll...
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language: Python
#diffusion_models #long_video_generation #metamorphic_video_generation #open_sora_plan #text_to_video #time_lapse #time_lapse_dataset #video_generation
Stars: 281 Issues: 4 Forks: 16
https://github.com/PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language: Python
#diffusion_models #long_video_generation #metamorphic_video_generation #open_sora_plan #text_to_video #time_lapse #time_lapse_dataset #video_generation
Stars: 281 Issues: 4 Forks: 16
https://github.com/PKU-YuanGroup/MagicTime
GitHub
GitHub - PKU-YuanGroup/MagicTime: [TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators - PKU-YuanGroup/MagicTime
jy0205/Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Language: Python
#diffusion_models #flow_matching #video_generation
Stars: 613 Issues: 10 Forks: 47
https://github.com/jy0205/Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Language: Python
#diffusion_models #flow_matching #video_generation
Stars: 613 Issues: 10 Forks: 47
https://github.com/jy0205/Pyramid-Flow
GitHub
GitHub - jy0205/Pyramid-Flow: [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling - jy0205/Pyramid-Flow
baaivision/NOVA
NOVA: Autoregressive Video Generation without Vector Quantization
Language: Python
#autoregressive_models #diffusion_models #image_generation #video_generation
Stars: 145 Issues: 1 Forks: 2
https://github.com/baaivision/NOVA
NOVA: Autoregressive Video Generation without Vector Quantization
Language: Python
#autoregressive_models #diffusion_models #image_generation #video_generation
Stars: 145 Issues: 1 Forks: 2
https://github.com/baaivision/NOVA
GitHub
GitHub - baaivision/NOVA: [ICLR 2025] Autoregressive Video Generation without Vector Quantization
[ICLR 2025] Autoregressive Video Generation without Vector Quantization - baaivision/NOVA
FoundationVision/FlashVideo
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
Language: Python
#efficient_generative_model #text_to_video #video_generation
Stars: 195 Issues: 5 Forks: 3
https://github.com/FoundationVision/FlashVideo
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
Language: Python
#efficient_generative_model #text_to_video #video_generation
Stars: 195 Issues: 5 Forks: 3
https://github.com/FoundationVision/FlashVideo
GitHub
GitHub - FoundationVision/FlashVideo: FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation - FoundationVision/FlashVideo
liuff19/Video-T1
Official Implementation of Video-T1: Test-Time Scaling for Video Generation
Language: Python
#aigc #chain_of_thought #test_time_scaling #video #video_generation
Stars: 187 Issues: 2 Forks: 12
https://github.com/liuff19/Video-T1
Official Implementation of Video-T1: Test-Time Scaling for Video Generation
Language: Python
#aigc #chain_of_thought #test_time_scaling #video #video_generation
Stars: 187 Issues: 2 Forks: 12
https://github.com/liuff19/Video-T1
GitHub
GitHub - liuff19/Video-T1: Official Implementation of Video-T1: Test-Time Scaling for Video Generation
Official Implementation of Video-T1: Test-Time Scaling for Video Generation - liuff19/Video-T1
hanyang-21/VideoScene
[CVPR 2025] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
Language: Python
#3d_reconstruction #video #video_generation
Stars: 154 Issues: 4 Forks: 3
https://github.com/hanyang-21/VideoScene
[CVPR 2025] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
Language: Python
#3d_reconstruction #video #video_generation
Stars: 154 Issues: 4 Forks: 3
https://github.com/hanyang-21/VideoScene
GitHub
GitHub - hanyang-21/VideoScene: [CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One…
[CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step - hanyang-21/VideoScene
ali-vilab/UniAnimate-DiT
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
Language: Python
#human_image_animation #video_diffusion_transformers #video_generation
Stars: 225 Issues: 5 Forks: 17
https://github.com/ali-vilab/UniAnimate-DiT
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
Language: Python
#human_image_animation #video_diffusion_transformers #video_generation
Stars: 225 Issues: 5 Forks: 17
https://github.com/ali-vilab/UniAnimate-DiT
GitHub
GitHub - ali-vilab/UniAnimate-DiT: UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer - ali-vilab/UniAnimate-DiT
SandAI-org/MAGI-1
MAGI-1: Autoregressive Video Generation at Scale
Language: Python
#autoregressive #diffusion_models #video_generation
Stars: 911 Issues: 7 Forks: 32
https://github.com/SandAI-org/MAGI-1
MAGI-1: Autoregressive Video Generation at Scale
Language: Python
#autoregressive #diffusion_models #video_generation
Stars: 911 Issues: 7 Forks: 32
https://github.com/SandAI-org/MAGI-1
GitHub
GitHub - SandAI-org/MAGI-1: MAGI-1: Autoregressive Video Generation at Scale
MAGI-1: Autoregressive Video Generation at Scale. Contribute to SandAI-org/MAGI-1 development by creating an account on GitHub.