Olow304/memvid
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
Language: Python
#ai #context #embedded #faiss #knowledge_base #knowledge_graph #llm #machine_learning #memory #nlp #offline_first #opencv #python #rag #retrieval_augmented_generation #semantic_search #vector_database #video_processing
Stars: 252 Issues: 2 Forks: 25
https://github.com/Olow304/memvid
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
Language: Python
#ai #context #embedded #faiss #knowledge_base #knowledge_graph #llm #machine_learning #memory #nlp #offline_first #opencv #python #rag #retrieval_augmented_generation #semantic_search #vector_database #video_processing
Stars: 252 Issues: 2 Forks: 25
https://github.com/Olow304/memvid
GitHub
GitHub - memvid/memvid: Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer.…
Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval and long-term memory. - memvid/memvid
THUDM/GLM-4.1V-Thinking
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.
Language: Python
#image2text #reasoning #video_understanding #vlm
Stars: 449 Issues: 9 Forks: 8
https://github.com/THUDM/GLM-4.1V-Thinking
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.
Language: Python
#image2text #reasoning #video_understanding #vlm
Stars: 449 Issues: 9 Forks: 8
https://github.com/THUDM/GLM-4.1V-Thinking
GitHub
GitHub - zai-org/GLM-V: GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning - zai-org/GLM-V
❤1
liuff19/LangScene-X
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
Language: Python
#3d_reconstruction #diffusion #unified_model #video_generation
Stars: 197 Issues: 1 Forks: 12
https://github.com/liuff19/LangScene-X
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
Language: Python
#3d_reconstruction #diffusion #unified_model #video_generation
Stars: 197 Issues: 1 Forks: 12
https://github.com/liuff19/LangScene-X
GitHub
GitHub - liuff19/LangScene-X: [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video…
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion - liuff19/LangScene-X
Wan-Video/Wan2.2
Wan: Open and Advanced Large-Scale Video Generative Models
Language: Python
#aigc #video_generation
Stars: 1285 Issues: 21 Forks: 26
https://github.com/Wan-Video/Wan2.2
Wan: Open and Advanced Large-Scale Video Generative Models
Language: Python
#aigc #video_generation
Stars: 1285 Issues: 21 Forks: 26
https://github.com/Wan-Video/Wan2.2
GitHub
GitHub - Wan-Video/Wan2.2: Wan: Open and Advanced Large-Scale Video Generative Models
Wan: Open and Advanced Large-Scale Video Generative Models - Wan-Video/Wan2.2
SkyworkAI/Matrix-3D
Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.
Language: Python
#3d_generation #3d_reconstruction #3d_scene_generation #aigc #aigc3d #genie #genie3 #graphics #image_to_3d #image_to_video #panorama_synthesis #scene_generation #text_to_3d #text_to_video #video_generation #world_models
Stars: 284 Issues: 7 Forks: 14
https://github.com/SkyworkAI/Matrix-3D
Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.
Language: Python
#3d_generation #3d_reconstruction #3d_scene_generation #aigc #aigc3d #genie #genie3 #graphics #image_to_3d #image_to_video #panorama_synthesis #scene_generation #text_to_3d #text_to_video #video_generation #world_models
Stars: 284 Issues: 7 Forks: 14
https://github.com/SkyworkAI/Matrix-3D
GitHub
GitHub - SkyworkAI/Matrix-3D: Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or…
Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt. - SkyworkAI/Matrix-3D
showlab/Code2Video
Video generation via code
Language: Python
#coding #multi_agent #video_generation
Stars: 256 Issues: 0 Forks: 31
https://github.com/showlab/Code2Video
Video generation via code
Language: Python
#coding #multi_agent #video_generation
Stars: 256 Issues: 0 Forks: 31
https://github.com/showlab/Code2Video
GitHub
GitHub - showlab/Code2Video: Video generation via code
Video generation via code. Contribute to showlab/Code2Video development by creating an account on GitHub.
OpenImagingLab/FlashVSR
Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional decoder.
Language: Python
#diffusion_models #video_super_resolution
Stars: 218 Issues: 5 Forks: 4
https://github.com/OpenImagingLab/FlashVSR
Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional decoder.
Language: Python
#diffusion_models #video_super_resolution
Stars: 218 Issues: 5 Forks: 4
https://github.com/OpenImagingLab/FlashVSR
GitHub
GitHub - OpenImagingLab/FlashVSR: [CVPR 2026] Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient…
[CVPR 2026] Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny co...
❤1
EzioBy/Ditto
[Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
Language: Python
#diffusion_models #synthetic_data #video_editing
Stars: 333 Issues: 7 Forks: 28
https://github.com/EzioBy/Ditto
[Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
Language: Python
#diffusion_models #synthetic_data #video_editing
Stars: 333 Issues: 7 Forks: 28
https://github.com/EzioBy/Ditto
GitHub
GitHub - EzioBy/Ditto: [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
[Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset - EzioBy/Ditto
Tencent-Hunyuan/HunyuanVideo-1.5
HunyuanVideo-1.5: A leading lightweight video generation model
Language: Python
#image_to_video #text_to_video #video_generation
Stars: 360 Issues: 5 Forks: 17
https://github.com/Tencent-Hunyuan/HunyuanVideo-1.5
HunyuanVideo-1.5: A leading lightweight video generation model
Language: Python
#image_to_video #text_to_video #video_generation
Stars: 360 Issues: 5 Forks: 17
https://github.com/Tencent-Hunyuan/HunyuanVideo-1.5
GitHub
GitHub - Tencent-Hunyuan/HunyuanVideo-1.5: HunyuanVideo-1.5: A leading lightweight video generation model
HunyuanVideo-1.5: A leading lightweight video generation model - Tencent-Hunyuan/HunyuanVideo-1.5
classicshi/sora-api
This project is designed to simplify the process of creating Sora 2 AI videos using the API and a web interface.
Language: Python
#ai #artifical_intelligence #sora_ai #sora_api #sora2 #video_generation
Stars: 267 Issues: 1 Forks: 72
https://github.com/classicshi/sora-api
This project is designed to simplify the process of creating Sora 2 AI videos using the API and a web interface.
Language: Python
#ai #artifical_intelligence #sora_ai #sora_api #sora2 #video_generation
Stars: 267 Issues: 1 Forks: 72
https://github.com/classicshi/sora-api
GitHub
GitHub - classicshi/sora-api: This project is designed to simplify the process of creating Sora 2 AI videos using the API and a…
This project is designed to simplify the process of creating Sora 2 AI videos using the API and a web interface. - classicshi/sora-api
Robbyant/lingbot-world
Advancing Open-source World Models
Language: Python
#aigc #image_to_video #lingbot_world #video_generation #world_models
Stars: 971 Issues: 11 Forks: 35
https://github.com/Robbyant/lingbot-world
Advancing Open-source World Models
Language: Python
#aigc #image_to_video #lingbot_world #video_generation #world_models
Stars: 971 Issues: 11 Forks: 35
https://github.com/Robbyant/lingbot-world
GitHub
GitHub - Robbyant/lingbot-world: Advancing Open-source World Models
Advancing Open-source World Models. Contribute to Robbyant/lingbot-world development by creating an account on GitHub.
OpenMOSS/MOVA
MOVA: Towards Scalable and Synchronized Video–Audio Generation
Language: Python
#diffusion_models #multimodal #sglang #video_audio_generation
Stars: 397 Issues: 7 Forks: 24
https://github.com/OpenMOSS/MOVA
MOVA: Towards Scalable and Synchronized Video–Audio Generation
Language: Python
#diffusion_models #multimodal #sglang #video_audio_generation
Stars: 397 Issues: 7 Forks: 24
https://github.com/OpenMOSS/MOVA
GitHub
GitHub - OpenMOSS/MOVA: MOVA: Towards Scalable and Synchronized Video–Audio Generation
MOVA: Towards Scalable and Synchronized Video–Audio Generation - OpenMOSS/MOVA
❤1
PKU-YuanGroup/Helios
Helios: Real Real-Time Long Video Generation Model
Language: Python
#acceleration #diffusion #diffusion_model #diffusion_models #efficient_tuning #high__quality #image_to_video #image2video #interactive #long_context #long_video_generation #real_time #text_to_video #text2video #video_generation #video_generator #video_to_video #video2video #world_model #world_models
Stars: 712 Issues: 5 Forks: 46
https://github.com/PKU-YuanGroup/Helios
Helios: Real Real-Time Long Video Generation Model
Language: Python
#acceleration #diffusion #diffusion_model #diffusion_models #efficient_tuning #high__quality #image_to_video #image2video #interactive #long_context #long_video_generation #real_time #text_to_video #text2video #video_generation #video_generator #video_to_video #video2video #world_model #world_models
Stars: 712 Issues: 5 Forks: 46
https://github.com/PKU-YuanGroup/Helios
GitHub
GitHub - PKU-YuanGroup/Helios: Helios: Real Real-Time Long Video Generation Model
Helios: Real Real-Time Long Video Generation Model - PKU-YuanGroup/Helios