cvg/glue-factory
Training library for local feature detection and matching
Language: Python
#computer_vision #deep_learning #iccv2023 #image_matching
Stars: 200 Issues: 1 Forks: 13
https://github.com/cvg/glue-factory
  
  hustvl/GaussianDreamer
GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors
Language: Python
#aigc #computer_vision #diffusion_models #dreamfusion #gaussian_splatting #nerf #radiance_field #text_to_3d
Stars: 134 Issues: 0 Forks: 1
https://github.com/hustvl/GaussianDreamer
  
  [CVPR 2024] GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models - hustvl/GaussianDreamer
  hkchengrex/Cutie
[arXiv 2023] Putting the Object Back Into Video Object Segmentation
Language: Python
#computer_vision #deep_learning #pytorch #segmentation #video_editing #video_object_segmentation #video_segmentation
Stars: 123 Issues: 1 Forks: 12
https://github.com/hkchengrex/Cutie
  
  [CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation - hkchengrex/Cutie
  lxe/llavavision
A simple "Be My Eyes" web app with a llama.cpp/llava backend
Language: JavaScript
#ai #artificial_intelligence #computer_vision #llama #llamacpp #llm #local_llm #machine_learning #multimodal #webapp
Stars: 284 Issues: 0 Forks: 7
https://github.com/lxe/llavavision
  
  roboflow/awesome-openai-vision-api-experiments
Examples showing how to use the OpenAI vision API to run inference on images, video files and webcam streams
Language: Python
#chatgpt #computer_vision #openai
Stars: 439 Issues: 1 Forks: 20
https://github.com/roboflow/awesome-openai-vision-api-experiments
  
  Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥 - roboflow/awesome-openai-vision-api-experiments
  spla-tam/SplaTAM
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM
Language: Python
#computer_vision #gaussian_splatting #robotics #slam
Stars: 270 Issues: 2 Forks: 20
https://github.com/spla-tam/SplaTAM
  
  SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM (CVPR 2024) - spla-tam/SplaTAM
  3DTopia/OpenLRM
An open-source impl. of Large Reconstruction Models
Language: Python
#3d #aigc #computer_vision #generation
Stars: 165 Issues: 2 Forks: 2
https://github.com/3DTopia/OpenLRM
  
  robertknight/ocrs
A modern OCR engine (extracts text from images), written in Rust
Language: Rust
#computer_vision #machine_learning #ocr
Stars: 220 Issues: 3 Forks: 4
https://github.com/robertknight/ocrs
  
  Rust library and CLI tool for OCR (extracting text from images) - robertknight/ocrs
  VikParuchuri/surya
Multilingual document OCR models for text detection and recognition
Language: Python
#ai #computer
Stars: 406 Issues: 0 Forks: 16
https://github.com/VikParuchuri/surya
  
  OCR, layout analysis, reading order, table recognition in 90+ languages - datalab-to/surya
  muskie82/MonoGS
[CVPR'24] Gaussian Splatting SLAM
Language: Python
#computer_vision #gaussian_splatting #robotics #slam
Stars: 362 Issues: 4 Forks: 19
https://github.com/muskie82/MonoGS
  
  [CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM - muskie82/MonoGS
  nnanhuang/S3Gaussian
Official Implementation of Self-Supervised Street Gaussians for Autonomous Driving
Language: Python
#3d #3dgs #autonomous_driving #computer_vision #driving #dynamic_scene #gaussian #neural_network #neural_rendering #unsupervised_segmentation
Stars: 201 Issues: 2 Forks: 10
https://github.com/nnanhuang/S3Gaussian
  
  suitedaces/computer-agent
Desktop app powered by Claude’s computer use capability to control your computer
Language: Python
#ai #ai_tools #anthropic #claude_3_5_sonnet #computer_use #gui #pyqt #pyqt6 #python
Stars: 174 Issues: 3 Forks: 11
https://github.com/suitedaces/computer-agent
  
  bytedance/UI-TARS-desktop
A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.
Language: TypeScript
#agent #browser_use #computer_use #electron #gui_agents #vision #vite #vlm
Stars: 505 Issues: 8 Forks: 35
https://github.com/bytedance/UI-TARS-desktop
  
  The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra - bytedance/UI-TARS-desktop
  gszfwsb/NCFM
Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function" (NCFM) in CVPR 2025.
Language: Python
#computer_vision #data_centric_ai #dataset_distillation #synthetic_data
Stars: 268 Issues: 2 Forks: 15
https://github.com/gszfwsb/NCFM
  
  Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function: A Minmax Perspective" (NCFM) in CVPR 2025 (Highlight). - gszfwsb/NCFM
  roboflow/rf-detr
RF-DETR is a real-time object detection model architecture developed by Roboflow, released under the Apache 2.0 license.
Language: Python
#computer_vision #detr #machine_learning #object_detection #rf_detr
Stars: 292 Issues: 3 Forks: 19
https://github.com/roboflow/rf-detr
  
  RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO and designed for fine-tuning. - roboflow/rf-detr
  ByteDance-Seed/TraceAnything
Trace Anything: Representing Any Video in 4D via Trajectory Fields
Language: Python
#3d_reconstruction #4d_reconstruction #computer_vision
Stars: 205 Issues: 0 Forks: 2
https://github.com/ByteDance-Seed/TraceAnything
  
  lightly-ai/lightly-studio
Curate, Annotate, and Manage Your Data in LightlyStudio.
Language: Python
#computer_vision #image_labeling #mlops
Stars: 395 Issues: 6 Forks: 7
https://github.com/lightly-ai/lightly-studio
  
  