cvat-ai/cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Language: Open Policy Agent
#annotation #annotation_tool #annotations #bounding_box #computer_vision #computer_vision_annotation #dataset #deep_learning #image_annotation #image_classification #image_labeling #image_labeling_tool #imagenet #labeling #labeling_tool #semantic_segmentation #video_annotation #yolo
Stars: 99 Issues: 14 Forks: 4
https://github.com/cvat-ai/cvat
  
👍3🔥3
  b7leung/MLE-Flashcards
200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and computer science.
#ai #artificial_intelligence #computer_science #computer_vision #flashcards #interview #interview_preparation #machine_learning #review
Stars: 121 Issues: 1 Forks: 9
https://github.com/b7leung/MLE-Flashcards
  
❤1
  clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Language: Python
#computer_vision #document_ai #eccv_2022 #multimodal_pre_trained_model #nlp #ocr
Stars: 98 Issues: 2 Forks: 5
https://github.com/clovaai/donut
  
❤1
  roboflow-ai/notebooks
Set of Jupyter Notebooks linked to Roboflow Blogpost and used in our YouTube videos.
Language: Jupyter Notebook
#computer_vision #deep_learning #deep_neural_networks #image_classification #image_segmentation #object_detection #pytorch #tutorial #yolov5 #yolov6 #yolov7
Stars: 126 Issues: 1 Forks: 14
https://github.com/roboflow-ai/notebooks
  
  A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM ...
  SkalskiP/courses
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Language: Python
#computer_vision #deep_learning #deep_neural_networks #machine_learning #mlops #multimodal #natural_language_processing #nlp #transformers #tutorial
Stars: 323 Issues: 0 Forks: 29
https://github.com/SkalskiP/courses
  
👍1
  kevmo314/magic-copy
Magic Copy is a Chrome extension that uses Meta's Segment Anything Model to extract a foreground object from an image and copy it to the clipboard.
Language: TypeScript
#chrome_extension #computer_vision #image_processing
Stars: 375 Issues: 2 Forks: 24
https://github.com/kevmo314/magic-copy
  
  Jumpat/SegmentAnythingin3D
Segment Anything in 3D with NeRFs
#3d #3d_segmentation #computer_vision #nerf #segment_anything #segmentation
Stars: 224 Issues: 2 Forks: 7
https://github.com/Jumpat/SegmentAnythingin3D
  
  Segment Anything in 3D with NeRFs (NeurIPS 2023 & IJCV 2025) - Jumpat/SegmentAnythingin3D
  X-PLUG/mPLUG-Owl
mPLUG-Owl🦉: Modularization Empowers Large Language Models with Multimodality
Language: Python
#alpaca #chatbot #chatgpt #computer_vision #damo #gpt #gpt4 #gpt4_api #huggingface #instruction_tuning #large_language_models #llama #mplug #mplug_owl #multimodal #pretraining #pytorch #transformer #visual_reasoning #visual_recognition
Stars: 209 Issues: 1 Forks: 9
https://github.com/X-PLUG/mPLUG-Owl
  
  mPLUG-Owl: The Powerful Multi-modal Large Language Model Family - X-PLUG/mPLUG-Owl
  nv-tlabs/NKSR
[CVPR 2023 Highlight] Neural Kernel Surface Reconstruction
Language: Python
#3d_reconstruction #computer_vision #graphics #neural_kernel #point_cloud
Stars: 250 Issues: 7 Forks: 6
https://github.com/nv-tlabs/NKSR
  
👎1
  graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Language: Python
#computer_graphics #computer_vision #radiance_field
Stars: 348 Issues: 4 Forks: 14
https://github.com/graphdeco-inria/gaussian-splatting
  
👍1
  cvg/glue-factory
Training library for local feature detection and matching
Language: Python
#computer_vision #deep_learning #iccv2023 #image_matching
Stars: 200 Issues: 1 Forks: 13
https://github.com/cvg/glue-factory
  
  hustvl/GaussianDreamer
GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors
Language: Python
#aigc #computer_vision #diffusion_models #dreamfusion #gaussian_splatting #nerf #radiance_field #text_to_3d
Stars: 134 Issues: 0 Forks: 1
https://github.com/hustvl/GaussianDreamer
  
  [CVPR 2024] GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models - hustvl/GaussianDreamer
👍2
  hkchengrex/Cutie
[arXiv 2023] Putting the Object Back Into Video Object Segmentation
Language: Python
#computer_vision #deep_learning #pytorch #segmentation #video_editing #video_object_segmentation #video_segmentation
Stars: 123 Issues: 1 Forks: 12
https://github.com/hkchengrex/Cutie
  
  [CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation - hkchengrex/Cutie
  lxe/llavavision
A simple "Be My Eyes" web app with a llama.cpp/llava backend
Language: JavaScript
#ai #artificial_intelligence #computer_vision #llama #llamacpp #llm #local_llm #machine_learning #multimodal #webapp
Stars: 284 Issues: 0 Forks: 7
https://github.com/lxe/llavavision
  
  roboflow/awesome-openai-vision-api-experiments
Examples showing how to use the OpenAI vision API to run inference on images, video files and webcam streams
Language: Python
#chatgpt #computer_vision #openai
Stars: 439 Issues: 1 Forks: 20
https://github.com/roboflow/awesome-openai-vision-api-experiments
  
  Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥 - roboflow/awesome-openai-vision-api-experiments
❤1
  spla-tam/SplaTAM
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM
Language: Python
#computer_vision #gaussian_splatting #robotics #slam
Stars: 270 Issues: 2 Forks: 20
https://github.com/spla-tam/SplaTAM
  
  SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM (CVPR 2024) - spla-tam/SplaTAM
  3DTopia/OpenLRM
An open-source impl. of Large Reconstruction Models
Language: Python
#3d #aigc #computer_vision #generation
Stars: 165 Issues: 2 Forks: 2
https://github.com/3DTopia/OpenLRM
  
🔥1
  robertknight/ocrs
A modern OCR engine (extracts text from images), written in Rust
Language: Rust
#computer_vision #machine_learning #ocr
Stars: 220 Issues: 3 Forks: 4
https://github.com/robertknight/ocrs
  
  Rust library and CLI tool for OCR (extracting text from images) - robertknight/ocrs
🥰1👏1
  muskie82/MonoGS
[CVPR'24] Gaussian Splatting SLAM
Language: Python
#computer_vision #gaussian_splatting #robotics #slam
Stars: 362 Issues: 4 Forks: 19
https://github.com/muskie82/MonoGS
  
  [CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM - muskie82/MonoGS
❤1👍1
  nnanhuang/S3Gaussian
Official Implementation of Self-Supervised Street Gaussians for Autonomous Driving
Language: Python
#3d #3dgs #autonomous_driving #computer_vision #driving #dynamic_scene #gaussian #neural_network #neural_rendering #unsupervised_segmentation
Stars: 201 Issues: 2 Forks: 10
https://github.com/nnanhuang/S3Gaussian
  
  