#python #annotation #annotation_tool #annotations #boundingbox #computer_vision #computer_vision_annotation #dataset #deep_learning #image_annotation #image_classification #image_labeling #image_labelling_tool #imagenet #labeling #labeling_tool #object_detection #pytorch #semantic_segmentation #tensorflow #video_annotation
CVAT is a powerful tool for annotating videos and images, especially useful for computer vision projects. It helps developers and companies annotate data quickly and efficiently. You can use CVAT online for free or subscribe for more features like unlimited data and integrations with other tools. It also offers a self-hosted option with enterprise support. CVAT supports many annotation formats and has automatic labeling options to speed up your work. It's widely used by many teams worldwide, making it a reliable choice for your data annotation needs.
https://github.com/cvat-ai/cvat
CVAT is a powerful tool for annotating videos and images, especially useful for computer vision projects. It helps developers and companies annotate data quickly and efficiently. You can use CVAT online for free or subscribe for more features like unlimited data and integrations with other tools. It also offers a self-hosted option with enterprise support. CVAT supports many annotation formats and has automatic labeling options to speed up your work. It's widely used by many teams worldwide, making it a reliable choice for your data annotation needs.
https://github.com/cvat-ai/cvat
GitHub
GitHub - cvat-ai/cvat: Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams…
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale. - cvat-ai/cvat
#svelte #collaboration #downloader #javascript #music #reddit #social_media #soundcloud #svelte #tiktok #twitter #typescript #video #vimeo #vk #webapp #youtube #youtube_downloader
Cobalt is a simple and efficient media downloader without ads, trackers, or paywalls. You just paste the link and get the file quickly. It's easy to use and doesn't bother you with unnecessary things. Cobalt is free, publicly accessible, and does not support piracy. It also has a community Discord server and Twitter for support. Using Cobalt helps you save what you love easily and quickly.
https://github.com/imputnet/cobalt
Cobalt is a simple and efficient media downloader without ads, trackers, or paywalls. You just paste the link and get the file quickly. It's easy to use and doesn't bother you with unnecessary things. Cobalt is free, publicly accessible, and does not support piracy. It also has a community Discord server and Twitter for support. Using Cobalt helps you save what you love easily and quickly.
https://github.com/imputnet/cobalt
GitHub
GitHub - imputnet/cobalt: best way to save what you love
best way to save what you love. Contribute to imputnet/cobalt development by creating an account on GitHub.
❤1
#go #activitypub #broadcasting #chat #decentralized #federation #fediverse #golang #hacktoberfest #hls #live #livestream #owncast #rtmp #self_hosted #streaming_video #video
Owncast is a free, open-source tool that lets you stream your videos live and control everything yourself. You can use it with popular broadcasting software like OBS or Streamlabs. It gives you full ownership over your content, interface, and audience, which means you have more freedom and control. To get started, you can visit the quickstart guide or view a demo to see how it works. This way, you don't have to rely on big streaming services and can manage your streams independently.
https://github.com/owncast/owncast
Owncast is a free, open-source tool that lets you stream your videos live and control everything yourself. You can use it with popular broadcasting software like OBS or Streamlabs. It gives you full ownership over your content, interface, and audience, which means you have more freedom and control. To get started, you can visit the quickstart guide or view a demo to see how it works. This way, you don't have to rely on big streaming services and can manage your streams independently.
https://github.com/owncast/owncast
GitHub
GitHub - owncast/owncast: Take control over your live stream video by running it yourself. Streaming + chat out of the box.
Take control over your live stream video by running it yourself. Streaming + chat out of the box. - owncast/owncast
#javascript #freetube #privacy #subscriptions #video #videos #youtube
FreeTube is a free, open-source app that lets you watch YouTube videos without ads and helps keep your viewing private. It doesn't use Google's tracking cookies or JavaScript, so you can enjoy videos without being tracked by Google. Your data stays on your computer, not online. FreeTube works on Windows, Mac, and Linux, and you can subscribe to channels without needing an account. It also offers features like importing subscriptions and using external players. This makes it a good choice for people who want more privacy while watching YouTube videos.
https://github.com/FreeTubeApp/FreeTube
FreeTube is a free, open-source app that lets you watch YouTube videos without ads and helps keep your viewing private. It doesn't use Google's tracking cookies or JavaScript, so you can enjoy videos without being tracked by Google. Your data stays on your computer, not online. FreeTube works on Windows, Mac, and Linux, and you can subscribe to channels without needing an account. It also offers features like importing subscriptions and using external players. This makes it a good choice for people who want more privacy while watching YouTube videos.
https://github.com/FreeTubeApp/FreeTube
GitHub
GitHub - FreeTubeApp/FreeTube: An Open Source YouTube app for privacy
An Open Source YouTube app for privacy. Contribute to FreeTubeApp/FreeTube development by creating an account on GitHub.
👍1
#go #dubbing #localization #tts #video_transcription #video_translation
Krillin AI is a tool that helps translate and dub videos easily. It supports many languages and can automatically add subtitles, translate them, and even change the voice. This tool is useful for making videos ready for different platforms like YouTube or TikTok. It saves time by doing everything in just a few clicks, making it easy to share videos with people who speak different languages.
https://github.com/krillinai/KrillinAI
Krillin AI is a tool that helps translate and dub videos easily. It supports many languages and can automatically add subtitles, translate them, and even change the voice. This tool is useful for making videos ready for different platforms like YouTube or TikTok. It saves time by doing everything in just a few clicks, making it easy to share videos with people who speak different languages.
https://github.com/krillinai/KrillinAI
GitHub
GitHub - krillinai/KrillinAI: Video translation and dubbing tool powered by LLMs. The video translator offers 100 language translations…
Video translation and dubbing tool powered by LLMs. The video translator offers 100 language translations and one-click full-process deployment. The video translation output is optimized for platfo...
❤1
#python #face_animation #image_animation #video_editing #video_generation
LivePortrait is a tool that uses AI to animate still photos, making them look like videos. It works by identifying key facial features and adding realistic movements. This technology helps create lifelike videos that can be used for personalized communication. The benefit to users is that they can easily create engaging animated portraits from static images, which can be fun and useful for various applications like social media or storytelling.
https://github.com/KwaiVGI/LivePortrait
LivePortrait is a tool that uses AI to animate still photos, making them look like videos. It works by identifying key facial features and adding realistic movements. This technology helps create lifelike videos that can be used for personalized communication. The benefit to users is that they can easily create engaging animated portraits from static images, which can be fun and useful for various applications like social media or storytelling.
https://github.com/KwaiVGI/LivePortrait
GitHub
GitHub - KlingTeam/LivePortrait: Bring portraits to life!
Bring portraits to life! Contribute to KlingTeam/LivePortrait development by creating an account on GitHub.
#typescript #alternative #converter #data_manipulation #developer_tools #devtools #frontend #good_first_issue #image_manipulation #image_processing #javascript #pdf_manipulation #productivity #react #self_hosted #swissarmyknife #tools #typescript #video_manipulation #webapp #website
OmniTools is a self-hosted web app that helps with many tasks like image and video editing, number crunching, and more. It offers tools for resizing images, converting videos, calculating dates, and generating prime numbers. You can run it on your own computer using Docker, which means your data stays local. This app is open-source and free, allowing you to contribute new features or tools easily. Using OmniTools simplifies many everyday tasks and keeps your data private.
https://github.com/iib0011/omni-tools
OmniTools is a self-hosted web app that helps with many tasks like image and video editing, number crunching, and more. It offers tools for resizing images, converting videos, calculating dates, and generating prime numbers. You can run it on your own computer using Docker, which means your data stays local. This app is open-source and free, allowing you to contribute new features or tools easily. Using OmniTools simplifies many everyday tasks and keeps your data private.
https://github.com/iib0011/omni-tools
GitHub
GitHub - iib0011/omni-tools: Self-hosted collection of powerful web-based tools for everyday tasks. No ads, no tracking, just fast…
Self-hosted collection of powerful web-based tools for everyday tasks. No ads, no tracking, just fast, accessible utilities right from your browser! - iib0011/omni-tools
👍1
#rust #fpv #gopro #gpu #gpu_computing #gyroscope #insta360 #rolling_shutter_undistortion #rust #sony_alpha_cameras #stabilization #video #video_processing
Gyroflow is a powerful video stabilization software that uses gyroscope data from cameras like GoPro, Sony, and Insta360 to make your videos smooth and steady. It corrects lens distortion, rolling shutter effects, and can even level the horizon for a professional look. You can preview changes in real-time, use GPU acceleration for fast processing, and apply stabilization directly in popular video editors with plugins. It supports many video formats and works on Windows, Mac, Linux, Android, and iOS. Using Gyroflow helps you create high-quality, cinematic videos without bulky equipment or complicated setups[1][3][5].
https://github.com/gyroflow/gyroflow
Gyroflow is a powerful video stabilization software that uses gyroscope data from cameras like GoPro, Sony, and Insta360 to make your videos smooth and steady. It corrects lens distortion, rolling shutter effects, and can even level the horizon for a professional look. You can preview changes in real-time, use GPU acceleration for fast processing, and apply stabilization directly in popular video editors with plugins. It supports many video formats and works on Windows, Mac, Linux, Android, and iOS. Using Gyroflow helps you create high-quality, cinematic videos without bulky equipment or complicated setups[1][3][5].
https://github.com/gyroflow/gyroflow
GitHub
GitHub - gyroflow/gyroflow: Video stabilization using gyroscope data
Video stabilization using gyroscope data. Contribute to gyroflow/gyroflow development by creating an account on GitHub.
❤1
#python #ai #context #embedded #faiss #knowledge_base #knowledge_graph #llm #machine_learning #memory #nlp #offline_first #opencv #python #rag #retrieval_augmented_generation #semantic_search #vector_database #video_processing
Memvid lets you store millions of text pieces inside a single MP4 video file using QR codes, making your data 50-100 times smaller than usual databases. You can search this video instantly in under 100 milliseconds without needing servers or internet after setup. It works offline, is easy to use with simple Python code, and supports PDFs and chat with your data. The upcoming version 2 will add features like continuous memory updates, shareable capsules, fast local caching, and better video compression, making your AI memory smarter, faster, and more flexible. This means you get a powerful, portable, and efficient way to manage and search huge knowledge bases quickly and easily.
https://github.com/Olow304/memvid
Memvid lets you store millions of text pieces inside a single MP4 video file using QR codes, making your data 50-100 times smaller than usual databases. You can search this video instantly in under 100 milliseconds without needing servers or internet after setup. It works offline, is easy to use with simple Python code, and supports PDFs and chat with your data. The upcoming version 2 will add features like continuous memory updates, shareable capsules, fast local caching, and better video compression, making your AI memory smarter, faster, and more flexible. This means you get a powerful, portable, and efficient way to manage and search huge knowledge bases quickly and easily.
https://github.com/Olow304/memvid
GitHub
GitHub - memvid/memvid: Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer.…
Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval and long-term memory. - memvid/memvid
#javascript #appimage #compressor #downloader #electron #electron_app #ffmpeg #flatpak #javascript #linux #linux_app #macos #nodejs #snap #ubuntu #video #windows #youtube #youtube_dl #youtube_downloader #ytdownloader
You can use ytDownloader, a modern app that lets you download videos and audio from hundreds of sites like YouTube, Facebook, Instagram, TikTok, and Twitter. It works on Windows, macOS, and Linux, offers fast downloads, supports playlists, subtitles, and video compression with hardware acceleration, and has multiple themes. It’s free of ads and trackers, making it safe and easy to use. You can install it via various methods like Flatpak, Snap, or package managers on different systems. This helps you save videos for offline viewing, enjoy faster access without ads, and keep your favorite content anytime.
https://github.com/aandrew-me/ytDownloader
You can use ytDownloader, a modern app that lets you download videos and audio from hundreds of sites like YouTube, Facebook, Instagram, TikTok, and Twitter. It works on Windows, macOS, and Linux, offers fast downloads, supports playlists, subtitles, and video compression with hardware acceleration, and has multiple themes. It’s free of ads and trackers, making it safe and easy to use. You can install it via various methods like Flatpak, Snap, or package managers on different systems. This helps you save videos for offline viewing, enjoy faster access without ads, and keep your favorite content anytime.
https://github.com/aandrew-me/ytDownloader
GitHub
GitHub - aandrew-me/ytDownloader: Desktop App for downloading Videos and Audios from hundreds of sites
Desktop App for downloading Videos and Audios from hundreds of sites - aandrew-me/ytDownloader
🔥1
#python #audio_generation #diffusion #image_generation #inference #model_serving #multimodal #pytorch #transformer #video_generation
vLLM-Omni is a free, open-source tool that makes serving AI models for text, images, videos, and audio fast, easy, and cheap. It builds on vLLM for top speed using smart memory tricks, overlapping tasks, and flexible resource sharing across GPUs. You get 2x higher throughput, 35% less delay, and simple setup with Hugging Face models via OpenAI API—perfect for building quick multi-modal apps like chatbots or media generators without high costs.
https://github.com/vllm-project/vllm-omni
vLLM-Omni is a free, open-source tool that makes serving AI models for text, images, videos, and audio fast, easy, and cheap. It builds on vLLM for top speed using smart memory tricks, overlapping tasks, and flexible resource sharing across GPUs. You get 2x higher throughput, 35% less delay, and simple setup with Hugging Face models via OpenAI API—perfect for building quick multi-modal apps like chatbots or media generators without high costs.
https://github.com/vllm-project/vllm-omni
GitHub
GitHub - vllm-project/vllm-omni: A framework for efficient model inference with omni-modality models
A framework for efficient model inference with omni-modality models - vllm-project/vllm-omni
#python #auto_regressive_diffusion_model #diffusion_models #video_generation #wan_video
LightX2V is a fast, lightweight framework for generating videos from text or images, supporting models like HunyuanVideo-1.5 and Wan2.1/2.2 with up to 20x speedup via 4-step distillation, low VRAM use (8GB+), and features like offloading, quantization, and multi-GPU parallelism—outperforming rivals on H100/RTX 4090. You benefit by creating high-quality videos quickly on everyday hardware, saving time and costs for content creation, prototyping, or professional workflows, with easy Docker/ComfyUI setup and free online trials.
https://github.com/ModelTC/LightX2V
LightX2V is a fast, lightweight framework for generating videos from text or images, supporting models like HunyuanVideo-1.5 and Wan2.1/2.2 with up to 20x speedup via 4-step distillation, low VRAM use (8GB+), and features like offloading, quantization, and multi-GPU parallelism—outperforming rivals on H100/RTX 4090. You benefit by creating high-quality videos quickly on everyday hardware, saving time and costs for content creation, prototyping, or professional workflows, with easy Docker/ComfyUI setup and free online trials.
https://github.com/ModelTC/LightX2V
GitHub
GitHub - ModelTC/LightX2V: Light Image Video Generation Inference Framework
Light Image Video Generation Inference Framework. Contribute to ModelTC/LightX2V development by creating an account on GitHub.
#python #amd #anime #compression_artifact_reduction #deep_learning #directx_12 #gui_application #intel #manga #noise_reduction #nvidia #onnx #onnxruntime #opencv #python #python3 #pytorch #super_resolution #video #video_processing #windows
QualityScaler is a free Windows AI app that upscales, enhances, and denoises your images and videos with a simple drag-and-drop GUI. It supports formats like JPG, PNG, MP4, MKV; works offline on any DirectX12 GPU (4GB+ VRAM, 8GB RAM); and offers features like multi-GPU use, resize, interpolation, and stop/resume. Download from itch.io, Steam, or GitHub. Benefit: Quickly turn low-quality photos/videos into sharp HD masterpieces privately on your PC, saving time and money vs. online tools.
https://github.com/Djdefrag/QualityScaler
QualityScaler is a free Windows AI app that upscales, enhances, and denoises your images and videos with a simple drag-and-drop GUI. It supports formats like JPG, PNG, MP4, MKV; works offline on any DirectX12 GPU (4GB+ VRAM, 8GB RAM); and offers features like multi-GPU use, resize, interpolation, and stop/resume. Download from itch.io, Steam, or GitHub. Benefit: Quickly turn low-quality photos/videos into sharp HD masterpieces privately on your PC, saving time and money vs. online tools.
https://github.com/Djdefrag/QualityScaler
GitHub
GitHub - Djdefrag/QualityScaler: QualityScaler - image/video AI upscaler app
QualityScaler - image/video AI upscaler app. Contribute to Djdefrag/QualityScaler development by creating an account on GitHub.
#python #agentic_ai #agents #ai #ai_agents #realtime #stt #tts #video_agents #video_ai #vision_ai #voice_ai
Vision Agents is an open-source Python framework by Stream to build real-time AI agents that watch video, listen to audio, and respond instantly with low latency under 30ms. It integrates YOLO, Roboflow, OpenAI, Gemini, and 25+ tools for apps like golf coaching, security cameras detecting theft, or phone assistants. Install easily with `uv add vision-agents`, use free Stream credits, and deploy on any video network. You benefit by quickly creating smart video AI for gaming, safety, or coaching without vendor lock-in, saving time and costs on custom builds.
https://github.com/GetStream/Vision-Agents
Vision Agents is an open-source Python framework by Stream to build real-time AI agents that watch video, listen to audio, and respond instantly with low latency under 30ms. It integrates YOLO, Roboflow, OpenAI, Gemini, and 25+ tools for apps like golf coaching, security cameras detecting theft, or phone assistants. Install easily with `uv add vision-agents`, use free Stream credits, and deploy on any video network. You benefit by quickly creating smart video AI for gaming, safety, or coaching without vendor lock-in, saving time and costs on custom builds.
https://github.com/GetStream/Vision-Agents
GitHub
GitHub - GetStream/Vision-Agents: Open Vision Agents by Stream. Build Vision Agents quickly with any model or video provider. Uses…
Open Vision Agents by Stream. Build Vision Agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency. - GetStream/Vision-Agents
#java #4k #android #bandcamp #download_videos #newpipe #peertube #soundcloud #translation #video #watch #youtube_video
NewPipe is a free, open-source Android app for ad-free streaming and downloading videos/audio from YouTube, SoundCloud, PeerTube and more, with background play, pop-up mode, subscriptions without accounts, and no Google tracking for full privacy. The team is rewriting the code for a modern, stable version—download nightly builds to try new features early. This benefits you by saving data/battery, enabling offline/multitasking use, and protecting your data on any device.
https://github.com/TeamNewPipe/NewPipe
NewPipe is a free, open-source Android app for ad-free streaming and downloading videos/audio from YouTube, SoundCloud, PeerTube and more, with background play, pop-up mode, subscriptions without accounts, and no Google tracking for full privacy. The team is rewriting the code for a modern, stable version—download nightly builds to try new features early. This benefits you by saving data/battery, enabling offline/multitasking use, and protecting your data on any device.
https://github.com/TeamNewPipe/NewPipe
GitHub
GitHub - TeamNewPipe/NewPipe: A libre lightweight streaming front-end for Android.
A libre lightweight streaming front-end for Android. - TeamNewPipe/NewPipe
#python #audio_driven_talking_face #dance_generation #end_to_end_filming #long_video_generation #video_diffusion_transformers
Stable Video Infinity (SVI) lets you create videos of any length—from seconds to minutes or infinite—starting from one image and text prompts, with no quality loss, drift, or repetition. It uses error-recycling to fix mistakes automatically, supporting stories, cartoons like Tom & Jerry (up to 10+ minutes), talking heads with audio, dancing from skeletons, and multi-scene films. Everything is open-source with models, scripts, datasets, and ComfyUI workflows on Hugging Face and GitHub—you can download, run demos, or train your own easily. This saves you time and effort for pro filmmaking or fun animations without limits.
https://github.com/vita-epfl/Stable-Video-Infinity
Stable Video Infinity (SVI) lets you create videos of any length—from seconds to minutes or infinite—starting from one image and text prompts, with no quality loss, drift, or repetition. It uses error-recycling to fix mistakes automatically, supporting stories, cartoons like Tom & Jerry (up to 10+ minutes), talking heads with audio, dancing from skeletons, and multi-scene films. Everything is open-source with models, scripts, datasets, and ComfyUI workflows on Hugging Face and GitHub—you can download, run demos, or train your own easily. This saves you time and effort for pro filmmaking or fun animations without limits.
https://github.com/vita-epfl/Stable-Video-Infinity
GitHub
GitHub - vita-epfl/Stable-Video-Infinity: [ICLR 26] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
[ICLR 26] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling - vita-epfl/Stable-Video-Infinity