This media is not supported in your browser
VIEW IN TELEGRAM
π‘4K4D: Real-Time 4D at 4Kπ‘
πTHE new SOTA in view synthesis of dynamic 3D scenes at 4K. 30x faster, up to 400 FPS. Nuts!
πReview https://t.ly/6ddQh
πPaper arxiv.org/pdf/2310.11448.pdf
πProject zju3dv.github.io/4k4d/
πCode github.com/zju3dv/4K4D
πTHE new SOTA in view synthesis of dynamic 3D scenes at 4K. 30x faster, up to 400 FPS. Nuts!
πReview https://t.ly/6ddQh
πPaper arxiv.org/pdf/2310.11448.pdf
πProject zju3dv.github.io/4k4d/
πCode github.com/zju3dv/4K4D
π₯8π5π€―5β€1π±1π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
π£οΈ Holistic Parking Detection (YOLO) π£οΈ
π One-step Holistic Parking Slot Network: a tailor-made adaptation of YOLOv4 algorithm for all-shaped parking slot detection
πReview https://t.ly/2l4ZG
πPaper arxiv.org/pdf/2310.11629.pdf
π One-step Holistic Parking Slot Network: a tailor-made adaptation of YOLOv4 algorithm for all-shaped parking slot detection
πReview https://t.ly/2l4ZG
πPaper arxiv.org/pdf/2310.11629.pdf
π₯8π€―6β€4π€©3π1πΎ1
This media is not supported in your browser
VIEW IN TELEGRAM
π Cutie: VOS with heavy occlusionsπ
πCutie: novel VOS for challenging scenarios with heavy occlusions & distractors
πReview https://t.ly/W3FR-
πPaper arxiv.org/pdf/2310.12982.pdf
πProject https://hkchengrex.com/Cutie
πCode https://github.com/hkchengrex/Cutie
πCutie: novel VOS for challenging scenarios with heavy occlusions & distractors
πReview https://t.ly/W3FR-
πPaper arxiv.org/pdf/2310.12982.pdf
πProject https://hkchengrex.com/Cutie
πCode https://github.com/hkchengrex/Cutie
π13π€£3β€1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π§‘ Rotoscoping Prince Of Persia (1985) π§‘
π A rare footage for the animation of Prince of Persia (1989). Damn Romantic.
π More https://t.ly/xJife
π A rare footage for the animation of Prince of Persia (1989). Damn Romantic.
π More https://t.ly/xJife
β€17π2π2π₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
πͺPACE: new SOTA Motionπͺ
π#Nvidia unveils the novel SOTA to estimate the human motion in a global scene from moving cams. Stunning results.
πReview https://t.ly/20you
πProject https://nvlabs.github.io/PACE
πPaper https://arxiv.org/pdf/2310.13768.pdf
π#Nvidia unveils the novel SOTA to estimate the human motion in a global scene from moving cams. Stunning results.
πReview https://t.ly/20you
πProject https://nvlabs.github.io/PACE
πPaper https://arxiv.org/pdf/2310.13768.pdf
π€£5β€4π₯1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯€NanoSAM: SAM on low-cost boardsπ₯€
πNanoSAM is a Segment Anything variant capable of running in real-time on #NVIDIA Jetson Orin with TensorRT
πReview https://t.ly/UErq_
πTutorial https://github.com/NVIDIA-AI-IOT/nanosam
πNanoSAM is a Segment Anything variant capable of running in real-time on #NVIDIA Jetson Orin with TensorRT
πReview https://t.ly/UErq_
πTutorial https://github.com/NVIDIA-AI-IOT/nanosam
π₯11π1π1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π§ SOTA RGB-D Video Salient Object π§
π DCTNet+ (model) and RDVS(dataset) for a new SOTA in Video Saliency Object Detection
πReview https://t.ly/DapLV
πCode github.com/kerenfu/RDVS
πPaper arxiv.org/pdf/2310.15482.pdf
π DCTNet+ (model) and RDVS(dataset) for a new SOTA in Video Saliency Object Detection
πReview https://t.ly/DapLV
πCode github.com/kerenfu/RDVS
πPaper arxiv.org/pdf/2310.15482.pdf
π₯4π1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
βοΈ Relighted 3D Hands π€
π#META unveils Re:InterHand: a large dataset of relighted 3D interacting hands
πReview https://t.ly/I1dQk
πPaper arxiv.org/pdf/2310.17768.pdf
πProject mks0601.github.io/ReInterHand
πData github.com/mks0601/ReInterHand
π#META unveils Re:InterHand: a large dataset of relighted 3D interacting hands
πReview https://t.ly/I1dQk
πPaper arxiv.org/pdf/2310.17768.pdf
πProject mks0601.github.io/ReInterHand
πData github.com/mks0601/ReInterHand
π€―8β€1π±1
This media is not supported in your browser
VIEW IN TELEGRAM
π Video Understanding with GPT-4V(ision) π
π #Microsoft unveils MM-Vid, the most advanced video understanding framework (w/ #chatgpt4). Impressive results on long-form videos & intricate tasks such as audio description & multimodal high-level comprehension
πReview https://t.ly/RISMm
πPaper arxiv.org/pdf/2310.19773.pdf
πProject https://multimodal-vid.github.io
π #Microsoft unveils MM-Vid, the most advanced video understanding framework (w/ #chatgpt4). Impressive results on long-form videos & intricate tasks such as audio description & multimodal high-level comprehension
πReview https://t.ly/RISMm
πPaper arxiv.org/pdf/2310.19773.pdf
πProject https://multimodal-vid.github.io
π€―22π9π₯2π1π±1
This media is not supported in your browser
VIEW IN TELEGRAM
π£ Foot via Synthetic Data π£
π 50,000 synthetic/photorealistic foot images + a novel SOTA library for foot
πReview https://t.ly/TVanP
πPaper https://arxiv.org/pdf/2310.18279.pdf
πProject https://ollieboyne.github.io/FOUND
πCode https://github.com/OllieBoyne/FOUND
π 50,000 synthetic/photorealistic foot images + a novel SOTA library for foot
πReview https://t.ly/TVanP
πPaper https://arxiv.org/pdf/2310.18279.pdf
πProject https://ollieboyne.github.io/FOUND
πCode https://github.com/OllieBoyne/FOUND
π€£8π4β€2π₯°2π€©2
This media is not supported in your browser
VIEW IN TELEGRAM
π OYSTER: unsupervised detection w/ LIDAR π
πWaabi unveils OYSTER: a novel unsupervised object detection from LiDAR point clouds.
πReview https://t.ly/EMi58
πProject https://waabi.ai/oyster/
πPaper arxiv.org/pdf/2311.02007.pdf
πWaabi unveils OYSTER: a novel unsupervised object detection from LiDAR point clouds.
πReview https://t.ly/EMi58
πProject https://waabi.ai/oyster/
πPaper arxiv.org/pdf/2311.02007.pdf
β€16π3π₯2π1
π₯GPT-4 Pass the Turing Test?π₯
πNo. I mean...not yet. Read this Paper from UC San Diegoπ
πReview https://t.ly/o8HgM
πPaper https://arxiv.org/pdf/2310.20216.pdf
πNo. I mean...not yet. Read this Paper from UC San Diegoπ
πReview https://t.ly/o8HgM
πPaper https://arxiv.org/pdf/2310.20216.pdf
β€4π₯3π1π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯»SF: Towards Virtual Clothπ₯»
πSEA AI Lab unveils a novel #AI to recovery the garment sewing patterns from daily photos for #AR / #VR worlds
πReview https://t.ly/MwpAV
πProject https://sewformer.github.io/
πPaper https://arxiv.org/pdf/2311.04218.pdf
πCode https://github.com/sail-sg/sewformer
πSEA AI Lab unveils a novel #AI to recovery the garment sewing patterns from daily photos for #AR / #VR worlds
πReview https://t.ly/MwpAV
πProject https://sewformer.github.io/
πPaper https://arxiv.org/pdf/2311.04218.pdf
πCode https://github.com/sail-sg/sewformer
π4π₯2π₯°2π2π€―1π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
ποΈ 3DiffTection: new SOTA 3D detection ποΈ
π#Nvidia unveils 3DiffTection, the new SOTA for 3D object detection from single images. A powerful 3D detector powered by diffusion model
πReview https://t.ly/PciXY
πPaper https://arxiv.org/pdf/2311.04391.pdf
πCode https://github.com/nv-tlabs/3DiffTection
πProject research.nvidia.com/labs/toronto-ai/3difftection
π#Nvidia unveils 3DiffTection, the new SOTA for 3D object detection from single images. A powerful 3D detector powered by diffusion model
πReview https://t.ly/PciXY
πPaper https://arxiv.org/pdf/2311.04391.pdf
πCode https://github.com/nv-tlabs/3DiffTection
πProject research.nvidia.com/labs/toronto-ai/3difftection
π₯8β€6π3π±3π1
This media is not supported in your browser
VIEW IN TELEGRAM
πͺ 30x Faster Neural Scenes πͺ
π NeuRas: realistic real-time novel-view synthesis of VERY large scenes (>10000 m2 ). 30Γ faster rendering than previous SOTA w/ comparable or better realism
πReview https://t.ly/ELJSE
πPaper https://arxiv.org/pdf/2311.05607.pdf
πProject https://waabi.ai/NeuRas/
π NeuRas: realistic real-time novel-view synthesis of VERY large scenes (>10000 m2 ). 30Γ faster rendering than previous SOTA w/ comparable or better realism
πReview https://t.ly/ELJSE
πPaper https://arxiv.org/pdf/2311.05607.pdf
πProject https://waabi.ai/NeuRas/
π₯9β€1π1π€―1π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯ Hu.ma.ne #AI Pin is out! π₯
πHu.ma.ne just launched #AI Pin: the new standalone AI-powered screenless device. Running on the GPT-4 LLMs, suitable for real-time translation. #AI-powered camera and laser projector
π More https://t.ly/IvoN7
πHu.ma.ne just launched #AI Pin: the new standalone AI-powered screenless device. Running on the GPT-4 LLMs, suitable for real-time translation. #AI-powered camera and laser projector
π More https://t.ly/IvoN7
β€6π₯4π©2π1π±1
This media is not supported in your browser
VIEW IN TELEGRAM
π« Segmentation of Human π«
πTotalSegmentator_v2: segmenting 104 anatomical structures (27 organs, 59 bones, 10 muscles, 8 vessels) in CT. Now suitable in 3D Slicer, open source platform for image visualization.
πReview https://t.ly/yHMm1
πCode https://lnkd.in/dvgrbsCE
πPaper https://lnkd.in/dkwHuuzU
πTotalSegmentator_v2: segmenting 104 anatomical structures (27 organs, 59 bones, 10 muscles, 8 vessels) in CT. Now suitable in 3D Slicer, open source platform for image visualization.
πReview https://t.ly/yHMm1
πCode https://lnkd.in/dvgrbsCE
πPaper https://lnkd.in/dkwHuuzU
π₯14π7π€―6π±2β€1π€©1
πͺ Spacecraft Pose Estimation πͺ
πSnT (Luxembourg) unveils the most advanced event-based dataset for Spacecrafts: Unreal Engine + data from ICNS simulator + Real images + Real event data acquired in lab
πReview https://t.ly/m8JPB
πPaper https://lnkd.in/d_edvc3n
πProject https://lnkd.in/dPp375aY
πSnT (Luxembourg) unveils the most advanced event-based dataset for Spacecrafts: Unreal Engine + data from ICNS simulator + Real images + Real event data acquired in lab
πReview https://t.ly/m8JPB
πPaper https://lnkd.in/d_edvc3n
πProject https://lnkd.in/dPp375aY
β€7π€―2π1π±1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯Florence-2: unified Computer Visionπ₯
π#Microsoft announces Florence-2: novel foundation model with unified, prompt-based, representation for a large variety of #computervision & vision-language task. One backbone -> multiple tasks!
πReview https://t.ly/pOins
πPaper arxiv.org/pdf/2311.06242.pdf
πProject www.microsoft.com/en-us/research/project/projectflorence/
π#Microsoft announces Florence-2: novel foundation model with unified, prompt-based, representation for a large variety of #computervision & vision-language task. One backbone -> multiple tasks!
πReview https://t.ly/pOins
πPaper arxiv.org/pdf/2311.06242.pdf
πProject www.microsoft.com/en-us/research/project/projectflorence/
π±9β€5π₯3π1π1πΎ1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯π CrashCar101: Generative Damaged Carsπ₯π
π CrashCar101: procedural generation pipeline that damages 3D car models to obtain synthetic damaged cars paired with pixel-accurate annotations
π Review https://t.ly/pITHm
π Paper https://lnkd.in/dzp6q3T5
π Project https://lnkd.in/daRXg73N
π CrashCar101: procedural generation pipeline that damages 3D car models to obtain synthetic damaged cars paired with pixel-accurate annotations
π Review https://t.ly/pITHm
π Paper https://lnkd.in/dzp6q3T5
π Project https://lnkd.in/daRXg73N
β€7π1π₯1π€―1