AI with Papers - Artificial Intelligence & Deep Learning
17.3K subscribers
158 photos
276 videos
14 files
1.45K links
All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#AI #chatGPT
🍑4K4D: Real-Time 4D at 4K🍑

👉THE new SOTA in view synthesis of dynamic 3D scenes at 4K. 30x faster, up to 400 FPS. Nuts!

😎Review https://t.ly/6ddQh
😎Paper arxiv.org/pdf/2310.11448.pdf
😎Project zju3dv.github.io/4k4d/
😎Code github.com/zju3dv/4K4D
🛣️ Holistic Parking Detection (YOLO) 🛣️

👉 One-step Holistic Parking Slot Network: a tailor-made adaptation of the YOLOv4 algorithm for detecting parking slots of any shape

😎Review https://t.ly/2l4ZG
😎Paper arxiv.org/pdf/2310.11629.pdf
🍈 Cutie: VOS with heavy occlusions🍈

👉Cutie: a novel VOS method for challenging scenarios with heavy occlusions & distractors

😎Review https://t.ly/W3FR-
😎Paper arxiv.org/pdf/2310.12982.pdf
😎Project https://hkchengrex.com/Cutie
😎Code https://github.com/hkchengrex/Cutie
🧑 Rotoscoping Prince Of Persia (1985) 🧑

👉 Rare footage used for the animation of Prince of Persia (1989). Damn Romantic.

😎 More https://t.ly/xJife
🪛PACE: new SOTA Motion🪛

👉#Nvidia unveils a novel SOTA method for estimating human motion in a global scene from moving cameras. Stunning results.

😎Review https://t.ly/20you
😎Project https://nvlabs.github.io/PACE
😎Paper https://arxiv.org/pdf/2310.13768.pdf
NanoSAM: SAM on low-cost boards

👉NanoSAM is a Segment Anything variant capable of running in real-time on #NVIDIA Jetson Orin with TensorRT

😎Review https://t.ly/UErq_
😎Tutorial https://github.com/NVIDIA-AI-IOT/nanosam
🧂 SOTA RGB-D Video Salient Object 🧂

👉 DCTNet+ (model) and RDVS (dataset) for a new SOTA in Video Salient Object Detection

😎Review https://t.ly/DapLV
😎Code github.com/kerenfu/RDVS
😎Paper arxiv.org/pdf/2310.15482.pdf
✌️ Relighted 3D Hands 🤞

👉#META unveils Re:InterHand: a large dataset of relighted 3D interacting hands

😎Review https://t.ly/I1dQk
😎Paper arxiv.org/pdf/2310.17768.pdf
😎Project mks0601.github.io/ReInterHand
😎Data github.com/mks0601/ReInterHand
Video Understanding with GPT-4V(ision)

👉 #Microsoft unveils MM-Vid, the most advanced video understanding framework (w/ #chatgpt4). Impressive results on long-form videos & intricate tasks such as audio description & multimodal high-level comprehension

😎Review https://t.ly/RISMm
😎Paper arxiv.org/pdf/2310.19773.pdf
😎Project https://multimodal-vid.github.io
👣 Foot via Synthetic Data 👣

👉 50,000 synthetic/photorealistic foot images + a novel SOTA library for 3D foot reconstruction

😎Review https://t.ly/TVanP
😎Paper https://arxiv.org/pdf/2310.18279.pdf
😎Project https://ollieboyne.github.io/FOUND
😎Code https://github.com/OllieBoyne/FOUND
🚛 OYSTER: unsupervised detection w/ LiDAR 🚛

👉Waabi unveils OYSTER: a novel unsupervised object detector for LiDAR point clouds.

😎Review https://t.ly/EMi58
😎Project https://waabi.ai/oyster/
😎Paper arxiv.org/pdf/2311.02007.pdf
🔥GPT-4 Pass the Turing Test?🔥

👉No. I mean...not yet. Read this paper from UC San Diego👇

😎Review https://t.ly/o8HgM
😎Paper https://arxiv.org/pdf/2310.20216.pdf
🥻SF: Towards Virtual Cloth🥻

👉SEA AI Lab unveils a novel #AI to recover garment sewing patterns from daily photos for #AR / #VR worlds

😎Review https://t.ly/MwpAV
😎Project https://sewformer.github.io/
😎Paper https://arxiv.org/pdf/2311.04218.pdf
😎Code https://github.com/sail-sg/sewformer
🛋️ 3DiffTection: new SOTA 3D detection 🛋️

👉#Nvidia unveils 3DiffTection, the new SOTA for 3D object detection from single images. A powerful 3D detector powered by a diffusion model

😎Review https://t.ly/PciXY
😎Paper https://arxiv.org/pdf/2311.04391.pdf
😎Code https://github.com/nv-tlabs/3DiffTection
😎Project research.nvidia.com/labs/toronto-ai/3difftection
πŸͺ 30x Faster Neural Scenes πŸͺ

πŸ‘‰ NeuRas: realistic real-time novel-view synthesis of VERY large scenes (>10000 m2 ). 30Γ— faster rendering than previous SOTA w/ comparable or better realism

😎Review https://t.ly/ELJSE
😎Paper https://arxiv.org/pdf/2311.05607.pdf
😎Project https://waabi.ai/NeuRas/
🔥 Hu.ma.ne #AI Pin is out! 🔥

👉Hu.ma.ne just launched #AI Pin: a new standalone, screenless, AI-powered device. It runs on GPT-4, is suitable for real-time translation, and includes an #AI-powered camera and laser projector

😎 More https://t.ly/IvoN7
🫀 Segmentation of Human 🫀

👉TotalSegmentator_v2: segmenting 104 anatomical structures (27 organs, 59 bones, 10 muscles, 8 vessels) in CT. Now available in 3D Slicer, an open-source platform for image visualization.

😎Review https://t.ly/yHMm1
😎Code https://lnkd.in/dvgrbsCE
😎Paper https://lnkd.in/dkwHuuzU
🪐 Spacecraft Pose Estimation 🪐

👉SnT (Luxembourg) unveils the most advanced event-based dataset for spacecraft: Unreal Engine + data from ICNS simulator + real images + real event data acquired in the lab

😎Review https://t.ly/m8JPB
😎Paper https://lnkd.in/d_edvc3n
😎Project https://lnkd.in/dPp375aY
🔥Florence-2: unified Computer Vision🔥

👉#Microsoft announces Florence-2: a novel foundation model with a unified, prompt-based representation for a large variety of #computervision & vision-language tasks. One backbone -> multiple tasks!

👉Review https://t.ly/pOins
👉Paper arxiv.org/pdf/2311.06242.pdf
👉Project www.microsoft.com/en-us/research/project/projectflorence/
💥🚗 CrashCar101: Generative Damaged Cars💥🚗

👉 CrashCar101: procedural generation pipeline that damages 3D car models to obtain synthetic damaged cars paired with pixel-accurate annotations

👉 Review https://t.ly/pITHm
👉 Paper https://lnkd.in/dzp6q3T5
👉 Project https://lnkd.in/daRXg73N