AI with Papers - Artificial Intelligence & Deep Learning
17.5K subscribers
156 photos
274 videos
14 files
1.43K links
All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#AI #chatGPT
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🐍Pose-Appearance-Motion for HOI🐍

πŸ‘‰PAM is a novel Pose–Appearance–Motion Engine for controllable Hand–Object Interaction SOTA video generation. Repo/models availableπŸ’™

πŸ‘‰Review https://t.ly/JU4MD
πŸ‘‰Paper arxiv.org/pdf/2603.22193
πŸ‘‰Project gasaiyu.github.io/PAM.github.io/
πŸ‘‰Repo https://github.com/GasaiYU/PAM
❀7πŸ‘2πŸ”₯2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ’₯ GaussianGPT 3D GSCπŸ’₯

πŸ‘‰From TUM, GaussianGPT: transformer-based 3D Gaussians generation via next-token prediction -> full 3D complex indoor scene. Repo announcedπŸ’™

πŸ‘‰Review https://t.ly/bj-lL
πŸ‘‰Paper arxiv.org/pdf/2603.26661
πŸ‘‰Project nicolasvonluetzow.github.io/GaussianGPT/
πŸ‘‰Repo TBA
πŸ”₯8❀2πŸ‘1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘ŒHandX: Scaling Hands MotionπŸ‘Œ

πŸ‘‰ HandX is a unified foundation spanning data, annotation, and evaluation: novel large-scale dataset of bimanual & dexterous motions with fine-grained textual. Around 6M frames. Repo availableπŸ’™

πŸ‘‰Review https://t.ly/1nGxw
πŸ‘‰Paper https://arxiv.org/pdf/2603.28766
πŸ‘‰Project https://handx-project.github.io/
πŸ‘‰Repo github.com/handx-project/HandX
πŸ”₯9❀2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🌡SOTA Training-Free In-Context Segmentation🌡

πŸ‘‰INSID3 is the new SOTA, training-free approach that segments concepts at varying granularities only from frozen DINOv3 features, given an in-context example. Repo under Apache 2.0πŸ’™

πŸ‘‰Review https://t.ly/NVWHN
πŸ‘‰Paper arxiv.org/pdf/2603.28480
πŸ‘‰Project visinf.github.io/INSID3/
πŸ‘‰Repo github.com/visinf/INSID3
❀16πŸ”₯2🀩2πŸ‘1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ¬Camera Raw Image GenerationπŸͺ¬

πŸ‘‰RawGen by #Samsung is a generative approach that learns the complex distribution of raw sensor data directly, enabling high-fidelity generation from either text descriptions or standard sRGB images across arbitrary camera sensors. Linear raw image once, then apply any ISP operation. Repo announcedπŸ’™

πŸ‘‰Review https://t.ly/_QVKP
πŸ‘‰Paper https://arxiv.org/pdf/2604.00093
πŸ‘‰Project https://dy112.github.io/rawgen-page/
πŸ‘‰Repo TBA
❀3πŸ”₯2πŸ‘1
If you have to invest TODAY 1B$ on a frontier tech for the next decade, would you invest in space, agentic, quantum or frugal GPUs? Vote here: https://t.ly/hSx6i
🀣3❀1πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
🍎Video Object Deletion🍎

πŸ‘‰Void by Netflix is a novel video object removal framework designed to perform physically-plausible inpainting in very complex scenarios. Repo under Apache 2.0πŸ’™

πŸ‘‰Review https://t.ly/cMVny
πŸ‘‰Paper https://arxiv.org/pdf/2604.02296
πŸ‘‰Project https://void-model.github.io/
πŸ‘‰Repo https://github.com/Netflix/void-model
❀3🀯2πŸ‘1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯Vanast: VTON w/ Human AnimationπŸ”₯

πŸ‘‰SNU unveils a novel unified framework that generates garment-transferred human animation videos directly from a single human/garment images, and pose guidance clip. Repo announcedπŸ’™

πŸ‘‰Review https://t.ly/c0t79
πŸ‘‰Paper arxiv.org/pdf/2604.04934
πŸ‘‰Project hyunsoocha.github.io/vanast/
πŸ‘‰Repo github.com/snuvclab/vanast
❀5πŸ”₯1🀯1