AI with Papers - Artificial Intelligence & Deep Learning

🐍Pose-Appearance-Motion for HOI🐍

👉PAM is a novel Pose–Appearance–Motion Engine for controllable Hand–Object Interaction SOTA video generation. Repo/models available💙

👉Review https://t.ly/JU4MD
👉Paper arxiv.org/pdf/2603.22193
👉Project gasaiyu.github.io/PAM.github.io/
👉Repo https://github.com/GasaiYU/PAM

❤7👍2🔥2

4.04K viewsedited 14:03

Please open Telegram to view this post

VIEW IN TELEGRAM

09:45

AI with Papers - Artificial Intelligence & Deep Learning

0:04

This media is not supported in your browser

VIEW IN TELEGRAM

💥 GaussianGPT 3D GSC💥

👉From TUM, GaussianGPT: transformer-based 3D Gaussians generation via next-token prediction -> full 3D complex indoor scene. Repo announced💙

👉Review https://t.ly/bj-lL
👉Paper arxiv.org/pdf/2603.26661
👉Project nicolasvonluetzow.github.io/GaussianGPT/
👉Repo TBA

🔥8❤2👍1👏1

2.41K viewsedited 07:03

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

👌HandX: Scaling Hands Motion👌

👉 HandX is a unified foundation spanning data, annotation, and evaluation: novel large-scale dataset of bimanual & dexterous motions with fine-grained textual. Around 6M frames. Repo available💙

👉Review https://t.ly/1nGxw
👉Paper https://arxiv.org/pdf/2603.28766
👉Project https://handx-project.github.io/
👉Repo github.com/handx-project/HandX

🔥9❤2👏1

2.4K views11:30

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌵SOTA Training-Free In-Context Segmentation🌵

👉INSID3 is the new SOTA, training-free approach that segments concepts at varying granularities only from frozen DINOv3 features, given an in-context example. Repo under Apache 2.0💙

👉Review https://t.ly/NVWHN
👉Paper arxiv.org/pdf/2603.28480
👉Project visinf.github.io/INSID3/
👉Repo github.com/visinf/INSID3

❤16🔥2🤩2👍1🍾1

2.44K viewsedited 07:24

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🪬Camera Raw Image Generation🪬

👉RawGen by #Samsung is a generative approach that learns the complex distribution of raw sensor data directly, enabling high-fidelity generation from either text descriptions or standard sRGB images across arbitrary camera sensors. Linear raw image once, then apply any ISP operation. Repo announced💙

👉Review https://t.ly/_QVKP
👉Paper https://arxiv.org/pdf/2604.00093
👉Project https://dy112.github.io/rawgen-page/
👉Repo TBA

❤3🔥2👍1

2.47K views07:54

AI with Papers - Artificial Intelligence & Deep Learning

If you have to invest TODAY 1B$ on a frontier tech for the next decade, would you invest in space, agentic, quantum or frugal GPUs? Vote here: https://t.ly/hSx6i

🤣3❤1🔥1

2.38K views14:04

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍎Video Object Deletion🍎

👉Void by Netflix is a novel video object removal framework designed to perform physically-plausible inpainting in very complex scenarios. Repo under Apache 2.0💙

👉Review https://t.ly/cMVny
👉Paper https://arxiv.org/pdf/2604.02296
👉Project https://void-model.github.io/
👉Repo https://github.com/Netflix/void-model

❤3🤯2👍1👏1

2.62K views06:33

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔥Vanast: VTON w/ Human Animation🔥

👉SNU unveils a novel unified framework that generates garment-transferred human animation videos directly from a single human/garment images, and pose guidance clip. Repo announced💙

👉Review https://t.ly/c0t79
👉Paper arxiv.org/pdf/2604.04934
👉Project hyunsoocha.github.io/vanast/
👉Repo github.com/snuvclab/vanast

❤5🔥1🤯1

1.21K views06:31

About

Blog

Apps

Platform