AI with Papers - Artificial Intelligence & Deep Learning

🌈 New SOTA Video Depth 🌈

👉DVD is the new Video Depth Estimation SOTA, with a full training suite available under Apache 2.0💙

👉Review https://t.ly/gpCkG
👉Paper https://arxiv.org/pdf/2603.12250
👉Project https://dvd-project.github.io/
👉Repo github.com/EnVision-Research/DVD

4.49K views, edited 12:51

🤖Physically-Plausible Human🤖

👉PhysMoDPO is a novel direct preference optimization framework for humanoid motion generation. Repo under MIT💙

👉Review https://t.ly/clf8w
👉Paper https://arxiv.org/pdf/2603.13228
👉Project https://mael-zys.github.io/PhysMoDPO/
👉Repo https://github.com/Mael-zys/PhysMoDPO
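
For readers new to direct preference optimization, here is a minimal sketch of the generic DPO loss for a single preferred/rejected pair. It assumes scalar sequence log-probabilities from a policy and a frozen reference model; the function name, argument names and the `beta` default are illustrative, and this is not taken from the PhysMoDPO repo, whose objective may differ.

```python
import math

def dpo_loss(logp_w, logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    """Generic DPO loss for one (preferred, rejected) pair.

    logp_w / logp_l: policy log-probabilities of the preferred / rejected sample.
    ref_logp_w / ref_logp_l: the same quantities under the frozen reference model.
    All names here are hypothetical, chosen for illustration only.
    """
    # Scaled difference of the two log-ratios (policy vs. reference).
    margin = beta * ((logp_w - ref_logp_w) - (logp_l - ref_logp_l))
    # -log(sigmoid(margin)), written in a numerically stable form.
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))
```

The loss drops below log(2) once the policy favors the preferred sample more strongly than the reference does, and rises above it in the opposite case.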

4.25K views, 13:11

🍧10,000× faster SAM-3D🍧

👉Fast SAM 3D Body achieves up to a 10.9× speedup and over 10,000× faster MHR-to-SMPL conversion, enabling real-time humanoid control from RGB. Repo available💙

👉Review https://t.ly/uHx84
👉Paper https://arxiv.org/pdf/2603.15603
👉Project yangtiming.github.io/Fast-SAM-3D-Body-Page/
👉Repo https://github.com/yangtiming/Fast-SAM-3D-Body

4.67K views, 10:02

🍓Material-Aware Grouping🍓

👉Material Magic Wand (Adobe) is a tool for material-aware grouping of parts in untextured 3D meshes. Given one selected part, it automatically retrieves the other parts of the same shape that share its material. Repo announced💙

👉Review https://t.ly/q00SU
👉Paper https://arxiv.org/pdf/2603.17370
👉Project umangi-jain.github.io/material-magic-wand/
👉Repo TBA

5.28K views, 07:51

🦪OccAny: Universal 3D Occupancy🦪

👉OccAny by Valeo is a novel unified framework for generalized unconstrained urban 3D occupancy prediction. Repo under Apache 2.0💙

👉Review https://t.ly/FFiU0
👉Paper https://arxiv.org/pdf/2603.23502
👉Project https://valeoai.github.io/OccAny/
👉Repo https://github.com/valeoai/OccAny

4.47K views, edited 08:05

🐍Pose-Appearance-Motion for HOI🐍

👉PAM is a novel Pose–Appearance–Motion Engine for SOTA controllable Hand–Object Interaction video generation. Repo/models available💙

👉Review https://t.ly/JU4MD
👉Paper arxiv.org/pdf/2603.22193
👉Project gasaiyu.github.io/PAM.github.io/
👉Repo https://github.com/GasaiYU/PAM

4.84K views, edited 14:03

💥 GaussianGPT 3D GSC💥

👉From TUM, GaussianGPT: transformer-based generation of 3D Gaussians via next-token prediction, producing complex, full 3D indoor scenes. Repo announced💙

👉Review https://t.ly/bj-lL
👉Paper arxiv.org/pdf/2603.26661
👉Project nicolasvonluetzow.github.io/GaussianGPT/
👉Repo TBA

3.11K views, edited 07:03

👌HandX: Scaling Hands Motion👌

👉HandX is a unified foundation spanning data, annotation, and evaluation: a novel large-scale dataset of bimanual & dexterous motions with fine-grained textual annotations. Around 6M frames. Repo available💙

👉Review https://t.ly/1nGxw
👉Paper https://arxiv.org/pdf/2603.28766
👉Project https://handx-project.github.io/
👉Repo github.com/handx-project/HandX

2.89K views, 11:30

🌵SOTA Training-Free In-Context Segmentation🌵

👉INSID3 is the new SOTA: a training-free approach that segments concepts at varying granularities using only frozen DINOv3 features, given an in-context example. Repo under Apache 2.0💙

👉Review https://t.ly/NVWHN
👉Paper arxiv.org/pdf/2603.28480
👉Project visinf.github.io/INSID3/
👉Repo github.com/visinf/INSID3

3K views, edited 07:24

🪬Camera Raw Image Generation🪬

👉RawGen by #Samsung is a generative approach that learns the complex distribution of raw sensor data directly, enabling high-fidelity generation from either text descriptions or standard sRGB images across arbitrary camera sensors. Generate a linear raw image once, then apply any ISP operation. Repo announced💙

👉Review https://t.ly/_QVKP
👉Paper https://arxiv.org/pdf/2604.00093
👉Project https://dy112.github.io/rawgen-page/
👉Repo TBA
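
To illustrate why a linear raw image is handy, here is a toy sketch of rendering the same linear data with different ISP settings. Everything in it is an assumption for illustration: the function name, the white-balance gains and the plain gamma curve are hypothetical, the input is taken as already-demosaiced linear RGB in [0, 1], and none of it comes from RawGen itself.

```python
def apply_isp(linear_rgb, wb_gains=(2.0, 1.0, 1.5), gamma=2.2):
    """Toy ISP on linear RGB pixels: white balance, clip, gamma encode.

    linear_rgb: list of (r, g, b) tuples with linear values in [0, 1].
    wb_gains and gamma are hypothetical defaults, not real camera values.
    """
    out = []
    for pixel in linear_rgb:
        # Per-channel white-balance gain, clipped back into [0, 1].
        balanced = [min(max(v * g, 0.0), 1.0) for v, g in zip(pixel, wb_gains)]
        # Simple power-law encoding standing in for a display transfer curve.
        out.append(tuple(b ** (1.0 / gamma) for b in balanced))
    return out

# The same linear pixels rendered two different ways.
raw = [(0.25, 0.5, 0.25)]
neutral = apply_isp(raw, wb_gains=(1.0, 1.0, 1.0), gamma=1.0)
warm = apply_isp(raw, wb_gains=(2.0, 1.0, 1.0), gamma=2.0)
```

Because the starting point is linear, each rendering is computed from the original values rather than from an already-processed sRGB image.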

3.18K views, 07:54

If you had to invest $1B TODAY in one frontier tech for the next decade, would you pick space, agentic AI, quantum, or frugal GPUs? Vote here: https://t.ly/hSx6i

3.23K views, 14:04

🍎Video Object Deletion🍎

👉Void by Netflix is a novel video object removal framework designed to perform physically-plausible inpainting in very complex scenarios. Repo under Apache 2.0💙

👉Review https://t.ly/cMVny
👉Paper https://arxiv.org/pdf/2604.02296
👉Project https://void-model.github.io/
👉Repo https://github.com/Netflix/void-model

3.64K views, 06:33

🔥Vanast: VTON w/ Human Animation🔥

👉SNU unveils a novel unified framework that generates garment-transferred human animation videos directly from a single human image, garment images, and a pose guidance clip. Repo announced💙

👉Review https://t.ly/c0t79
👉Paper arxiv.org/pdf/2604.04934
👉Project hyunsoocha.github.io/vanast/
👉Repo github.com/snuvclab/vanast

2.42K views, 06:31

🔥BoxerNet: SOTA 2D->3D BBs🔥

👉Boxer by META: a transformer-based network that lifts 2D BB proposals into 3D, followed by multi-view fusion and geometric filtering to produce globally consistent, de-duplicated 3D BBs in metric world space. Repo under CC BY-NC 4.0 (Attribution-NonCommercial 4.0 International)💙

👉Review https://t.ly/mlmV1
👉Paper https://arxiv.org/pdf/2604.05212
👉Project facebookresearch.github.io/boxer/
👉Repo github.com/facebookresearch/boxer

2.34K views, edited 06:53

Hinton our guest in Pavia (remotely) 💚😈

Would you like to see a clip of the interview?

2.33K views, edited 20:15

Here is the preview; the full clip from the official source comes tomorrow :)

2.48K views, 21:04

🪞1.1M Metric VTON Dataset🪞

👉Google's Fit-Inclusive Try-on: large-scale VTO dataset comprising over 1.13M try-on image triplets accompanied by precise body and garment measurements. Repo & dataset announced💙

👉Review https://t.ly/cs-pt
👉Paper arxiv.org/pdf/2604.08526
👉Project johannakarras.github.io/FIT/
👉Repo TBA

1.96K views, 06:34

🐞6D Object Pose w/ Deformation🐞

👉DeSOPE by Xidian & #MagicLeap is a novel large-scale dataset for 6DoF deformed objects: 665K pose annotations produced via a semiautomatic pipeline. Repo & Dataset announced💙

👉Review https://t.ly/M5VgX
👉Paper https://arxiv.org/pdf/2604.06720
👉Project https://desope-6d.github.io/
👉Repo TBA

1.45K views, 06:47

🔥SOTA 3D Detection in the wild🔥

👉WildDet3D is a novel unified geometry-aware architecture for 3D detection that natively accepts text, point, and box prompts and can incorporate auxiliary depth signals at inference time. New SOTA! Repo, models and iPhone 💙

👉Review https://t.ly/8NxBN
👉Paper arxiv.org/pdf/2604.08626
👉Project allenai.github.io/WildDet3D/
👉Repo github.com/allenai/WildDet3D

976 views, edited 06:03
