AI with Papers - Artificial Intelligence & Deep Learning – Telegram

@AI_DeepLearning

17.5K subscribers

156 photos

274 videos

14 files

1.43K links

All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#AI #chatGPT

🦜Geometry-Aware 4D Head🦜

👉 GeoDiff4D is a novel framework that reconstructs animatable 4D head avatars from a single portrait image through geometry-aware diffusion. Code announced💙

👉Review https://t.ly/J9L-t
👉Paper https://lnkd.in/ddpv-78g
👉Project https://lnkd.in/d-vhukyj
👉Repo https://lnkd.in/dzd6mnFv

❤5👏3👍1🔥1🤯1🍾1

3.51K views, 15:06

🍓Fully Offline Mobile-VTON🍓

👉A novel, high-quality, privacy-preserving framework that enables fully offline virtual try-on on commodity mobile devices using only a single user image and a garment image. Repo announced, code to be released💙

👉Review https://t.ly/dsrIn
👉Paper arxiv.org/pdf/2603.00947
👉Project zhenchenwan.github.io/Mobile-VTON/
👉Repo https://github.com/tmllab/2026_CVPR_Mobile-VTON

❤11🤯3👏2🔥1

3.7K views, 12:57

🪿All Point Clouds-One Encoder🪿

👉Utonia is a step toward a one-from-all, one-for-all point cloud encoder. It pretrains a single encoder on diverse point cloud data and reuses it as a reliable backbone for downstream tasks. Code under Apache 2.0💙

👉Review https://t.ly/yqSyZ
👉Paper https://arxiv.org/pdf/2603.03283
👉Project pointcept.github.io/Utonia/
👉Repo https://github.com/Pointcept/Utonia

❤7🔥2👍1👏1

3.57K views, edited 08:11

🐪DuoMo: Dual Motion Diffusion🐪

👉DuoMo by Meta is a novel generative method that recovers human motion in world-space coordinates from unconstrained videos with noisy or incomplete observations. Code announced💙

👉Review https://t.ly/dnA3K
👉Paper arxiv.org/pdf/2603.03265
👉Project yufu-wang.github.io/duomo/
👉Repo TBA

❤7👍2🤯2👏1

3.7K views, edited 13:11

🍙Any Resolution, Any Geometry🍙

👉Ultra Resolution Geometry Transformer (URGT) for depth and normal estimation at arbitrary resolutions (e.g. 4K, 6K, 8K). New SOTA. Repo under MIT💙

👉Review https://t.ly/HXg1n
👉Paper arxiv.org/pdf/2603.03026
👉Project dreamaker-mrc.github.io/Any-Resolution-Any-Geometry/
👉Repo github.com/Dreamaker-MrC/Any-Resolution-Any-Geometry

🔥8❤6👍1👏1

4.08K views, 06:55

Would it be useful for you to see a few (verified) AI job postings in this channel?

Anonymous Poll

💚YES, why not?!

❌ NO, only damn AI & Papers

❤5

347 voters, 3.75K views, 14:09

🍧Monocular 3D Clothed Human🍧

👉MultiGO++ is a novel framework for monocular 3D clothed human reconstruction via geometry-texture collaboration. New SOTA but no code announced🥲

👉Review https://t.ly/YKY44
👉Paper arxiv.org/pdf/2603.04993
👉Project 3dagentworld.github.io/multigo++

❤4👍1👏1

4.03K views, 07:07

🎪SOTA Arbitrary Tracking🎪

👉TAPFormer is a novel SOTA transformer-based framework that performs asynchronous, temporally consistent fusion of frames and events for robust, high-frequency point tracking. Repo & Dataset under MIT💙

👉Review https://t.ly/-q4wm
👉Paper https://arxiv.org/pdf/2603.04989
👉Project http://tapformer.github.io/
👉Repo https://github.com/ljx1002/TAPFormer

❤5👍3🔥3👏2🍾1

4.6K views, edited 08:00

📊Real-Time Scene Graph📊

👉REACT++ by Umeå University is the new state-of-the-art model for real-time scene graph generation (SGG): 20% faster with a gain of 10% in relation prediction accuracy on average. Code under MIT💙

👉Review https://t.ly/c12VX
👉Paper https://arxiv.org/pdf/2603.06386
👉Repo https://github.com/Maelic/SGG-Benchmark

🔥6❤3👏3👍1

4.23K views, 07:51

🔥Holistic 3D Spatial Intelligence🔥

👉Holi-Spatial is the first fully automated pipeline capable of converting raw video streams into holistic 3D spatial annotations without human intervention. Code/Data announced💙

👉Review https://t.ly/PDpr9
👉Paper https://lnkd.in/dTbMuZCm
👉Project https://lnkd.in/d66CYB4q
👉Repo https://lnkd.in/dAGzShXj

❤8🔥7👍2👏1

3.85K views, 07:57

🍓Surface Light Tokenizer🍓

👉Apple unveils LITO, a novel latent flow matching model that enables HQ image-to-3D. Its latent representation encodes a surface light field into a compact set of latent vectors. Impressive results but no code🥲

👉Review https://t.ly/xcWNe
👉Paper https://lnkd.in/dYHwY4YX
👉Project https://lnkd.in/dtJT8bXy

❤8👍4🔥2👏2🤯1🍾1

3.94K views, 07:46

☄️ OmniStream Backbone ☄️

👉Novel unified streaming visual backbone that effectively perceives, reconstructs, and acts from diverse visual inputs. Repo/Models announced💙

👉Review https://t.ly/_zZMO
👉Paper arxiv.org/pdf/2603.12265
👉Project go2heart.github.io/omnistream/
👉Repo github.com/Go2Heart/OmniStream

❤6👏2🤯2💩1

4.01K views, edited 07:40

🌈 New SOTA Video Depth 🌈

👉DVD is the new Video Depth Estimation SOTA, with the full training suite available under Apache 2.0💙

👉Review https://t.ly/gpCkG
👉Paper https://arxiv.org/pdf/2603.12250
👉Project https://dvd-project.github.io/
👉Repo github.com/EnVision-Research/DVD

❤7🔥3👍2👏1

4.24K views, edited 12:51

🤖Physically-Plausible Human🤖

👉PhysMoDPO is a novel direct preference optimization framework for humanoid motion generation. Repo under MIT💙

👉Review https://t.ly/clf8w
👉Paper https://arxiv.org/pdf/2603.13228
👉Project https://mael-zys.github.io/PhysMoDPO/
👉Repo https://github.com/Mael-zys/PhysMoDPO

1❤4🔥2

3.61K views, 13:11

🍧10,000× faster SAM-3D🍧

👉Fast SAM 3D Body achieves up to a 10.9× speedup and over 10,000× faster MHR-to-SMPL conversion, enabling real-time humanoid control from RGB. Repo available💙

👉Review https://t.ly/uHx84
👉Paper https://arxiv.org/pdf/2603.15603
👉Project yangtiming.github.io/Fast-SAM-3D-Body-Page/
👉Repo https://github.com/yangtiming/Fast-SAM-3D-Body

🔥9❤2👏2

4.08K views, 10:02

🍓Material-Aware Grouping🍓

👉Material Magic Wand (Adobe) is a tool for material-aware grouping of parts in untextured 3D meshes. Given one selected part, it automatically retrieves the other parts of the same shape that share its material. Repo announced💙

👉Review https://t.ly/q00SU
👉Paper https://arxiv.org/pdf/2603.17370
👉Project umangi-jain.github.io/material-magic-wand/
👉Repo TBA

🔥4

4.63K views, 07:51

🦪OccAny: Universal 3D Occupancy🦪

👉OccAny by Valeo is a novel unified framework for generalized unconstrained urban 3D occupancy prediction. Repo under Apache 2.0💙

👉Review https://t.ly/FFiU0
👉Paper https://arxiv.org/pdf/2603.23502
👉Project https://valeoai.github.io/OccAny/
👉Repo https://github.com/valeoai/OccAny

🔥6👍2❤1

3.87K views, edited 08:05

🐍Pose-Appearance-Motion for HOI🐍

👉PAM is a novel Pose–Appearance–Motion Engine for SOTA controllable Hand–Object Interaction video generation. Repo/models available💙

👉Review https://t.ly/JU4MD
👉Paper arxiv.org/pdf/2603.22193
👉Project gasaiyu.github.io/PAM.github.io/
👉Repo https://github.com/GasaiYU/PAM

❤7👍2🔥2

3.88K views, edited 14:03

💥 GaussianGPT 3D GSC💥

👉From TUM, GaussianGPT performs transformer-based 3D Gaussian generation via next-token prediction, producing full, complex 3D indoor scenes. Repo announced💙

👉Review https://t.ly/bj-lL
👉Paper arxiv.org/pdf/2603.26661
👉Project nicolasvonluetzow.github.io/GaussianGPT/
👉Repo TBA

🔥8❤2👍1👏1

2.28K views, edited 07:03