AI with Papers - Artificial Intelligence & Deep Learning

💜MoRo: Human Motion💜

👉Masked modeling for human motion Recovery under Occlusions. Given a monocular video captured from a static camera, MoRo (by ETHZ & META) robustly reconstructs accurate/physically plausible human motion, even under challenging occlusions. Repo released💙

👉Review https://t.ly/kK_je
👉Paper arxiv.org/pdf/2601.16079
👉Project mikeqzy.github.io/MoRo/
👉Repo github.com/mikeqzy/MoRo

❤6👏1

4.18K viewsedited 12:47

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔥 BBoxMaskPose v2 is fire 🔥

👉BBoxMaskPose v2 by ČVUT offers SOTA performance in detection, segmentation & 2D pose in crowded scenes. It enables 3D human reconstruction even in scenes with complex interactions. Code, Models & data available💙

👉Review https://t.ly/GkkDl
👉Paper arxiv.org/pdf/2601.15200
👉Project https://lnkd.in/dQ_3hxjC
👉Repo https://lnkd.in/dVqwD3jN

❤5👍3👏1

4.27K views12:52

AI with Papers - Artificial Intelligence & Deep Learning

0:03

This media is not supported in your browser

VIEW IN TELEGRAM

🦠Generalized-Scale Counting🦠

👉GeCo2 (Ljubljana) is a novel e2e SOTA few-shot method that explicitly addresses the object scale issues. Repo & Demo 💙

👉Review https://t.ly/2_7I8
👉Paper https://arxiv.org/pdf/2511.08048
👉Repo https://github.com/jerpelhan/GECO2
👉Demo huggingface.co/spaces/jerpelhan/GECO2-demo

👍11❤1🔥1

4.5K viewsedited 12:47

AI with Papers - Artificial Intelligence & Deep Learning

🔥🔥Super-Hard Poll folks🔥🔥

👉 This dilemma is driving me crazy. Vote: https://www.linkedin.com/posts/visionarynet_activity-7421974594917588992-YNAG

(and of course comment here)

❤5👍1🔥1

4.14K viewsedited 18:00

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌻MLLMs Fine Segmentation🌻

👉SimpleSeg: MLLMs with native pixel-level perception. Repo & Model available💙

👉Review https://t.ly/eVguh
👉Paper arxiv.org/pdf/2601.19228
👉Project simpleseg.github.io/
👉Repo github.com/songtianhui/SimpleSeg

🔥4👍3❤2👏1

4.34K viewsedited 07:39

AI with Papers - Artificial Intelligence & Deep Learning

🔥 DeepSeek-OCR 2 is out 🔥

👉DeepSeek-AI announced the new version of its powerful SOTA OCR. A new architectural approach with the potential to achieve genuine 2D reasoning. Codes & weights💙

👉Review https://t.ly/gX4bX
👉Paper https://arxiv.org/pdf/2601.20552
👉Repo github.com/deepseek-ai/DeepSeek-OCR-2

❤8🔥7👏1

4.21K views07:33

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

📊 SOTA Style Transfer 📊

👉TeleAI unveils TeleStyle, a lightweight yet effective model for image/video stylization. Built upon Qwen-Image-Edit, TeleStyle leverages the base model’s robust capabilities in content preservation & style customization. Code & Model released💙

👉Review https://t.ly/viVR0
👉Paper arxiv.org/pdf/2601.20175
👉Project tele-ai.github.io/TeleStyle/
👉Repo github.com/Tele-AI/TeleStyle

❤12👍2🔥1🤯1🤣1

4.45K views13:01

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍑 Metric Anything is out 🍑

👉Metric Anything (Li Auto inc.) is a simple and scalable pretraining framework that learns metric depth from noisy, diverse 3D sources without manually engineered prompts, camera-specific modeling, or task-specific architectures. Impressive. Code announced 💙

👉Review https://t.ly/54Ccr
👉Paper arxiv.org/pdf/2601.22054
👉Project metric-anything.github.io/metric-anything-io/
👉Repo github.com/metric-anything/metric-anything

🔥11❤5👏1

5.02K views08:02

AI with Papers - Artificial Intelligence & Deep Learning

Still in love with this channel?

Anonymous Poll

❤8

353 voters4.5K views21:37

AI with Papers - Artificial Intelligence & Deep Learning

The hottest website on Earth right now: https://www.moltbook.com

What do you think about it?

moltbook

moltbook - the front page of the agent internet

A social network built exclusively for AI agents. Where AI agents share, discuss, and upvote. 🦞🤖

❤2🤯1😢1

5.03K views22:03

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌈Segment Any Events by Language🌈

👉SEAL (by NUS) is the first Semantic-aware Segment Any Events framework that addresses Open-Vocabulary Event Instance Segmentation. Code announced💙

👉Review https://t.ly/1ZMF0
👉Paper https://arxiv.org/pdf/2601.23159
👉Project https://0nandon.github.io/SEAL/
👉Repo https://github.com/0nandon/SEAL

🔥7❤4👏1🤯1

5.17K views08:06

AI with Papers - Artificial Intelligence & Deep Learning

👉RAM prices skyrocketing

👉Me acting like a rich kid.

Let's talk: https://www.linkedin.com/posts/visionarynet_ai-ram-ddr5-activity-7424127924020072448-NbaO

🤣25❤4🔥1

4.97K viewsedited 16:36

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🐮CoWTracker: Track-Warping🐮

👉CoWTracker (VGG + META) is a novel dense point tracker that eschews cost volumes in favor of warping. Code/Models under FAIR NC💙

👉Review https://t.ly/6bAn9
👉Paper https://arxiv.org/pdf/2602.04877
👉Project https://cowtracker.github.io/
👉Repo https://github.com/facebookresearch/cowtracker

🔥4❤2👍1

5K viewsedited 07:12

AI with Papers - Artificial Intelligence & Deep Learning

0:02

This media is not supported in your browser

VIEW IN TELEGRAM

🌈TrajVG Trajectory-Geometry🌈

👉TrajVG is a novel reconstruction framework that makes cross-frame 3D correspondence an explicit prediction by estimating camera-coordinate 3D trajectories. Code announced💙

👉Review https://t.ly/yVi01
👉Paper arxiv.org/pdf/2602.04439
👉Project xingy038.github.io/TrajVG/
👉Repo github.com/xingy038/TrajVG

❤7🔥1👏1

5.33K viewsedited 09:26

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🪙MOMENTUM #NeurIPS 2025 🪙

👉MOMENTUM by Google (H/T Huguens Jean, Ph.D.) is a production multimodal agent architecture built on the Google ADK. It orchestrates 22 specialized tools (Gemini for reasoning, Imagen 4.0 for image generation, and Veo 3.1 for synthesis). Code announced💙

👉Review https://t.ly/06h7Q
👉Paper https://momentum-project-page-232993426383.us-central1.run.app/momentum_paper.pdf
👉Project https://momentum-project-page-232993426383.us-central1.run.app/
👉Repo TBA

👍3🔥2❤1

4.12K views13:46

AI with Papers - Artificial Intelligence & Deep Learning

😶‍🌫️ SOTA Full-Head Synthesis 😶‍🌫️

👉HyPlaneHead, the new SOTA in full-head image synthesis, delivering HQ results with significantly fewer artifacts compared to existing 3D-aware models. Repo announced💙

👉Review https://t.ly/WYfP3
👉Paper arxiv.org/pdf/2509.16748
👉Project https://lhyfst.github.io/hyplanehead/
👉Repo github.com/lhyfst/HyPlaneHead

❤3🔥3👍2👏1😢1

4.22K viewsedited 13:33

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍟 AnyTouch 2 is out 🍟

👉AnyTouch 2 is a general tactile representation learning framework for diverse optical tactile sensors that unifies object-level understanding with fine-grained, force-aware dynamic perception. Repo, Model & Data💙

👉Review https://t.ly/fP4dP
👉Paper https://arxiv.org/pdf/2602.09617
👉Project gewu-lab.github.io/AnyTouch2/
👉Repo github.com/GeWu-Lab/AnyTouch2

❤6🔥1

4.09K views09:36

AI with Papers - Artificial Intelligence & Deep Learning

Vote here please 💙

https://www.linkedin.com/posts/visionarynet_py4ai-2026-coming-soon-activity-7427290532034265088-y69e

❤2🔥1

3.77K views10:02

AI with Papers - Artificial Intelligence & Deep Learning

🍌 AGENT BANANA (SOTA) 🍌

👉Agent Banana is the novel SOTA agentic system for HD, native-resolution image editing through reasoning-based NL interaction, where each edit is context-aware, logically dependent, and locally precise. Code announced💙

👉Review https://t.ly/EXaCH
👉Paper https://arxiv.org/pdf/2602.09084
👉Project https://agent-banana.github.io/
👉Repo https://github.com/taco-group/agent-banana

❤12👏1

4.21K views13:14

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🛠️ IndustryShapes 6D Pose 🛠️

👉IndustryShapes by NTUA is a new RGB-D dataset of industrial tools, designed for both instance-level and novel object 6D pose estimation. Dataset available💙

👉Review https://t.ly/KKcuH
👉Paper https://arxiv.org/pdf/2602.05555
👉Project https://pose-lab.github.io/IndustryShapes/
👉Dataset https://huggingface.co/datasets/POSE-Lab/IndustryShapes

❤8🔥2👏1

4.36K viewsedited 07:58

About

Blog

Apps

Platform