AI with Papers - Artificial Intelligence & Deep Learning

🦙 Depth as Neural Implicit 🦙

👉InfiniDepth represents depth as neural implicit fields, "infinite" (i.e.16K) resolution and geometrical details. Repo under Apache 2.0💙

👉Review https://t.ly/4we5t
👉Paper https://lnkd.in/dpiHQExj
👉Project https://lnkd.in/dy3JxKye
👉Repo https://lnkd.in/dAXbnK5z

1🔥12❤2👍1👏1

4.95K views13:23

AI with Papers - Artificial Intelligence & Deep Learning

🔥 Back from Holidays mood 🔥

🤣24❤4🔥2👍1

4.75K viewsedited 08:54

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌍Label Any Object in 3D 🌍

👉LabelAny3D: novel analysis-by-synthesis framework that reconstructs holistic 3D scenes from 2D to efficiently produce HQ 3D BBs annotations. Repo under CC-BY-4.0 license💙

👉Review https://t.ly/bO93j
👉Paper https://lnkd.in/dYb97zWG
👉Project https://lnkd.in/dJ9UKERb
👉Repo https://lnkd.in/d9SxtmiA

❤10🔥7👍1👏1

4.82K views10:00

AI with Papers - Artificial Intelligence & Deep Learning

🔥 New #AI Startups in 2026? 🔥

In 2026, which area would you focus on?
🤖Agents → workflows, copilots, etc.
🏭Vertical AI → Pharma, Automotive, Energy ...
🧠Infrastructure → MLOps, Security, Cost Control ...
🎨AI for Creators/Media → Video, avatars, contents ...

Please, help me understanding what's next with this poll on LinkedIn :)

https://www.linkedin.com/posts/visionarynet_ai-ai-deeplearning-activity-7415377341779996672-sQO1

LUV U \m/

#ai #ai #deeplearning #aiwithpapers #metaverse | Alessandro Ferrari

🔥🔥 New #AI Startups in 2026? 🔥🔥

👉 Looking ahead to 2026, the question is no longer “can we build it?” but “where does it actually create durable value?” in the AI field. So, if you were to launch an AI startup in 2026, which area would you focus on?

🤖Agents…

🔥5❤1👍1

4.99K views15:30

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔥Orient Anything V2 is out🔥

👉Orient Anything V2 is a foundation model for unified understanding of object 3D orientation and rotation from single or paired images. Repo under CC-BY-4.0💙

👉Review https://t.ly/Ht7Xd
👉Paper arxiv.org/pdf/2601.05573
👉Project orient-anythingv2.github.io/
👉Repo github.com/SpatialVision/Orient-Anything-V2

❤5🔥2👍1

4.46K views08:25

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🫛Active Object Reconstruction🫛

👉ObjSplat (Beijing) autonomously plans viewpoints and progressively reconstructs an unknown object into a Hi-Fi Gaussian model and water-tight mesh, enabling direct use in physics simulations. Tough paper and repo announced💙

👉Review https://t.ly/au6HE
👉Paper arxiv.org/pdf/2601.06997
👉Project li-yuetao.github.io/ObjSplat-page/
👉Repo https://github.com/Li-Yuetao/ObjSplat

❤8👍1

4.52K viewsedited 16:15

AI with Papers - Artificial Intelligence & Deep Learning

In 2026, who should we keep an eye on?

Vote: https://www.linkedin.com/posts/visionarynet_ai-deeplearning-aiwithpapers-activity-7416886610795077632-qQeP/

❤2🔥2🤯1

3.92K views17:01

AI with Papers - Artificial Intelligence & Deep Learning

👉Games Workshop (Warhammer) is banning the use of AI in creative and design processes to protect IP and human creativity. A decision that goes against the current hype of widespread AI adoption.

And what about your organization? I need your help👇

Vote: https://www.linkedin.com/posts/visionarynet_ai-activity-7417106327019196417-TpGL

❤3🤯1

4.18K views07:36

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

💚Segment Anything Geometry💚

👉3AM (NYCU + #Nvidia) offers cross-view correspondence even under large viewpoint changes, cluttered scenes, and variations in capture conditions, enabling robust object tracking from both videos & casual multi-view images. Repo (coming) & Demo available💙

👉Review https://t.ly/olZwE
👉Paper https://arxiv.org/pdf/2601.08831
👉Project https://jayisaking.github.io/3AM-Page/
👉Repo https://github.com/jayisaking
👉Demo https://huggingface.co/spaces/nycu-cplab/3AM

🔥10❤4👍1

4.58K viewsedited 07:59

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🎇 Multi-target SAM3 🎇

👉SAM3-DMS is a novel training-free decoupled strategy that utilizes fine-grained memory selection on individual objects. Robust identity preservation and tracking stability. Repo under SAM License💙

👉Review https://t.ly/jJOAr
👉Paper https://arxiv.org/pdf/2601.09699
👉Repo https://github.com/FudanCVL/SAM3-DMS

🔥5❤2👍1👏1

4.6K views08:14

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍿100M Video Action Dataset🍿

👉Action100M by META is a large-scale dataset w/ 1.2M instructional videos (14.6 years of duration), yielding O(100M) temporally localized segments with open-vocabulary action supervision and rich captions. Repo under FAIR NC Research License💙

👉Review https://t.ly/w5KXe
👉Paper arxiv.org/pdf/2601.10592
👉Repo github.com/facebookresearch/Action100M

🔥10👍2👏2❤1

4.74K viewsedited 15:45

AI with Papers - Artificial Intelligence & Deep Learning

0:46

This media is not supported in your browser

VIEW IN TELEGRAM

💜Interactive Humanoid Generation💜

👉FlowAct-R1 by ByteDance is a novel framework that enables lifelike, responsive, and high-fidelity humanoid video generation for seamless real-time interaction. No code but impressive results (see video with audio) 💙

👉Review https://t.ly/aQhol
👉Paper arxiv.org/pdf/2601.10103
👉Project grisoon.github.io/FlowAct-R1/

❤10🤯6🔥2👏1

4.64K views07:54

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

💢3D Human Gen-Seg💢

👉CoMoVi takes an input image with a text description and generates 3D human motion & video sequence synchronously within a single diffusion denoising loop. Repo & Dataset releasing💙

👉Review https://t.ly/khSkm
👉Paper arxiv.org/pdf/2601.10632
👉Project igl-hkust.github.io/CoMoVi/
👉Repo github.com/IGL-HKUST/CoMoVi
👉Data huggingface.co/datasets/AfterJourney/CoMoVi-Dataset

🔥3❤1

4.27K views07:57

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

👹SOTA Part-level Generator👹

👉A novel a text-to-motion model that learns to compose complex motions through hierarchical conditioning on part-, action- & sequence-level text, enabling fine-grained control over body parts & timing. Code, models & Dataset to be released💙

👉Review https://t.ly/leB_R
👉Paper arxiv.org/pdf/2601.10909
👉Project coral79.github.io/frankenmotion/
👉Repo github.com/Coral79/FrankenMotion-Code

❤3🔥2👏1

4.63K views12:57

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

💚 #META 3D Casual Captures 💚

👉#META unveils ShapeR, a novel approach for conditional 3D object shape generation from casually captured sequences. Impressive results. Repo under CC BY-NC 4.0💙

👉Review https://t.ly/j08sJ
👉Paper arxiv.org/pdf/2601.11514
👉Project facebookresearch.github.io/ShapeR/
👉Repo github.com/facebookresearch/ShapeR

🔥7❤4👏1

4.59K views07:49

AI with Papers - Artificial Intelligence & Deep Learning

💊Foundation Medical SAM3 💊

👉Medical SAM3: foundation model for universal prompt-driven medical image segmentation, by fully fine-tuning SAM3 on large-scale, heterogeneous 2D/3D medical imaging datasets with paired segmentation masks-text prompts. Repo & Demo announced💙

👉Review https://t.ly/C6jcy
👉Paper https://arxiv.org/pdf/2601.10880
👉Project chongcongjiang.github.io/MedicalSAM3/#
👉Repo github.com/AIM-Research-Lab/Medical-SAM3

❤13🔥3👍2👏1

5.34K views12:54

AI with Papers - Artificial Intelligence & Deep Learning

0:03

This media is not supported in your browser

VIEW IN TELEGRAM

🦧Mask-Guided Matting🦧

👉VideoMaMa is novel a diffusion-based model that converts binary masks into continuous alpha mattes. Repo, Dataset & Demo💙

👉Review https://t.ly/l_0f8
👉Paper arxiv.org/pdf/2601.14255
👉Project cvlab-kaist.github.io/VideoMaMa
👉Repo github.com/cvlab-kaist/VideoMaMa
👉Demo huggingface.co/spaces/SammyLim/VideoMaMa

❤5🔥2👍1

5.2K viewsedited 11:00

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

💜MoRo: Human Motion💜

👉Masked modeling for human motion Recovery under Occlusions. Given a monocular video captured from a static camera, MoRo (by ETHZ & META) robustly reconstructs accurate/physically plausible human motion, even under challenging occlusions. Repo released💙

👉Review https://t.ly/kK_je
👉Paper arxiv.org/pdf/2601.16079
👉Project mikeqzy.github.io/MoRo/
👉Repo github.com/mikeqzy/MoRo

❤6👏1

4.29K viewsedited 12:47

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔥 BBoxMaskPose v2 is fire 🔥

👉BBoxMaskPose v2 by ČVUT offers SOTA performance in detection, segmentation & 2D pose in crowded scenes. It enables 3D human reconstruction even in scenes with complex interactions. Code, Models & data available💙

👉Review https://t.ly/GkkDl
👉Paper arxiv.org/pdf/2601.15200
👉Project https://lnkd.in/dQ_3hxjC
👉Repo https://lnkd.in/dVqwD3jN

❤6👍3👏1

4.41K views12:52

AI with Papers - Artificial Intelligence & Deep Learning

0:03

This media is not supported in your browser

VIEW IN TELEGRAM

🦠Generalized-Scale Counting🦠

👉GeCo2 (Ljubljana) is a novel e2e SOTA few-shot method that explicitly addresses the object scale issues. Repo & Demo 💙

👉Review https://t.ly/2_7I8
👉Paper https://arxiv.org/pdf/2511.08048
👉Repo https://github.com/jerpelhan/GECO2
👉Demo huggingface.co/spaces/jerpelhan/GECO2-demo

👍11❤1🔥1

4.64K viewsedited 12:47

About

Blog

Apps

Platform