3D Prompted Vision-LLM
#Nvidia unveils SR-3D, a novel vision-language model that connects single-view 2D images and multi-view 3D data through a shared visual token space. It supports flexible region prompting: users can annotate regions with bounding boxes or segmentation masks on any frame, or directly in 3D, without exhaustive multi-frame labeling. Code & dataset announced.
Review: https://t.ly/5Y2c5
Paper: https://arxiv.org/pdf/2509.13317
Project: https://www.anjiecheng.me/sr3d
Repo: TBA
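A rough idea of what the region prompting above could look like in practice. The repo is still TBA, so every name here is a hypothetical sketch, not the real API: the point is just that one bounding box on one frame is enough to specify a region query.

```python
# Hypothetical sketch of SR-3D-style region prompting (all names are
# assumptions; no official code has been released yet). A user marks a
# region with a 2D bounding box on a single frame, and the query asks
# the model to reason about that region across the multi-view scene.
from dataclasses import dataclass

@dataclass
class RegionPrompt:
    frame_idx: int    # any single frame; no multi-frame labeling needed
    bbox_2d: tuple    # (x1, y1, x2, y2) in pixel coordinates

def build_query(prompt: RegionPrompt, question: str) -> dict:
    """Pack a region prompt and a natural-language question into a request."""
    return {
        "region": {"frame": prompt.frame_idx, "bbox": prompt.bbox_2d},
        "question": question,
    }

query = build_query(RegionPrompt(frame_idx=12, bbox_2d=(40, 60, 180, 220)),
                    "How far is this object from the camera?")
```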
A few "leaks" for you from the #Nvidia presentation I'm at right now in Milan. Impressive stuff.
PS: sorry for the poor quality of the pics ♥️
Real-time Interactive Video
LONGLIVE by #Nvidia is a frame-level autoregressive framework for real-time, interactive long-video generation. It accepts sequential user prompts and generates the corresponding video in real time. Repo under a non-commercial license.
Review: https://t.ly/jJkdY
Paper: https://arxiv.org/pdf/2509.22622
Project: https://nvlabs.github.io/LongLive/
Repo: https://github.com/NVlabs/LongLive
Model: https://huggingface.co/Efficient-Large-Model/LongLive-1.3B
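The frame-level autoregressive loop with sequential prompts can be sketched as below. This is a minimal toy illustration of the generation pattern, not LONGLIVE's actual code; the function names and the stand-in model are my assumptions.

```python
# Minimal sketch of frame-level autoregressive generation with interactive
# prompt switching, in the spirit of LONGLIVE. All names are illustrative.
def generate_interactive(model, prompt_stream, frames_per_prompt=16):
    """Yield frames one at a time; each new user prompt redirects generation
    while the autoregressive context (frame history) carries over."""
    context = []                            # previously generated frames
    for prompt in prompt_stream:            # sequential user prompts
        for _ in range(frames_per_prompt):
            frame = model(prompt, context)  # next frame, conditioned on history
            context.append(frame)
            yield frame

# Toy stand-in model: a "frame" is just a (prompt, index) tag, so the
# interactive loop is runnable without any real video model.
toy_model = lambda p, ctx: (p, len(ctx))
frames = list(generate_interactive(toy_model, ["a cat", "the cat jumps"], 2))
# → [('a cat', 0), ('a cat', 1), ('the cat jumps', 2), ('the cat jumps', 3)]
```

The key property shown: switching prompts mid-stream does not reset the context, which is what makes the generation interactive rather than clip-by-clip.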
Foundational Humanoid
#NVIDIA unveils SONIC, a novel foundational humanoid model for high-precision teleoperation and interactive control (running, jumping, crawling) with natural, human-like movements. Code announced.
Review: https://t.ly/_3wnt
Paper: https://lnkd.in/dctfShu8
Project: https://lnkd.in/d_inmA2p