This media is not supported in your browser
VIEW IN TELEGRAM
πMaterial-Aware Groupingπ
πMaterial Magic Wand (Adobe) is a tool for material-aware grouping of parts in untextured 3D meshes. Given one selected part, it automatically retrieves the other parts in the same shape by its material. Repo announcedπ
πReview https://t.ly/q00SU
πPaper https://arxiv.org/pdf/2603.17370
πProject umangi-jain.github.io/material-magic-wand/
πRepo TBA
πMaterial Magic Wand (Adobe) is a tool for material-aware grouping of parts in untextured 3D meshes. Given one selected part, it automatically retrieves the other parts in the same shape by its material. Repo announcedπ
πReview https://t.ly/q00SU
πPaper https://arxiv.org/pdf/2603.17370
πProject umangi-jain.github.io/material-magic-wand/
πRepo TBA
π₯4
This media is not supported in your browser
VIEW IN TELEGRAM
π¦ͺOccAny: Universal 3D Occupancyπ¦ͺ
πOccAny by Valeo is a novel unified framework for generalized unconstrained urban 3D occupancy prediction. Repo under Apache 2.0π
πReview https://t.ly/FFiU0
πPaper https://arxiv.org/pdf/2603.23502
πProject https://valeoai.github.io/OccAny/
πRepo https://github.com/valeoai/OccAny
πOccAny by Valeo is a novel unified framework for generalized unconstrained urban 3D occupancy prediction. Repo under Apache 2.0π
πReview https://t.ly/FFiU0
πPaper https://arxiv.org/pdf/2603.23502
πProject https://valeoai.github.io/OccAny/
πRepo https://github.com/valeoai/OccAny
π₯6π2β€1
This media is not supported in your browser
VIEW IN TELEGRAM
πPose-Appearance-Motion for HOIπ
πPAM is a novel PoseβAppearanceβMotion Engine for controllable HandβObject Interaction SOTA video generation. Repo/models availableπ
πReview https://t.ly/JU4MD
πPaper arxiv.org/pdf/2603.22193
πProject gasaiyu.github.io/PAM.github.io/
πRepo https://github.com/GasaiYU/PAM
πPAM is a novel PoseβAppearanceβMotion Engine for controllable HandβObject Interaction SOTA video generation. Repo/models availableπ
πReview https://t.ly/JU4MD
πPaper arxiv.org/pdf/2603.22193
πProject gasaiyu.github.io/PAM.github.io/
πRepo https://github.com/GasaiYU/PAM
β€7π2π₯2
Please open Telegram to view this post
VIEW IN TELEGRAM
This media is not supported in your browser
VIEW IN TELEGRAM
π₯ GaussianGPT 3D GSCπ₯
πFrom TUM, GaussianGPT: transformer-based 3D Gaussians generation via next-token prediction -> full 3D complex indoor scene. Repo announcedπ
πReview https://t.ly/bj-lL
πPaper arxiv.org/pdf/2603.26661
πProject nicolasvonluetzow.github.io/GaussianGPT/
πRepo TBA
πFrom TUM, GaussianGPT: transformer-based 3D Gaussians generation via next-token prediction -> full 3D complex indoor scene. Repo announcedπ
πReview https://t.ly/bj-lL
πPaper arxiv.org/pdf/2603.26661
πProject nicolasvonluetzow.github.io/GaussianGPT/
πRepo TBA
π₯8β€2π1π1
This media is not supported in your browser
VIEW IN TELEGRAM
πHandX: Scaling Hands Motionπ
π HandX is a unified foundation spanning data, annotation, and evaluation: novel large-scale dataset of bimanual & dexterous motions with fine-grained textual. Around 6M frames. Repo availableπ
πReview https://t.ly/1nGxw
πPaper https://arxiv.org/pdf/2603.28766
πProject https://handx-project.github.io/
πRepo github.com/handx-project/HandX
π HandX is a unified foundation spanning data, annotation, and evaluation: novel large-scale dataset of bimanual & dexterous motions with fine-grained textual. Around 6M frames. Repo availableπ
πReview https://t.ly/1nGxw
πPaper https://arxiv.org/pdf/2603.28766
πProject https://handx-project.github.io/
πRepo github.com/handx-project/HandX
π₯9β€2π1
This media is not supported in your browser
VIEW IN TELEGRAM
π΅SOTA Training-Free In-Context Segmentationπ΅
πINSID3 is the new SOTA, training-free approach that segments concepts at varying granularities only from frozen DINOv3 features, given an in-context example. Repo under Apache 2.0π
πReview https://t.ly/NVWHN
πPaper arxiv.org/pdf/2603.28480
πProject visinf.github.io/INSID3/
πRepo github.com/visinf/INSID3
πINSID3 is the new SOTA, training-free approach that segments concepts at varying granularities only from frozen DINOv3 features, given an in-context example. Repo under Apache 2.0π
πReview https://t.ly/NVWHN
πPaper arxiv.org/pdf/2603.28480
πProject visinf.github.io/INSID3/
πRepo github.com/visinf/INSID3
β€16π₯2π€©2π1πΎ1
This media is not supported in your browser
VIEW IN TELEGRAM
πͺ¬Camera Raw Image Generationπͺ¬
πRawGen by #Samsung is a generative approach that learns the complex distribution of raw sensor data directly, enabling high-fidelity generation from either text descriptions or standard sRGB images across arbitrary camera sensors. Linear raw image once, then apply any ISP operation. Repo announcedπ
πReview https://t.ly/_QVKP
πPaper https://arxiv.org/pdf/2604.00093
πProject https://dy112.github.io/rawgen-page/
πRepo TBA
πRawGen by #Samsung is a generative approach that learns the complex distribution of raw sensor data directly, enabling high-fidelity generation from either text descriptions or standard sRGB images across arbitrary camera sensors. Linear raw image once, then apply any ISP operation. Repo announcedπ
πReview https://t.ly/_QVKP
πPaper https://arxiv.org/pdf/2604.00093
πProject https://dy112.github.io/rawgen-page/
πRepo TBA
β€3π₯2π1
If you have to invest TODAY 1B$ on a frontier tech for the next decade, would you invest in space, agentic, quantum or frugal GPUs? Vote here: https://t.ly/hSx6i
π€£3β€1π₯1
This media is not supported in your browser
VIEW IN TELEGRAM
πVideo Object Deletionπ
πVoid by Netflix is a novel video object removal framework designed to perform physically-plausible inpainting in very complex scenarios. Repo under Apache 2.0π
πReview https://t.ly/cMVny
πPaper https://arxiv.org/pdf/2604.02296
πProject https://void-model.github.io/
πRepo https://github.com/Netflix/void-model
πVoid by Netflix is a novel video object removal framework designed to perform physically-plausible inpainting in very complex scenarios. Repo under Apache 2.0π
πReview https://t.ly/cMVny
πPaper https://arxiv.org/pdf/2604.02296
πProject https://void-model.github.io/
πRepo https://github.com/Netflix/void-model
β€3π€―2π1π1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯Vanast: VTON w/ Human Animationπ₯
πSNU unveils a novel unified framework that generates garment-transferred human animation videos directly from a single human/garment images, and pose guidance clip. Repo announcedπ
πReview https://t.ly/c0t79
πPaper arxiv.org/pdf/2604.04934
πProject hyunsoocha.github.io/vanast/
πRepo github.com/snuvclab/vanast
πSNU unveils a novel unified framework that generates garment-transferred human animation videos directly from a single human/garment images, and pose guidance clip. Repo announcedπ
πReview https://t.ly/c0t79
πPaper arxiv.org/pdf/2604.04934
πProject hyunsoocha.github.io/vanast/
πRepo github.com/snuvclab/vanast
β€5π₯2π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯BoxerNet: SOTA 2D->3D BBsπ₯
πBoxer by META: transformer-based network to lift 2D BB proposals into 3D, followed by multi-view fusion and geometric filtering to produce globally consistent de-duplicated 3DBBs in metric world space. Repo under A-NC 4.0 Internationalπ
πReview https://t.ly/mlmV1
πPaper https://arxiv.org/pdf/2604.05212
πProject facebookresearch.github.io/boxer/
πRepo github.com/facebookresearch/boxer
πBoxer by META: transformer-based network to lift 2D BB proposals into 3D, followed by multi-view fusion and geometric filtering to produce globally consistent de-duplicated 3DBBs in metric world space. Repo under A-NC 4.0 Internationalπ
πReview https://t.ly/mlmV1
πPaper https://arxiv.org/pdf/2604.05212
πProject facebookresearch.github.io/boxer/
πRepo github.com/facebookresearch/boxer
π€―7π₯1
Media is too big
VIEW IN TELEGRAM
Here the preview, tomorrow the full clip from official source :)
β€3