This media is not supported in your browser
VIEW IN TELEGRAM
👌HandX: Scaling Hands Motion👌
👉 HandX is a unified foundation spanning data, annotation, and evaluation: novel large-scale dataset of bimanual & dexterous motions with fine-grained textual. Around 6M frames. Repo available💙
👉Review https://t.ly/1nGxw
👉Paper https://arxiv.org/pdf/2603.28766
👉Project https://handx-project.github.io/
👉Repo github.com/handx-project/HandX
👉 HandX is a unified foundation spanning data, annotation, and evaluation: novel large-scale dataset of bimanual & dexterous motions with fine-grained textual. Around 6M frames. Repo available💙
👉Review https://t.ly/1nGxw
👉Paper https://arxiv.org/pdf/2603.28766
👉Project https://handx-project.github.io/
👉Repo github.com/handx-project/HandX
🔥9❤2👏1
This media is not supported in your browser
VIEW IN TELEGRAM
🌵SOTA Training-Free In-Context Segmentation🌵
👉INSID3 is the new SOTA, training-free approach that segments concepts at varying granularities only from frozen DINOv3 features, given an in-context example. Repo under Apache 2.0💙
👉Review https://t.ly/NVWHN
👉Paper arxiv.org/pdf/2603.28480
👉Project visinf.github.io/INSID3/
👉Repo github.com/visinf/INSID3
👉INSID3 is the new SOTA, training-free approach that segments concepts at varying granularities only from frozen DINOv3 features, given an in-context example. Repo under Apache 2.0💙
👉Review https://t.ly/NVWHN
👉Paper arxiv.org/pdf/2603.28480
👉Project visinf.github.io/INSID3/
👉Repo github.com/visinf/INSID3
❤16🔥2🤩2👍1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🪬Camera Raw Image Generation🪬
👉RawGen by #Samsung is a generative approach that learns the complex distribution of raw sensor data directly, enabling high-fidelity generation from either text descriptions or standard sRGB images across arbitrary camera sensors. Linear raw image once, then apply any ISP operation. Repo announced💙
👉Review https://t.ly/_QVKP
👉Paper https://arxiv.org/pdf/2604.00093
👉Project https://dy112.github.io/rawgen-page/
👉Repo TBA
👉RawGen by #Samsung is a generative approach that learns the complex distribution of raw sensor data directly, enabling high-fidelity generation from either text descriptions or standard sRGB images across arbitrary camera sensors. Linear raw image once, then apply any ISP operation. Repo announced💙
👉Review https://t.ly/_QVKP
👉Paper https://arxiv.org/pdf/2604.00093
👉Project https://dy112.github.io/rawgen-page/
👉Repo TBA
❤4🔥2👍1
If you have to invest TODAY 1B$ on a frontier tech for the next decade, would you invest in space, agentic, quantum or frugal GPUs? Vote here: https://t.ly/hSx6i
🤣3❤1🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
🍎Video Object Deletion🍎
👉Void by Netflix is a novel video object removal framework designed to perform physically-plausible inpainting in very complex scenarios. Repo under Apache 2.0💙
👉Review https://t.ly/cMVny
👉Paper https://arxiv.org/pdf/2604.02296
👉Project https://void-model.github.io/
👉Repo https://github.com/Netflix/void-model
👉Void by Netflix is a novel video object removal framework designed to perform physically-plausible inpainting in very complex scenarios. Repo under Apache 2.0💙
👉Review https://t.ly/cMVny
👉Paper https://arxiv.org/pdf/2604.02296
👉Project https://void-model.github.io/
👉Repo https://github.com/Netflix/void-model
❤4🤯3👍1👏1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥Vanast: VTON w/ Human Animation🔥
👉SNU unveils a novel unified framework that generates garment-transferred human animation videos directly from a single human/garment images, and pose guidance clip. Repo announced💙
👉Review https://t.ly/c0t79
👉Paper arxiv.org/pdf/2604.04934
👉Project hyunsoocha.github.io/vanast/
👉Repo github.com/snuvclab/vanast
👉SNU unveils a novel unified framework that generates garment-transferred human animation videos directly from a single human/garment images, and pose guidance clip. Repo announced💙
👉Review https://t.ly/c0t79
👉Paper arxiv.org/pdf/2604.04934
👉Project hyunsoocha.github.io/vanast/
👉Repo github.com/snuvclab/vanast
❤6👍2🔥1🤯1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥BoxerNet: SOTA 2D->3D BBs🔥
👉Boxer by META: transformer-based network to lift 2D BB proposals into 3D, followed by multi-view fusion and geometric filtering to produce globally consistent de-duplicated 3DBBs in metric world space. Repo under A-NC 4.0 International💙
👉Review https://t.ly/mlmV1
👉Paper https://arxiv.org/pdf/2604.05212
👉Project facebookresearch.github.io/boxer/
👉Repo github.com/facebookresearch/boxer
👉Boxer by META: transformer-based network to lift 2D BB proposals into 3D, followed by multi-view fusion and geometric filtering to produce globally consistent de-duplicated 3DBBs in metric world space. Repo under A-NC 4.0 International💙
👉Review https://t.ly/mlmV1
👉Paper https://arxiv.org/pdf/2604.05212
👉Project facebookresearch.github.io/boxer/
👉Repo github.com/facebookresearch/boxer
🤯9👍1🔥1
Media is too big
VIEW IN TELEGRAM
Here the preview, tomorrow the full clip from official source :)
❤5🔥1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🪞1.1M Metric VTON Dataset🪞
👉Google's Fit-Inclusive Try-on: large-scale VTO dataset comprising over 1.13M try-on image triplets accompanied by precise body and garment measurements. Repo & dataset announced💙
👉Review https://t.ly/cs-pt
👉Paper arxiv.org/pdf/2604.08526
👉Project johannakarras.github.io/FIT/
👉Repo TBA
👉Google's Fit-Inclusive Try-on: large-scale VTO dataset comprising over 1.13M try-on image triplets accompanied by precise body and garment measurements. Repo & dataset announced💙
👉Review https://t.ly/cs-pt
👉Paper arxiv.org/pdf/2604.08526
👉Project johannakarras.github.io/FIT/
👉Repo TBA
🔥6❤2👍1
🐞6D Object Pose w/ Deformation🐞
👉DeSOPE by Xidian & #MagicLeap is a novel large-scale dataset for 6DoF deformed objects: 665K pose annotations produced via a semiautomatic pipeline. Repo & Dataset announced💙
👉Review https://t.ly/M5VgX
👉Paper https://arxiv.org/pdf/2604.06720
👉Project https://desope-6d.github.io/
👉Repo TBA
👉DeSOPE by Xidian & #MagicLeap is a novel large-scale dataset for 6DoF deformed objects: 665K pose annotations produced via a semiautomatic pipeline. Repo & Dataset announced💙
👉Review https://t.ly/M5VgX
👉Paper https://arxiv.org/pdf/2604.06720
👉Project https://desope-6d.github.io/
👉Repo TBA
🔥6❤2👏1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥SOTA 3D Detection in the wild🔥
👉WildDet3D is a novel unified geometry-aware architecture for 3D detection that natively accepts text, point, and box prompts and can incorporate auxiliary depth signals at inference time. New SOTA! Repo, models and iphone 💙
👉Review https://t.ly/8NxBN
👉Paper arxiv.org/pdf/2604.08626
👉Project allenai.github.io/WildDet3D/
👉Repo github.com/allenai/WildDet3D
👉WildDet3D is a novel unified geometry-aware architecture for 3D detection that natively accepts text, point, and box prompts and can incorporate auxiliary depth signals at inference time. New SOTA! Repo, models and iphone 💙
👉Review https://t.ly/8NxBN
👉Paper arxiv.org/pdf/2604.08626
👉Project allenai.github.io/WildDet3D/
👉Repo github.com/allenai/WildDet3D
❤3🔥3🤯1