This media is not supported in your browser
VIEW IN TELEGRAM
🍡 Animate Anyone: new SOTA! 🍡
👉Alibaba unveils Animate Anyone: novel #AI for transforming character images into animated videos controlled by desired pose sequences. Animating any character image into a video, unconstrained by specific domains 🚀
👉Review https://t.ly/qCahZ
👉Paper https://lnkd.in/d-zi8EZ6
👉Project https://lnkd.in/djwjQRvq
👉Repo https://lnkd.in/dDMkjnKz
👉Alibaba unveils Animate Anyone: novel #AI for transforming character images into animated videos controlled by desired pose sequences. Animating any character image into a video, unconstrained by specific domains 🚀
👉Review https://t.ly/qCahZ
👉Paper https://lnkd.in/d-zi8EZ6
👉Project https://lnkd.in/djwjQRvq
👉Repo https://lnkd.in/dDMkjnKz
🤯22👍8🔥4⚡1❤1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🔎 Generative Powers of Ten 🔍
👉A text-to-image model to generate consistent content across multiple image scales, enabling extreme semantic zooms into a scene. From universe to a human cell 🤯
👉Review https://t.ly/2DG44
👉Paper https://lnkd.in/eDcSpU59
👉Project https://lnkd.in/e6NKu8n9
👉A text-to-image model to generate consistent content across multiple image scales, enabling extreme semantic zooms into a scene. From universe to a human cell 🤯
👉Review https://t.ly/2DG44
👉Paper https://lnkd.in/eDcSpU59
👉Project https://lnkd.in/e6NKu8n9
🤯21❤4🔥3👏2😱1
Hello everybody,
a lot of you asked me to re-open the sharing of the contents to involve more people. I want to follow your suggestion, hope you will enjoy this new mood!
👍 FREE TO FORWARD TO OTHER TELEGRAM CHANNELS
🔥 NO COPY OF THE POSTS
🔥 NO COMMERCIAL USAGE
🔥 NO UNRESPECTFUL USAGE
⚠️ UNDO THE FORWARDING OPTION AT THE FIRST VIOLATION ⚠️
a lot of you asked me to re-open the sharing of the contents to involve more people. I want to follow your suggestion, hope you will enjoy this new mood!
👍 FREE TO FORWARD TO OTHER TELEGRAM CHANNELS
🔥 NO COPY OF THE POSTS
🔥 NO COMMERCIAL USAGE
🔥 NO UNRESPECTFUL USAGE
⚠️ UNDO THE FORWARDING OPTION AT THE FIRST VIOLATION ⚠️
❤19👍10👏3🥰1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🩰 Magic Animating Human 🩰
👉MagicAnimate: the new SOTA in human animation. Code available: let's dance!
👉Review https://t.ly/Oq7Za
👉Paper https://lnkd.in/dSUbGgCs
👉Project https://lnkd.in/dkVFf-SV
👉Code https://lnkd.in/dj2dbzdg
👉Demo https://lnkd.in/dHEKPE9q
👉MagicAnimate: the new SOTA in human animation. Code available: let's dance!
👉Review https://t.ly/Oq7Za
👉Paper https://lnkd.in/dSUbGgCs
👉Project https://lnkd.in/dkVFf-SV
👉Code https://lnkd.in/dj2dbzdg
👉Demo https://lnkd.in/dHEKPE9q
🤯6❤2👍1🔥1🥰1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥 EfficientSAM: 20x faster Segment Anything 🔥
👉Meta AI Research unveils a novel family of SAM-like models, light-weight SAM models with SOTA quality-efficiency trade-offs. Up to 20x faster!
👉Review https://t.ly/966QS
👉Paper https://lnkd.in/duijp_Rh
👉Project https://lnkd.in/dW-p2CuH
👉Code https://lnkd.in/dAbZaB2t
👉Demo https://lnkd.in/d-tjKiUd
👉Meta AI Research unveils a novel family of SAM-like models, light-weight SAM models with SOTA quality-efficiency trade-offs. Up to 20x faster!
👉Review https://t.ly/966QS
👉Paper https://lnkd.in/duijp_Rh
👉Project https://lnkd.in/dW-p2CuH
👉Code https://lnkd.in/dAbZaB2t
👉Demo https://lnkd.in/d-tjKiUd
🔥15❤4👍4🤯2
This media is not supported in your browser
VIEW IN TELEGRAM
🫶3D Hands with Transformers🫶
👉 HaMeR is a robust and accurate Hand Mesh Recovery from images and video frames, based on Transformer architecture. It's the new SOTA.
👉Review https://t.ly/YtAW8
👉Paper https://arxiv.org/pdf/2312.05251.pdf
👉Project https://geopavlakos.github.io/hamer
👉Demo huggingface.co/spaces/geopavlakos/HaMeR
👉Colab colab.research.google.com/drive/1rQbQzegFWGVOm1n1d-S6koOWDo7F2ucu
👉 HaMeR is a robust and accurate Hand Mesh Recovery from images and video frames, based on Transformer architecture. It's the new SOTA.
👉Review https://t.ly/YtAW8
👉Paper https://arxiv.org/pdf/2312.05251.pdf
👉Project https://geopavlakos.github.io/hamer
👉Demo huggingface.co/spaces/geopavlakos/HaMeR
👉Colab colab.research.google.com/drive/1rQbQzegFWGVOm1n1d-S6koOWDo7F2ucu
👍10❤1👏1🤯1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🪩 DreaMoving: Human Dancer 🪩
👉Alibaba strikes again with DreaMoving: a diffusion-based controllable video generation framework to produce HQ customized human videos.
👉Review https://t.ly/BD_Yf
👉Paper https://lnkd.in/gepP6Rjw
👉Project https://lnkd.in/gwm72cfS
👉Repo (empty) https://lnkd.in/gsc2Qt-F
👉Alibaba strikes again with DreaMoving: a diffusion-based controllable video generation framework to produce HQ customized human videos.
👉Review https://t.ly/BD_Yf
👉Paper https://lnkd.in/gepP6Rjw
👉Project https://lnkd.in/gwm72cfS
👉Repo (empty) https://lnkd.in/gsc2Qt-F
👍7💩6❤2🥰1
This media is not supported in your browser
VIEW IN TELEGRAM
📲 EdgeSAM: Mobile 40x SAM 📲
👉A novel hyper-optimized version of SAM for mobile devices such as #Iphone. Pure CNNs backbone (better suitable for ANE), up to 40x faster. Code available 😉
👉Review https://t.ly/m_vLH
👉Paper https://lnkd.in/gHZVZN2x
👉Project https://lnkd.in/gK8qEK8p
👉Repo https://lnkd.in/gj6YAGNv
👉Hugging Face https://lnkd.in/gUUHJvxz
👉A novel hyper-optimized version of SAM for mobile devices such as #Iphone. Pure CNNs backbone (better suitable for ANE), up to 40x faster. Code available 😉
👉Review https://t.ly/m_vLH
👉Paper https://lnkd.in/gHZVZN2x
👉Project https://lnkd.in/gK8qEK8p
👉Repo https://lnkd.in/gj6YAGNv
👉Hugging Face https://lnkd.in/gUUHJvxz
🔥20⚡2❤2🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🪼PatchFusion: SOTA Mono-Depth🪼
👉PatchFusion: novel end-to-end tile-based framework for hi-res monocular metric depth estimation. It's the new SOTA in metric depth estimation from mono. Code & Demo on Hugging Face able 🔥
👉Review https://t.ly/hv3yT
👉Paper https://lnkd.in/d9dXP7iP
👉Project https://lnkd.in/dQcvVJSx
👉Repo https://lnkd.in/dW2GdVR5
👉Demo https://lnkd.in/dFW-gAiY
👉PatchFusion: novel end-to-end tile-based framework for hi-res monocular metric depth estimation. It's the new SOTA in metric depth estimation from mono. Code & Demo on Hugging Face able 🔥
👉Review https://t.ly/hv3yT
👉Paper https://lnkd.in/d9dXP7iP
👉Project https://lnkd.in/dQcvVJSx
👉Repo https://lnkd.in/dW2GdVR5
👉Demo https://lnkd.in/dFW-gAiY
🔥10❤5👏1🤯1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
💃Outfit Anyone: Ultra-HQ VTO💃
👉Alibaba unveils Outfit Anyone: a two-stream conditional diffusion able to adeptly handle garment deformation for more lifelike results in VOT. Extra: Outfit Anyone + Animate Anyone for outfit + motion generation of any character. NO CODE / NO PAPER / DEMO AVAILABLE :)
👉Review https://t.ly/o6UR9
👉Demo https://lnkd.in/dpQYdXhc
👉Repo (empty) https://lnkd.in/dBsNST6r
👉Alibaba unveils Outfit Anyone: a two-stream conditional diffusion able to adeptly handle garment deformation for more lifelike results in VOT. Extra: Outfit Anyone + Animate Anyone for outfit + motion generation of any character. NO CODE / NO PAPER / DEMO AVAILABLE :)
👉Review https://t.ly/o6UR9
👉Demo https://lnkd.in/dpQYdXhc
👉Repo (empty) https://lnkd.in/dBsNST6r
🤯10👍4❤3🔥2
🔥 #AIwithPapers: we are 8k+ 🔥
👉 After flirting with #ChatGpt for months, you back in love with this channel. I felt bad, but I forgive you 🧡
😈 Hey Telegram Premium Subscribers, what about boosting us? Click: https://xn--r1a.website/AI_DeepLearning?boost
😈 Invite -> https://xn--r1a.website/AI_DeepLearning
👉 After flirting with #ChatGpt for months, you back in love with this channel. I felt bad, but I forgive you 🧡
😈 Hey Telegram Premium Subscribers, what about boosting us? Click: https://xn--r1a.website/AI_DeepLearning?boost
😈 Invite -> https://xn--r1a.website/AI_DeepLearning
❤16🤣7🔥1🥰1
This media is not supported in your browser
VIEW IN TELEGRAM
🧊 Depth Conditioning 🧊
👉LooseControl to control the generative image modeling process. Layout by boundaries and #3D box control via object locations (approximate bounding boxes)
👉Review https://t.ly/9y72m
👉Paper https://arxiv.org/pdf/2312.03079.pdf
👉Project https://shariqfarooq123.github.io/loose-control/
👉Repo https://github.com/shariqfarooq123/LooseControl
👉LooseControl to control the generative image modeling process. Layout by boundaries and #3D box control via object locations (approximate bounding boxes)
👉Review https://t.ly/9y72m
👉Paper https://arxiv.org/pdf/2312.03079.pdf
👉Project https://shariqfarooq123.github.io/loose-control/
👉Repo https://github.com/shariqfarooq123/LooseControl
🔥14❤6🤯4👍1🥰1
This media is not supported in your browser
VIEW IN TELEGRAM
🖲️ Amodal Tracking Any Object 🖲️
👉Amodal tracking": inferring complete object boundaries, even when certain portions are occluded. New benchmark & approach, 2x better than SOTA in people tracking 🔥
👉Review https://t.ly/Rc6Ku
👉Paper https://lnkd.in/d39rFYT4
👉Project https://lnkd.in/d7bkEcni
👉(empty) Repo https://lnkd.in/dTsNKdfz
👉Amodal tracking": inferring complete object boundaries, even when certain portions are occluded. New benchmark & approach, 2x better than SOTA in people tracking 🔥
👉Review https://t.ly/Rc6Ku
👉Paper https://lnkd.in/d39rFYT4
👉Project https://lnkd.in/d7bkEcni
👉(empty) Repo https://lnkd.in/dTsNKdfz
❤16🤯8🔥3👍2👏1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🚿 Event-Cam (1000 fps) Hands 🚿
👉Ev2Hands, the first method for the 3D reconstruction of two interacting hands from a single event camera. Code available.
👉Review https://t.ly/YpQpX
👉Paper arxiv.org/pdf/2312.14157.pdf
👉Project 4dqv.mpi-inf.mpg.de/Ev2Hands
👉Repo github.com/Chris10M/Ev2Hands
👉Ev2Hands, the first method for the 3D reconstruction of two interacting hands from a single event camera. Code available.
👉Review https://t.ly/YpQpX
👉Paper arxiv.org/pdf/2312.14157.pdf
👉Project 4dqv.mpi-inf.mpg.de/Ev2Hands
👉Repo github.com/Chris10M/Ev2Hands
🔥3❤2👍2👏1
This media is not supported in your browser
VIEW IN TELEGRAM
🎄UniSDF: Unifying Neural Representations🎄
👉UniSDF: novel general purpose 3D reconstruction for large complex scenes with reflections. SOTA on DTU, Shiny Blender, Mip-NeRF 360 and Ref-NeRF dataset.
👉Review https://t.ly/2QEul
👉Paper https://arxiv.org/pdf/2312.13285.pdf
👉Project https://fangjinhuawang.github.io/UniSDF/
👉Repo: No code :(
👉UniSDF: novel general purpose 3D reconstruction for large complex scenes with reflections. SOTA on DTU, Shiny Blender, Mip-NeRF 360 and Ref-NeRF dataset.
👉Review https://t.ly/2QEul
👉Paper https://arxiv.org/pdf/2312.13285.pdf
👉Project https://fangjinhuawang.github.io/UniSDF/
👉Repo: No code :(
🔥7👍2❤1🥰1🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
🪮HAAR: Text-Driven Generative Hairstyles🪮
👉 HAAR: new strand-based generative model for #3D human hairstyles driven by textual input.
👉Review https://t.ly/L38iD
👉Project https://haar.is.tue.mpg.de/
👉Paper https://arxiv.org/pdf/2312.11666.pdf
👉Repo coming
👉 HAAR: new strand-based generative model for #3D human hairstyles driven by textual input.
👉Review https://t.ly/L38iD
👉Project https://haar.is.tue.mpg.de/
👉Paper https://arxiv.org/pdf/2312.11666.pdf
👉Repo coming
🤯4🍾3👍2🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
🪲UniRef++: Segment Every Reference🪲
👉 UniRef++ is a unified model for RIS, FSS, RVOS & VOS. Code available!
👉Review https://t.ly/OxtOx
👉Paper https://lnkd.in/eTrmDTK3
👉Repo https://lnkd.in/etfTm4Wq
👉 UniRef++ is a unified model for RIS, FSS, RVOS & VOS. Code available!
👉Review https://t.ly/OxtOx
👉Paper https://lnkd.in/eTrmDTK3
👉Repo https://lnkd.in/etfTm4Wq
👍11❤3🤯3⚡1
This media is not supported in your browser
VIEW IN TELEGRAM
🈚 Seeing Through Occlusions 🈚
👉Novel NSF to see through occlusions, reflection suppression & shadow removal.
👉Review https://t.ly/5jcIG
👉Project https://light.princeton.edu/publication/nsf
👉Paper https://arxiv.org/pdf/2312.14235.pdf
👉Repo https://github.com/princeton-computational-imaging/NSF
👉Novel NSF to see through occlusions, reflection suppression & shadow removal.
👉Review https://t.ly/5jcIG
👉Project https://light.princeton.edu/publication/nsf
👉Paper https://arxiv.org/pdf/2312.14235.pdf
👉Repo https://github.com/princeton-computational-imaging/NSF
❤11🤯7🔥3🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
👻 Avatar Behind Occlusions 👻
👉Neural rendering for occluded in-the-wild mono-videos. Decoupling scenes in occlusion, human, and background.
👉Review https://t.ly/8q__B
👉Paper https://arxiv.org/pdf/2401.00431.pdf
👉Project https://cs.stanford.edu/~xtiange/projects/wild2avatar
👉Neural rendering for occluded in-the-wild mono-videos. Decoupling scenes in occlusion, human, and background.
👉Review https://t.ly/8q__B
👉Paper https://arxiv.org/pdf/2401.00431.pdf
👉Project https://cs.stanford.edu/~xtiange/projects/wild2avatar
🔥11❤3👏1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🕍 En3D: Generative 3D Humans 🕍
👉#Alibaba unveils En3D: generative scheme for sculpting HQ 3D human avatars. Zero-shot 3D generative scheme capable of producing visually realistic, geometrically accurate and content-wise diverse 3D humans without relying on pre-existing 3D or 2D asset.
👉Review https://t.ly/nGmDK
👉Project menyifang.github.io/projects/En3D/index.html
👉Paper https://arxiv.org/pdf/2401.01173.pdf
👉Repo (soon?) https://github.com/menyifang/En3D
👉#Alibaba unveils En3D: generative scheme for sculpting HQ 3D human avatars. Zero-shot 3D generative scheme capable of producing visually realistic, geometrically accurate and content-wise diverse 3D humans without relying on pre-existing 3D or 2D asset.
👉Review https://t.ly/nGmDK
👉Project menyifang.github.io/projects/En3D/index.html
👉Paper https://arxiv.org/pdf/2401.01173.pdf
👉Repo (soon?) https://github.com/menyifang/En3D
🤯5❤3🔥1