AI with Papers - Artificial Intelligence & Deep Learning – Telegram

AI with Papers - Artificial Intelligence & Deep Learning

@AI_DeepLearning

17.5K subscribers

157 photos

275 videos

14 files

1.43K links

All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#AI #chatGPT

Download Telegram

About

Blog

Apps

Platform

AI with Papers - Artificial Intelligence & Deep Learning

17.5K subscribers

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🧤 Two-Hand tracking via GCN 🧤

👉The first-ever GCN for two interacting hands in single RGB image

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Reconstruction by GCN mesh regression
✅PIFA: pyramid attention for local occlusion
✅CHA: cross hand attention for interaction
✅SOTA + generalization in-the-wild scenario
✅Source code available under GNU 🤯

More: https://bit.ly/3KH5FWO

👏10👍4🤯1

2.05K views13:57

AI with Papers - Artificial Intelligence & Deep Learning

AI with Papers - Artificial Intelligence & Deep Learning pinned a video

22:44

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🕹️Video K-Net, SOTA in Segmentation🕹️

👉Simple, strong, and unified framework for fully end-to-end video panoptic segmentation

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Learnable kernels from K-Net
✅K-Net learns to segment & track
✅Appearance / cross-T kernel interaction
✅New SOTA without bells and whistles 🤷‍♂️

More: https://bit.ly/3uEEZQR

👍6🔥1🤯1

1.97K views07:38

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🐭DeepLabCut: tracking animals in the wild🐭

👉A toolbox for markerless pose estimation of animals performing various tasks

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Multi-animal pose estimation
✅Datasets for multi-animal pose
✅Key-points, limbs, animal identity
✅Optimal key-points without input

More: https://bit.ly/37L1mLE

🔥6🤔4👏2🤯2❤1👍1😱1

2.11K views13:16

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍡Neural Articulated Human Body🍡

👉Novel neural implicit representation for articulated body

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅COmpositional Articulated People
✅Large variety of shapes & poses
✅Novel encoder-decoder architecture

More: https://bit.ly/3xvn7dl

👍4🥰2👏1

2.05K viewsedited 08:42

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🦚 2K Resolution Generative #AI 🦚

👉Novel continuous-scale training with variable output resolutions

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Mixed-resolution data
✅Arbitrary scales during training
✅Generations beyond 1024×1024
✅Variant of FID metric for scales
✅Source code under MIT license

More: https://bit.ly/3uNfVY6

🤯11👍2🔥2😱1🤩1

2.1K views09:52

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🐍DS Unsupervised Video Decomposition🐍

👉Novel method to extract persistent elements of a scene

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Scene element as Deformable Sprite (DS)
✅Deformable Sprites by video auto-encoder
✅Canonical texture image for appearance
✅Non-rigid geom. transformation

More: https://bit.ly/37WV9w1

👍4🤯3🔥1🥰1👏1😱1

2K viewsedited 08:13

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🥓 L-SVPE for Deep Deblurring 🥓

👉L-SVPE to deblur scenes while recovering high-freq details

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Learned Spatially Varying Pixel Exposures
✅Next-gen focal-plane sensor + DL
✅Deep conv decoder for motion deblurring
✅Superior results over non-optimized exp.

More: https://bit.ly/3uRYQMT

🤩7👍2🤔2🎉1

2K viewsedited 06:48

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🧧Hyper-Fast Instance Segmentation🧧

👉Novel Temporally Efficient Vision Transformer (TeViT) for VIS

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Video instance segmentation transformer
✅Contextual-info at frame/instance level
✅Nearly convolution-free framework 🤷‍♂️
✅The new SOTA for VIS, ~70 FPS!
✅Code & models under MIT license

More: https://bit.ly/3rCMXIn

🔥10👍3👏1🤯1

2.01K views12:29

AI with Papers - Artificial Intelligence & Deep Learning

📗Unified Scene Text/Layout Detection📗

👉World's first hierarchical scene text dataset + novel detection method

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Unified detection & geometric layout
✅Hierarchical annotations in natural scenes
✅Word, line, & paragraph level annotations
✅Source under CC Attribution Share Alike 4.0

More: https://bit.ly/3jRpezV

🔥3🤯2❤1👍1

1.99K views19:17

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🙌 #Oculus' new Hand Tracking 🙌

👉Hands are able to move as naturally and intuitively in the #metaverse as do in real life

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Hands2.0 powered by CV & ML
✅Tracking hand-over-hand interactions
✅Crossing hands, clapping, high-fives
✅Accurate thumbs-up gesture

More: https://bit.ly/3JXPvY2

🤯6❤4👍2👏1

2.01K viewsedited 06:26

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🎗️New SOTA in #3D human avatar🎗️

👉PHORHUM: photorealistic 3D human from mono-RGB

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Pixel-aligned method for 3D geometry
✅Unshaded surface color + illumination
✅Patch-based rendering losses for visible
✅Plausible color estimation for non-visible

More: https://bit.ly/3MkvBrA

🤯4👍2🥰2❤1

2.1K viewsedited 08:07

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

📟 What's in your hands (#3D) ? 📟

👉Reconstructing hand-held objects (from single RGB) without knowing their 3D templates🤷‍♂️

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Hand is highly predictive of object shape
✅Conditional-based on the articulation
✅Visual feats. / articulation-aware coords.
✅Code and models available!

More: https://bit.ly/3vuYn2a

👍9🤯2🥰1

2.07K views12:01

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔋YODO: You Only Demonstrate Once🔋

👉A novel category-level manipulation learned in sim from single demonstration video🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅One-shot IL, model-free 6D pose tracking
✅Demonstration BY single 3rd-person-view
✅manipulation including hi-precision tasks
✅Category-level Behavior Cloning
✅Attention for dynamic coords selection
✅Generalizability to novel unseen obj/env

More: https://bit.ly/3v0V4R4

🤯8❤3👍2😱2🤩2👏1

2.11K viewsedited 06:48

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

👗 Dress Code for Virtual Try-On 👗

👉UniMORE (+ YOOX) unveils a novel dataset/approach for virtual try-on.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Hi-Res paired front-view / full-body
✅Pixel-level Semantic-Aware Discriminator
✅9 SOTA VTON approaches / 3 baselines
✅New SOTA considering res. & garments

More: https://bit.ly/3xKXSUw

❤3👍3🔥1🤯1

2.17K viewsedited 14:17

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍃Deep Equilibrium for Optical Flow🍃

👉DEQ: converge faster, less memory, often more accurate

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel formulation of optical flow method
✅Compatible with prior modeling/data-related
✅Sparse fixed-point correction for stability
✅Code/models under GNU Affero GPL v3.0

More: https://bit.ly/3v4fZmi

👍3🥰2🤯1

2.09K viewsedited 07:28

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌳Ultra High-Resolution Neural Saliency🌳

👉A novel ultra high-resolution saliency detector with dataset!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Ultra Hi-Res Saliency Detection
✅5,920 pics at 4K-8K resolution
✅Pyramid Grafting Network
✅Cross-Model Grafting Module
✅AGL: Attention Guided Loss
✅Code/models under MIT

More: https://bit.ly/3MnU1Rf

❤6👍3🤯3🔥2🤩1

2.39K views10:39

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🪆StyleGAN-Human for fashion 🪆

👉A novel unconditional human generation based on StyleGAN is out!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅200,000+ labeled sample (pose/texture)
✅1024x512 StyleGAN-Human StyleGAN3
✅512x256 StyleGAN-Human StyleGAN1
✅Face model for downstream: InsetGAN
✅Source code and model available!

More: https://bit.ly/3xMg5B2

❤5👍4🔥3🤯1💩1

2.56K viewsedited 14:03

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

💀 OSSO: Skeletal Shape from Outside 💀

👉Anatomic skeleton of a person from 3D surface of body 🦴

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Max Planck + IMATI-CNR + INRIA
✅DXA images to obtain #3D shape
✅External body to internal skeleton

More: https://bit.ly/3v7Z5TQ

👍4🤯2🔥1😱1

2.55K viewsedited 14:09

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🎷 Pix2Seq: object detection by #Google 🎷

👉A novel framework to perform object detection as a language modeling task

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Obj. detection as a lang-modeling task
✅BBs/labels -> seq. of discrete token
✅Encoder-decoder (one token at a time)
✅Code under Apache License 2.0

More: https://bit.ly/3F49PX3

👍8🤯3🔥1😱1🎉1🤩1

2.21K viewsedited 19:37