AI with Papers - Artificial Intelligence & Deep Learning
15.4K subscribers
140 photos
253 videos
14 files
1.33K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ "Segmenting Anything". CRAZY! ๐Ÿ”ฅ

๐Ÿ‘‰#Meta unveils a novel model and (1B+) dataset for neural segmentation ๐Ÿคฏ

๐Ÿ˜ŽReview https://bit.ly/3nM2uXx
๐Ÿ˜ŽPaper https://bit.ly/43788DC
๐Ÿ˜ŽProject https://segment-anything.com
๐Ÿ˜ŽCode github.com/facebookresearch/segment-anything
๐Ÿคฏ36โค16๐Ÿ˜ฑ3๐Ÿ‘2๐Ÿ”ฅ1๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿชฌ META's Animated Drawings is out! ๐Ÿชฌ

๐Ÿ‘‰#META unveils an easy-to-use method for animating human-like figures drawn by children.

๐Ÿ˜ŽReview https://bit.ly/3mGeQQv
๐Ÿ˜ŽPaper arxiv.org/pdf/2303.12741.pdf
๐Ÿ˜ŽProject fairanimateddrawings.com
๐Ÿ˜ฑ16๐Ÿฅฐ5๐Ÿ‘4๐Ÿ‘2๐Ÿคฉ2โšก1๐Ÿ”ฅ1๐Ÿพ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฆ• 6D Non-Prehensile Manipulation ๐Ÿฆ•

๐Ÿ‘‰#META (+CMU) unveils HACMan, novel 6D non-prehensile manipulation of objects

๐Ÿ˜ŽReview https://bit.ly/3NP1jl1
๐Ÿ˜ŽPaper arxiv.org/pdf/2305.03942.pdf
๐Ÿ˜ŽProject hacman-2023.github.io
๐Ÿ‘6๐Ÿ”ฅ4๐Ÿคฏ3๐Ÿ˜ฑ1
๐Ÿฆ™ Llama-2: the Open-Source "ChatGPT" ๐Ÿฆ™

๐Ÿ‘‰GenAI, #Meta unveils Llama-2: a collection of LLMs ranging in scale 7-70B params. Challenging with #chatgpt, but open.

๐Ÿ˜ŽReview https://t.ly/bLJgP
๐Ÿ˜ŽPaper https://t.ly/AOXru
๐Ÿ˜ŽProject https://ai.meta.com/llama
๐Ÿคฏ19โค2๐Ÿ”ฅ1๐Ÿ’ฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ˜ Controllable Synthetic Data (extending Image-Net) ๐Ÿ˜

๐Ÿ‘‰#META's PUG, a new generation of interactive environments for representation learning. Extending Image-Net!

๐Ÿ˜ŽReview https://t.ly/nCYs0
๐Ÿ˜ŽPaper arxiv.org/pdf/2308.03977.pdf
๐Ÿ˜ŽProject pug.metademolab.com
๐Ÿ˜ŽCode github.com/facebookresearch/PUG
๐Ÿ”ฅ4โค2๐Ÿ‘1๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
โ›บFACET: Fairness in Computer Visionโ›บ

๐Ÿ‘‰#META AI opens a large, publicly available dataset for classification, detection & segmentation. Potential performance disparities & challenges across sensitive demographic attributes

๐Ÿ˜ŽReview https://t.ly/mKn-t
๐Ÿ˜ŽPaper arxiv.org/pdf/2309.00035.pdf
๐Ÿ˜ŽDataset https://facet.metademolab.com/
๐Ÿ”ฅ10โค6๐Ÿ‘4๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ๐Ÿ”ฅ #META's DINOv2 is now commercial! ๐Ÿ”ฅ๐Ÿ”ฅ

๐Ÿ‘‰Universal features for image classification, instance retrieval, video understanding, depth & semantic segmentation. Now suitable for commercial.

๐Ÿ˜ŽReview https://t.ly/LNrGy
๐Ÿ˜ŽPaper arxiv.org/pdf/2304.07193.pdf
๐Ÿ˜ŽCode github.com/facebookresearch/dinov2
๐Ÿ˜ŽDemo dinov2.metademolab.com/
๐Ÿ”ฅ15๐Ÿ‘3โค1๐Ÿคฏ1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
โœŒ๏ธ Relighted 3D Hands ๐Ÿคž

๐Ÿ‘‰#META unveils Re:InterHand: a large dataset of relighted 3D interacting hands

๐Ÿ˜ŽReview https://t.ly/I1dQk
๐Ÿ˜ŽPaper arxiv.org/pdf/2310.17768.pdf
๐Ÿ˜ŽProject mks0601.github.io/ReInterHand
๐Ÿ˜ŽData github.com/mks0601/ReInterHand
๐Ÿคฏ8โค1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ“ Emu: image edit / video gen. ๐Ÿ“

๐Ÿ‘‰#Meta the new SOTA in text-to-video generation and instruction-based image editing

๐Ÿ‘‰ Review https://t.ly/PMTBc
๐Ÿ‘‰ Paper (images): https://lnkd.in/eVadH-QS
๐Ÿ‘‰ Project https://lnkd.in/eG8eWUJY
๐Ÿ‘‰ Paper (video): https://lnkd.in/eVadH-QS
๐Ÿ‘‰ Project https://lnkd.in/eu6Zu6gp
๐Ÿ”ฅ8๐Ÿคฏ2๐Ÿ‘1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿช–RT Humanoid from Head-Mounted Sensors๐Ÿช–

๐Ÿ‘‰#META (+CMU) announced SimXR, a method for controlling a simulated avatar from info obtained from AR/VR headsets

๐Ÿ‘‰Review https://t.ly/Si2Mp
๐Ÿ‘‰Paper arxiv.org/pdf/2403.06862.pdf
๐Ÿ‘‰Project www.zhengyiluo.com/SimXR/
โค12โšก1๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸงคHOT3D Hand/Object Tracking๐Ÿงค

๐Ÿ‘‰#Meta opens a novel egocentric dataset for 3D hand & object tracking. A new benchmark for vision-based understanding of 3D hand-object interactions. Dataset available ๐Ÿ’™

๐Ÿ‘‰Review https://t.ly/cD76F
๐Ÿ‘‰Paper https://lnkd.in/e6_7UNny
๐Ÿ‘‰Data https://lnkd.in/e6P-sQFK
๐Ÿ”ฅ9โค3๐Ÿ‘3๐Ÿ‘2๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ๐Ÿ”ฅ SAM v2 is out! ๐Ÿ”ฅ๐Ÿ”ฅ

๐Ÿ‘‰#Meta announced SAM 2, the novel unified model for real-time promptable segmentation in images and videos. 6x faster, it's the new SOTA by a large margin. Source Code, Dataset, Models & Demo released under permissive licenses๐Ÿ’™

๐Ÿ‘‰Review https://t.ly/oovJZ
๐Ÿ‘‰Paper https://t.ly/sCxMY
๐Ÿ‘‰Demo https://sam2.metademolab.com
๐Ÿ‘‰Project ai.meta.com/blog/segment-anything-2/
๐Ÿ‘‰Models github.com/facebookresearch/segment-anything-2
๐Ÿ”ฅ27โค10๐Ÿคฏ4๐Ÿ‘2๐Ÿพ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ EFM3D: 3D Ego-Foundation ๐Ÿ

๐Ÿ‘‰#META presents EFM3D, the first benchmark for 3D object detection and surface regression on HQ annotated egocentric data of Project Aria. Datasets & Code released๐Ÿ’™

๐Ÿ‘‰Review https://t.ly/cDJv6
๐Ÿ‘‰Paper arxiv.org/pdf/2406.10224
๐Ÿ‘‰Project www.projectaria.com/datasets/aeo/
๐Ÿ‘‰Repo github.com/facebookresearch/efm3d
๐Ÿ”ฅ9โค2๐Ÿ‘2โšก1๐Ÿ‘1๐Ÿ˜1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ CoTracker3 by #META is out! ๐Ÿ”ฅ

๐Ÿ‘‰#Meta (+VGG Oxford) unveils CoTracker3, a new tracker that outperforms the previous SoTA by a large margin using only the 0.1% of the training data ๐Ÿคฏ๐Ÿคฏ๐Ÿคฏ

๐Ÿ‘‰Review https://t.ly/TcRIv
๐Ÿ‘‰Paper arxiv.org/pdf/2410.11831
๐Ÿ‘‰Project cotracker3.github.io/
๐Ÿ‘‰Code github.com/facebookresearch/co-tracker
โค14๐Ÿ”ฅ3๐Ÿคฏ3๐Ÿพ2๐Ÿ‘1๐Ÿ˜ฑ1๐Ÿ˜1
This media is not supported in your browser
VIEW IN TELEGRAM
โ˜€๏ธ Universal Relightable Avatars โ˜€๏ธ

๐Ÿ‘‰#Meta unveils URAvatar, photorealistic & relightable avatars from phone scan with unknown illumination. Stunning results!

๐Ÿ‘‰Review https://t.ly/U-ESX
๐Ÿ‘‰Paper arxiv.org/pdf/2410.24223
๐Ÿ‘‰Project junxuan-li.github.io/urgca-website
โค11๐Ÿ”ฅ5โšก1๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
โค๏ธโ€๐Ÿ”ฅ Uncommon object in #3D โค๏ธโ€๐Ÿ”ฅ

๐Ÿ‘‰#META releases uCO3D, a new object-centric dataset for 3D AI. The largest publicly-available collection of HD videos of objects with 3D annotations that ensures full-360โ—ฆ coverage. Code & data under CCA 4.0๐Ÿ’™

๐Ÿ‘‰Review https://t.ly/Z_tvA
๐Ÿ‘‰Paper https://arxiv.org/pdf/2501.07574
๐Ÿ‘‰Project https://uco3d.github.io/
๐Ÿ‘‰Repo github.com/facebookresearch/uco3d
โค11โšก2๐Ÿ˜2๐Ÿ‘1๐Ÿ‘1๐Ÿคฉ1๐Ÿพ1
This media is not supported in your browser
VIEW IN TELEGRAM
โ˜€๏ธ Relightable Full-Body Avatars โ˜€๏ธ

๐Ÿ‘‰#Meta unveils the first approach ever to jointly model the relightable appearance of the body, face, and hands of drivable avatars.

๐Ÿ‘‰Review https://t.ly/kx9gf
๐Ÿ‘‰Paper arxiv.org/pdf/2501.14726
๐Ÿ‘‰Project neuralbodies.github.io/RFGCA
โค3๐Ÿ‘3๐Ÿ”ฅ3โšก1๐Ÿคฏ1๐Ÿ˜ข1๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ VideoJAM: #META's Video-Model (SOTA) ๐Ÿ”ฅ

๐Ÿ‘‰#META's VideoJAM: the new SOTA (by large margin) in motion coherence for video generation, much better than SORA! A strong motion prior into any video-gen model. Impressive results, no code announced๐Ÿฅฒ

๐Ÿ‘‰Review https://shorturl.at/id7Bt
๐Ÿ‘‰Paper https://arxiv.org/pdf/2502.02492
๐Ÿ‘‰Project https://hila-chefer.github.io/videojam-paper.github.io/
๐Ÿ”ฅ9โค4๐Ÿ‘1๐Ÿ‘1๐Ÿคฉ1