AI with Papers - Artificial Intelligence & Deep Learning
15.4K subscribers
140 photos
253 videos
14 files
1.33K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
🍏 Open Source Vision from #Apple 🍏

👉CVNets: open-source (not a joke) lib for neural vision.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
PyTorch-based neural lib. for vision
Train 2−4× longer w/ augmentations
Plug-and-play components for CV
Source code under a custom license

More: https://bit.ly/39d1dSj
👍9
🔥One Millisecond Backbone. Fire!🔥

👉MobileOne by #Apple: efficient mobile backbone with inference <1 ms on #iPhone12!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
75.9% top-1 accuracy on ImageNet
38× faster than MobileFormer net
Classification, detection & segmentation
Source code & model soon available!

More: https://bit.ly/3tsT7f2
24👍2
This media is not supported in your browser
VIEW IN TELEGRAM
🍏NeuMan: Human NeRF in the wild🍏

👉#Apple opens a novel human pose/view from just a single in-the-wild video

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
No extra devices/annotations
Both Human (novel poses) + Scene
E2E SMPL optimization + error-corr.
Applications such as "telegathering"

More: https://bit.ly/3K4iTO6
👍15
This media is not supported in your browser
VIEW IN TELEGRAM
🍏 f-DM: Diffusion Models by Apple 🍏

👉Spectacular work by #Apple on DMs: HQ generation with better efficiency and semantic

😎Review https://bit.ly/3Tils2u
😎Project https://jiataogu.me/fdm/
😎Paper arxiv.org/pdf/2210.04955.pdf
10😱2👍1
This media is not supported in your browser
VIEW IN TELEGRAM
🧱 MobileBrick: #3D object on mobile 🧱

👉#Apple (+Oxford) exploiting #LEGO bricks to open the most precise #3D dataset ever. Suitable for mobile #AR

😎Review https://bit.ly/3ZqbiAh
😎Paper arxiv.org/pdf/2303.01932.pdf
😎Project code.active.vision/MobileBrick/
😎Code github.com/ActiveVisionLab/MobileBrick
🔥6👍2😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥 #Apple Co-Motion is out! 🔥

👉Apple unveils a novel approach for detecting & tracking detailed 3D poses of multiple people from single monocular stream. Temporally coherent predictions in crowded scenes with hard poses & occlusions. New SOTA, 10x faster! Code & Models released only for research💙

👉Review https://t.ly/-86CO
👉Paper https://lnkd.in/dQsVGY7q
👉Repo https://lnkd.in/dh7j7N89
👍7🤣65🔥2😍1