AI with Papers - Artificial Intelligence & Deep Learning
15.4K subscribers
140 photos
253 videos
14 files
1.33K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸĨŽHW-Accelerated Neuro-EvolutionðŸĨŽ

👉Scalable, general purpose, hardware accelerated neuro-evolution toolkit by Google

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Parallel on multiple TPU/GPUs
✅Neuro-evo algorithms with NNs
✅WaterWorld, Abstract paint, more
✅From Google, not an official product
✅Code under Apache License 2.0

More: https://bit.ly/3szEi9w
👍3ðŸ”Ĩ2ðŸĪŊ1ðŸ˜ą1
This media is not supported in your browser
VIEW IN TELEGRAM
🚛 DeepETA: #Uber ETA via #AI🚛

👉Uber unveils the low-latency deep architecture for global ETA prediction

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Latency / Accuracy / Generality
✅7 NNs architectures tested
✅Encoder-decoder + Self-Attention
✅Linear transformer (kernel trick)
✅Feature sparsity for speed

More: https://bit.ly/3gFWmJh
👍3ðŸ”Ĩ1ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
✏ïļCLIPasso: Semantic Sketching via CLIP✏ïļ

👉Sketching method guided by geometric and semantic simplifications (CLIP)

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅EPFL, TAU and IDC Herzliya
✅CLIP image encoder for sketching
✅Sketching as a set of Bezier curves
✅Param-optimization on CLIP-loss
✅Source code and models available

More: https://bit.ly/3oLEDF4
ðŸ”Ĩ2ðŸĨ°2👍1
This media is not supported in your browser
VIEW IN TELEGRAM
🊂SAHI: slicing detection/segmentation🊂

👉An open-source lightweight library for large scale object detection & instance segmentation

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Slicing Aided Hyper Inference
✅Large-scale detection/segment.
✅Sliced inference and merging
✅Utils for conversion, slicing, etc.
✅Code licensed under MIT License

More: https://bit.ly/3uMJoBZ
ðŸ”Ĩ3âĪ2ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
🎁100,000,000 image-text pairs!🎁

👉Large-scale Chinese cross-modal dataset for benchmarking different multi-modal pre-training methods.

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅100 Million <image, text> pairs
✅>200px size, aspect ratio (1/3~3)
✅Models of ResNet, ViT & SwinT
✅Methods: CLIP, FILIP and LiT
✅Privacy/Sensitive words ðŸĪ”

More: https://bit.ly/34BqlzX
👍5ðŸĪ”1
This media is not supported in your browser
VIEW IN TELEGRAM
🧁33 Million synthetic pedestrians🧁

👉A novel large, fully synthetic dataset

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Exploiting the #gta5 engine
✅764 full-HD videos @20 fps
✅33M+ person instances
✅BBs & segmentation masks
✅2D/3D keypoints & depth

More: https://bit.ly/36njlY1
👍6ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸĨMarker-free 6D-point trackingðŸĨ

👉Full position and rotation of skeletal joints, with only a RGB frame

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Full 3-axis joint rotations
✅V-markers, emulating mocap
✅#3D from monocular with NN
✅Generalization, no retraining
✅SOTA rotation/position est.

More: https://bit.ly/34GdoF5
ðŸ”Ĩ12ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
🧞 Synthetic dataset for #Retail 🧞

👉A large-scale photorealistic synthetic dataset with annotations for semantic segmentation, instance segmentation, depth estimation, and object detection.

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Dataset from Standard.AI
✅2,134 unique scenes
✅25k+ annotated samples
✅Introducing the "change detection"
✅Multi-view representation learning
✅NonCommercial-ShareAlike 4.0

More: https://bit.ly/3uXqubB
ðŸĪŊ6ðŸĨ°3👍1ðŸ”Ĩ1👏1
This media is not supported in your browser
VIEW IN TELEGRAM
🌈 Graph Neural Nets Forecasting🌈

👉Data-driven approach for forecasting global weather using graph neural networks

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Data-driven forecasting via GNNs
✅Model: 6.7M parameters, float32
✅6-hours forecast in 0.04 secs.
✅A 5-day forecast in 0.8 secs.

More: https://bit.ly/3LH4CXR
👏4👍2ðŸĪ”1
Media is too big
VIEW IN TELEGRAM
ðŸĨŦWatch Those Words!ðŸĨŦ

👉Berkeley unveils a novel approach to discover cheap-fake and visually persuasive deep-fakes

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Regardless of falsification
✅Semantic person-specific
✅Word-conditioned analysis
✅Generalization across fakes

More: https://bit.ly/3oXWmcd
👍5ðŸ˜ą1
This media is not supported in your browser
VIEW IN TELEGRAM
🔋V2X-sim for #selfdriving is out!🔋

👉V2X: collaboration between a vehicle and any surrounding entity

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Suitable for #selfdrivingcars
✅Rec. from road & vehicles
✅Multi-streams/perception
✅Detection, tracking, & segmentation
✅RGB, depth, semantic, BEV & LiDAR

More: https://bit.ly/3H6veOI
ðŸ”Ĩ6ðŸĪĐ1
This media is not supported in your browser
VIEW IN TELEGRAM
🍏Infinite Synthetic dataset for Fitness🍏

👉Opensource synthetic images for fitness, single/multi-person, and realistic variation in lighting, camera angles, and occlusions

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅60k images, 1-5 avatars
✅15 categories, 21 variations
✅Blender and ray-tracing
✅SMPL-X + facial expression
✅Cloth/skin tone sampled
✅147 4K HDRI panoramas
✅Creative Commons 4.0

More: https://bit.ly/33B1R9q
ðŸĪĐ5âĪ1👍1
This media is not supported in your browser
VIEW IN TELEGRAM
♊ DITTO: Digital Twins from Interaction ♊

👉Digitizing objects for #metaverse through interactive perception

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅DIgital Twin of arTiculated Objects
✅Geometry & kinematic articulation
✅Articulation & 3D via perception
✅Source code under MIT License

More:https://bit.ly/3LMazCV
ðŸ”Ĩ5âĪ2👍1ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸĪ– Robotic Telekinesis from Youtube ðŸĪ–

👉CMU unveils a Robot that observes humans and imitates their actions in real-time

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Enabling robo-hand teleoperation
✅Suitable for untrained operator
✅Single uncalibrated RGB camera
✅Leveraging unlabeled #youtube
✅No active fine-tuning or setup
✅No collision via Adv-Training

More: https://bit.ly/3H7zUnh
ðŸ”Ĩ3ðŸĪŊ2👍1👏1
This media is not supported in your browser
VIEW IN TELEGRAM
💄DIGAN: #AI for video generation💄

👉A novel INR-based generative adversarial network for video generation

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Dynamics-aware generator
✅INR-based clip generator
✅Manipulating space/time
✅Identifying unnatural motion

More: https://bit.ly/3H6sHE4
ðŸ”Ĩ4ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸĶ„FILM Neural Frame InterpolationðŸĶ„

👉Frame interpolation that synthesizes multiple intermediate frames from two input images with large in-between motion

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Single unified network
✅High quality output
✅SOTA on the Xiph
✅Apache License 2.0

More: https://bit.ly/3pl4ZxH
ðŸ”Ĩ5👍2ðŸĨ°1
This media is not supported in your browser
VIEW IN TELEGRAM
🔈Neural Maintenance via listening🔈

👉Novel neural-method to detect whether a machine is "healthy" or requires maintenance

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Defects at an early stage
✅FDWT, fast discrete wavelet
✅Learnable wavelet/denoising
✅Unsupervised learnable FDWT
✅The new SOTA in PM

More: https://bit.ly/3hiKWeX
ðŸĪŊ6ðŸĪ”1
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸŸĶðŸŸĻ StyleGAN on Internet pics ðŸŸĶðŸŸĻ

👉StyleGAN on raw uncurated images collected from Internet

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Outliers & multi-modal
✅Self-distillation approach
✅Self-filtering of outliers
✅Perceptual clustering

More: https://bit.ly/33Z1d5H
âĪ2👍1ðŸ”Ĩ1ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸĶœThe new SOTA for Unsupervised ðŸĶœ

👉Self-supervised transformer to discover objects in images

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Visual tokens as nodes in graph
✅Edges as connectivity score
✅The second smallest eV = fg
✅Suitable for unsupervised saliency
✅Weakly supervised obj. detection
✅Code under MIT License


More: https://bit.ly/3sqbFg3
👍4ðŸ”Ĩ3ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸĨĶ GAN-generated CryptoPunks ðŸĨĶ

👉A simple (and funny) SN-GAN to generate cryptopunks

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Spectral normalization (2018)
✅Easy to incorporate into training
✅A project by Teddy Koker ðŸŽĐ

More: https://bit.ly/35C1rQI
âĪ3😁3👍1👏1
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸĪŠSEER: self-AI from BILLIONS picðŸĪŠ

👉META + INRIA trained models on billions of random images without any pre-processing or assumptions

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Self-supervised on pics from web
✅Discovering properties in datasets
✅More fair, less biased & less harmful
✅Better OOD generalization
✅Source code available!

More: https://bit.ly/3vy69dd
ðŸ”Ĩ4👍3ðŸĪŊ1