This media is not supported in your browser
VIEW IN TELEGRAM
π DeepETA: #Uber ETA via #AIπ
πUber unveils the low-latency deep architecture for global ETA prediction
ππ’π π‘π₯π’π π‘ππ¬:
β Latency / Accuracy / Generality
β 7 NNs architectures tested
β Encoder-decoder + Self-Attention
β Linear transformer (kernel trick)
β Feature sparsity for speed
More: https://bit.ly/3gFWmJh
πUber unveils the low-latency deep architecture for global ETA prediction
ππ’π π‘π₯π’π π‘ππ¬:
β Latency / Accuracy / Generality
β 7 NNs architectures tested
β Encoder-decoder + Self-Attention
β Linear transformer (kernel trick)
β Feature sparsity for speed
More: https://bit.ly/3gFWmJh
π3π₯1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
βοΈCLIPasso: Semantic Sketching via CLIPβοΈ
πSketching method guided by geometric and semantic simplifications (CLIP)
ππ’π π‘π₯π’π π‘ππ¬:
β EPFL, TAU and IDC Herzliya
β CLIP image encoder for sketching
β Sketching as a set of Bezier curves
β Param-optimization on CLIP-loss
β Source code and models available
More: https://bit.ly/3oLEDF4
πSketching method guided by geometric and semantic simplifications (CLIP)
ππ’π π‘π₯π’π π‘ππ¬:
β EPFL, TAU and IDC Herzliya
β CLIP image encoder for sketching
β Sketching as a set of Bezier curves
β Param-optimization on CLIP-loss
β Source code and models available
More: https://bit.ly/3oLEDF4
π₯2π₯°2π1
This media is not supported in your browser
VIEW IN TELEGRAM
πͺSAHI: slicing detection/segmentationπͺ
πAn open-source lightweight library for large scale object detection & instance segmentation
ππ’π π‘π₯π’π π‘ππ¬:
β Slicing Aided Hyper Inference
β Large-scale detection/segment.
β Sliced inference and merging
β Utils for conversion, slicing, etc.
β Code licensed under MIT License
More: https://bit.ly/3uMJoBZ
πAn open-source lightweight library for large scale object detection & instance segmentation
ππ’π π‘π₯π’π π‘ππ¬:
β Slicing Aided Hyper Inference
β Large-scale detection/segment.
β Sliced inference and merging
β Utils for conversion, slicing, etc.
β Code licensed under MIT License
More: https://bit.ly/3uMJoBZ
π₯3β€2π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π100,000,000 image-text pairs!π
πLarge-scale Chinese cross-modal dataset for benchmarking different multi-modal pre-training methods.
ππ’π π‘π₯π’π π‘ππ¬:
β 100 Million <image, text> pairs
β >200px size, aspect ratio (1/3~3)
β Models of ResNet, ViT & SwinT
β Methods: CLIP, FILIP and LiT
β Privacy/Sensitive words π€
More: https://bit.ly/34BqlzX
πLarge-scale Chinese cross-modal dataset for benchmarking different multi-modal pre-training methods.
ππ’π π‘π₯π’π π‘ππ¬:
β 100 Million <image, text> pairs
β >200px size, aspect ratio (1/3~3)
β Models of ResNet, ViT & SwinT
β Methods: CLIP, FILIP and LiT
β Privacy/Sensitive words π€
More: https://bit.ly/34BqlzX
π5π€1
This media is not supported in your browser
VIEW IN TELEGRAM
π§33 Million synthetic pedestriansπ§
πA novel large, fully synthetic dataset
ππ’π π‘π₯π’π π‘ππ¬:
β Exploiting the #gta5 engine
β 764 full-HD videos @20 fps
β 33M+ person instances
β BBs & segmentation masks
β 2D/3D keypoints & depth
More: https://bit.ly/36njlY1
πA novel large, fully synthetic dataset
ππ’π π‘π₯π’π π‘ππ¬:
β Exploiting the #gta5 engine
β 764 full-HD videos @20 fps
β 33M+ person instances
β BBs & segmentation masks
β 2D/3D keypoints & depth
More: https://bit.ly/36njlY1
π6π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯Marker-free 6D-point trackingπ₯
πFull position and rotation of skeletal joints, with only a RGB frame
ππ’π π‘π₯π’π π‘ππ¬:
β Full 3-axis joint rotations
β V-markers, emulating mocap
β #3D from monocular with NN
β Generalization, no retraining
β SOTA rotation/position est.
More: https://bit.ly/34GdoF5
πFull position and rotation of skeletal joints, with only a RGB frame
ππ’π π‘π₯π’π π‘ππ¬:
β Full 3-axis joint rotations
β V-markers, emulating mocap
β #3D from monocular with NN
β Generalization, no retraining
β SOTA rotation/position est.
More: https://bit.ly/34GdoF5
π₯12π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π§Ό Synthetic dataset for #Retail π§Ό
πA large-scale photorealistic synthetic dataset with annotations for semantic segmentation, instance segmentation, depth estimation, and object detection.
ππ’π π‘π₯π’π π‘ππ¬:
β Dataset from Standard.AI
β 2,134 unique scenes
β 25k+ annotated samples
β Introducing the "change detection"
β Multi-view representation learning
β NonCommercial-ShareAlike 4.0
More: https://bit.ly/3uXqubB
πA large-scale photorealistic synthetic dataset with annotations for semantic segmentation, instance segmentation, depth estimation, and object detection.
ππ’π π‘π₯π’π π‘ππ¬:
β Dataset from Standard.AI
β 2,134 unique scenes
β 25k+ annotated samples
β Introducing the "change detection"
β Multi-view representation learning
β NonCommercial-ShareAlike 4.0
More: https://bit.ly/3uXqubB
π€―6π₯°3π1π₯1π1
This media is not supported in your browser
VIEW IN TELEGRAM
π Graph Neural Nets Forecastingπ
πData-driven approach for forecasting global weather using graph neural networks
ππ’π π‘π₯π’π π‘ππ¬:
β Data-driven forecasting via GNNs
β Model: 6.7M parameters, float32
β 6-hours forecast in 0.04 secs.
β A 5-day forecast in 0.8 secs.
More: https://bit.ly/3LH4CXR
πData-driven approach for forecasting global weather using graph neural networks
ππ’π π‘π₯π’π π‘ππ¬:
β Data-driven forecasting via GNNs
β Model: 6.7M parameters, float32
β 6-hours forecast in 0.04 secs.
β A 5-day forecast in 0.8 secs.
More: https://bit.ly/3LH4CXR
π4π2π€1
Media is too big
VIEW IN TELEGRAM
π₯«Watch Those Words!π₯«
πBerkeley unveils a novel approach to discover cheap-fake and visually persuasive deep-fakes
ππ’π π‘π₯π’π π‘ππ¬:
β Regardless of falsification
β Semantic person-specific
β Word-conditioned analysis
β Generalization across fakes
More: https://bit.ly/3oXWmcd
πBerkeley unveils a novel approach to discover cheap-fake and visually persuasive deep-fakes
ππ’π π‘π₯π’π π‘ππ¬:
β Regardless of falsification
β Semantic person-specific
β Word-conditioned analysis
β Generalization across fakes
More: https://bit.ly/3oXWmcd
π5π±1
This media is not supported in your browser
VIEW IN TELEGRAM
πV2X-sim for #selfdriving is out!π
πV2X: collaboration between a vehicle and any surrounding entity
ππ’π π‘π₯π’π π‘ππ¬:
β Suitable for #selfdrivingcars
β Rec. from road & vehicles
β Multi-streams/perception
β Detection, tracking, & segmentation
β RGB, depth, semantic, BEV & LiDAR
More: https://bit.ly/3H6veOI
πV2X: collaboration between a vehicle and any surrounding entity
ππ’π π‘π₯π’π π‘ππ¬:
β Suitable for #selfdrivingcars
β Rec. from road & vehicles
β Multi-streams/perception
β Detection, tracking, & segmentation
β RGB, depth, semantic, BEV & LiDAR
More: https://bit.ly/3H6veOI
π₯6π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
πInfinite Synthetic dataset for Fitnessπ
πOpensource synthetic images for fitness, single/multi-person, and realistic variation in lighting, camera angles, and occlusions
ππ’π π‘π₯π’π π‘ππ¬:
β 60k images, 1-5 avatars
β 15 categories, 21 variations
β Blender and ray-tracing
β SMPL-X + facial expression
β Cloth/skin tone sampled
β 147 4K HDRI panoramas
β Creative Commons 4.0
More: https://bit.ly/33B1R9q
πOpensource synthetic images for fitness, single/multi-person, and realistic variation in lighting, camera angles, and occlusions
ππ’π π‘π₯π’π π‘ππ¬:
β 60k images, 1-5 avatars
β 15 categories, 21 variations
β Blender and ray-tracing
β SMPL-X + facial expression
β Cloth/skin tone sampled
β 147 4K HDRI panoramas
β Creative Commons 4.0
More: https://bit.ly/33B1R9q
π€©5β€1π1
This media is not supported in your browser
VIEW IN TELEGRAM
β DITTO: Digital Twins from Interaction β
πDigitizing objects for #metaverse through interactive perception
ππ’π π‘π₯π’π π‘ππ¬:
β DIgital Twin of arTiculated Objects
β Geometry & kinematic articulation
β Articulation & 3D via perception
β Source code under MIT License
More:https://bit.ly/3LMazCV
πDigitizing objects for #metaverse through interactive perception
ππ’π π‘π₯π’π π‘ππ¬:
β DIgital Twin of arTiculated Objects
β Geometry & kinematic articulation
β Articulation & 3D via perception
β Source code under MIT License
More:https://bit.ly/3LMazCV
π₯5β€2π1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π€ Robotic Telekinesis from Youtube π€
πCMU unveils a Robot that observes humans and imitates their actions in real-time
ππ’π π‘π₯π’π π‘ππ¬:
β Enabling robo-hand teleoperation
β Suitable for untrained operator
β Single uncalibrated RGB camera
β Leveraging unlabeled #youtube
β No active fine-tuning or setup
β No collision via Adv-Training
More: https://bit.ly/3H7zUnh
πCMU unveils a Robot that observes humans and imitates their actions in real-time
ππ’π π‘π₯π’π π‘ππ¬:
β Enabling robo-hand teleoperation
β Suitable for untrained operator
β Single uncalibrated RGB camera
β Leveraging unlabeled #youtube
β No active fine-tuning or setup
β No collision via Adv-Training
More: https://bit.ly/3H7zUnh
π₯3π€―2π1π1
This media is not supported in your browser
VIEW IN TELEGRAM
πDIGAN: #AI for video generationπ
πA novel INR-based generative adversarial network for video generation
ππ’π π‘π₯π’π π‘ππ¬:
β Dynamics-aware generator
β INR-based clip generator
β Manipulating space/time
β Identifying unnatural motion
More: https://bit.ly/3H6sHE4
πA novel INR-based generative adversarial network for video generation
ππ’π π‘π₯π’π π‘ππ¬:
β Dynamics-aware generator
β INR-based clip generator
β Manipulating space/time
β Identifying unnatural motion
More: https://bit.ly/3H6sHE4
π₯4π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦FILM Neural Frame Interpolationπ¦
πFrame interpolation that synthesizes multiple intermediate frames from two input images with large in-between motion
ππ’π π‘π₯π’π π‘ππ¬:
β Single unified network
β High quality output
β SOTA on the Xiph
β Apache License 2.0
More: https://bit.ly/3pl4ZxH
πFrame interpolation that synthesizes multiple intermediate frames from two input images with large in-between motion
ππ’π π‘π₯π’π π‘ππ¬:
β Single unified network
β High quality output
β SOTA on the Xiph
β Apache License 2.0
More: https://bit.ly/3pl4ZxH
π₯5π2π₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
πNeural Maintenance via listeningπ
πNovel neural-method to detect whether a machine is "healthy" or requires maintenance
ππ’π π‘π₯π’π π‘ππ¬:
β Defects at an early stage
β FDWT, fast discrete wavelet
β Learnable wavelet/denoising
β Unsupervised learnable FDWT
β The new SOTA in PM
More: https://bit.ly/3hiKWeX
πNovel neural-method to detect whether a machine is "healthy" or requires maintenance
ππ’π π‘π₯π’π π‘ππ¬:
β Defects at an early stage
β FDWT, fast discrete wavelet
β Learnable wavelet/denoising
β Unsupervised learnable FDWT
β The new SOTA in PM
More: https://bit.ly/3hiKWeX
π€―6π€1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦π¨ StyleGAN on Internet pics π¦π¨
πStyleGAN on raw uncurated images collected from Internet
ππ’π π‘π₯π’π π‘ππ¬:
β Outliers & multi-modal
β Self-distillation approach
β Self-filtering of outliers
β Perceptual clustering
More: https://bit.ly/33Z1d5H
πStyleGAN on raw uncurated images collected from Internet
ππ’π π‘π₯π’π π‘ππ¬:
β Outliers & multi-modal
β Self-distillation approach
β Self-filtering of outliers
β Perceptual clustering
More: https://bit.ly/33Z1d5H
β€2π1π₯1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦The new SOTA for Unsupervised π¦
πSelf-supervised transformer to discover objects in images
ππ’π π‘π₯π’π π‘ππ¬:
β Visual tokens as nodes in graph
β Edges as connectivity score
β The second smallest eV = fg
β Suitable for unsupervised saliency
β Weakly supervised obj. detection
β Code under MIT License
More: https://bit.ly/3sqbFg3
πSelf-supervised transformer to discover objects in images
ππ’π π‘π₯π’π π‘ππ¬:
β Visual tokens as nodes in graph
β Edges as connectivity score
β The second smallest eV = fg
β Suitable for unsupervised saliency
β Weakly supervised obj. detection
β Code under MIT License
More: https://bit.ly/3sqbFg3
π4π₯3π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯¦ GAN-generated CryptoPunks π₯¦
πA simple (and funny) SN-GAN to generate cryptopunks
ππ’π π‘π₯π’π π‘ππ¬:
β Spectral normalization (2018)
β Easy to incorporate into training
β A project by Teddy Koker π©
More: https://bit.ly/35C1rQI
πA simple (and funny) SN-GAN to generate cryptopunks
ππ’π π‘π₯π’π π‘ππ¬:
β Spectral normalization (2018)
β Easy to incorporate into training
β A project by Teddy Koker π©
More: https://bit.ly/35C1rQI
β€3π3π1π1
This media is not supported in your browser
VIEW IN TELEGRAM
π€ͺSEER: self-AI from BILLIONS picπ€ͺ
πMETA + INRIA trained models on billions of random images without any pre-processing or assumptions
ππ’π π‘π₯π’π π‘ππ¬:
β Self-supervised on pics from web
β Discovering properties in datasets
β More fair, less biased & less harmful
β Better OOD generalization
β Source code available!
More: https://bit.ly/3vy69dd
πMETA + INRIA trained models on billions of random images without any pre-processing or assumptions
ππ’π π‘π₯π’π π‘ππ¬:
β Self-supervised on pics from web
β Discovering properties in datasets
β More fair, less biased & less harmful
β Better OOD generalization
β Source code available!
More: https://bit.ly/3vy69dd
π₯4π3π€―1