AI with Papers - Artificial Intelligence & Deep Learning
17.3K subscribers
158 photos
276 videos
14 files
1.45K links
All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#AI #chatGPT
Download Telegram
🍈SegNeXt: new SOTA in Semantic Seg.🍈

πŸ‘‰SOTA (by large margin) on ADE20K, Cityscapes, COCO-Stuff, Pascal VOC, Pascal Context, and iSAID 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Novel tailored network architecture
βœ…Spatial attention via multi-scale feats
βœ…Encoder + conv. better than transformers
βœ…SOTA on several datasets (ADE20K, etc.)

More: https://bit.ly/3UrZhrH
πŸ”₯9πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦ͺStereoVoxelNet: RT Obstacles DetectionπŸ¦ͺ

πŸ‘‰Novel deep neural approach to detect occupancy from stereo images directly

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Occupancy voxels via deep learning
βœ…RT on Jetson-TX2 (-98% CPU of SOTA)
βœ…Optimization via octrees / sparse conv.
βœ…Real-world stereo in/outdoor dataset

More: https://bit.ly/3BylAn3
πŸ‘10πŸ₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
🚜 NeRF-Factory: a NeRF collection 🚜

πŸ‘‰PyTorch-reimplemented NeRF library with 7 popular models/implementations & 7 datasets

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…NeRF: Project | Paper | Code
βœ…NeRF++: Paper | Code
βœ…DVGO: Project | Paper v1/v2 | Code
βœ…Plenoxels: Project | Paper | Code
βœ…Mip-NeRF: Project | Paper | Code
βœ…Mip-NeRF360: Project | Paper | Code
βœ…Ref-NeRF: Project | Paper | Code

More: https://bit.ly/3qUgmgC
πŸ‘7🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯Ά Lumos by #Nvidia: Relighting Portrait πŸ₯Ά

πŸ‘‰The new SOTA in relighting without requiring a light stage

😎Review https://bit.ly/3dCH9ej
😎Project deepimagination.cc/Lumos
😎Paper arxiv.org/pdf/2209.10510.pdf
😎Demo http://imaginaire.cc/Lumos/
❀11πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🍜 SURF-GAN: NeRF - >StyleGAN 🍜

πŸ‘‰ Editable portraits by injecting the NeRF's prior into StyleGAN

😎Review https://bit.ly/3SohEw3
😎Project jgkwak95.github.io/surfgan
😎Paper arxiv.org/pdf/2207.10257.pdf
😎Code github.com/jgkwak95/SURF-GAN
πŸ‘4❀2❀‍πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯#Google just announced "TensorStore"πŸ”₯

πŸ‘‰Novel open-source C++ / #Python library for storage/manipulation of high-dim data

😎Review https://bit.ly/3DLwbha
😎Project https://bit.ly/3C4T2TR
😎Code github.com/google/tensorstore
πŸ”₯14πŸ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
🦠 Motion Transformer for #selfdriving 🦠

πŸ‘‰The 1st place solution for 2022 #waymo "motion prediction" challenge

😎Review https://bit.ly/3f8G4LD
😎Paper arxiv.org/pdf/2209.10033.pdf
😎Code github.com/sshaoshuai/MTR
πŸ”₯17πŸ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ’Ή Image Synthesis @160+ FPS! πŸ’Ή

πŸ‘‰Super-fast, 3D-Aware Image Synthesis with Sparse Voxels -> up to 167 FPS!

😎Review https://bit.ly/3r3ZNij
😎Paper arxiv.org/pdf/2206.07695.pdf
😎Project katjaschwarz.github.io/voxgraf
πŸ‘3🀯2πŸ”₯1πŸ’―1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘› #Nvidia GET3D: #3D generative #AI πŸ‘›

πŸ‘‰AI-based Textured 3D meshes with complex topology, rich geometry & hi-fi textures

😎Review https://bit.ly/3SgnT5h
😎Code github.com/nv-tlabs/GET3D
😎Project nv-tlabs.github.io/GET3D/
😎Paper nv-tlabs.github.io/GET3D/assets/paper.pdf
❀‍πŸ”₯7πŸ‘5
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯πŸ”₯ IDE-3D: source code is out! πŸ”₯πŸ”₯

πŸ‘‰Novel, photorealistic, 3D-aware facial generator: source code just released!

😎Review https://bit.ly/3BNrO2C
😎Project mrtornado24.github.io/IDE-3D/
😎Code github.com/MrTornado24/IDE-3D
😎Paper arxiv.org/pdf/2205.15517.pdf
🀯8πŸ‘5πŸ”₯3🀩3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯Diffusion Model of Neural CheckpointsπŸ”₯

πŸ‘‰Conditional diffusion model on Millions of checkpoints of a given task/architecture 🀯

😎Review https://bit.ly/3SBR4Qb
😎Project www.wpeebles.com/Gpt
😎Code github.com/wpeebles/G.pt
😎Paper arxiv.org/pdf/2209.12892.pdf
🀯5❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ Semantic VISOR dataset is out! πŸ”₯

πŸ‘‰Segmenting hands / active objects in egocentric video (millions masks)

😎Review https://bit.ly/3LOBLBv
😎Project epic-kitchens.github.io/VISOR/
😎Paper arxiv.org/pdf/2209.13064.pdf
🀯8πŸ”₯4πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯‡πŸ₯‡ Olympic Games in 2028? πŸ₯‡πŸ₯‡

πŸ‘‰ In a few years, the fastest runner on earth will not be a human πŸ₯Ά

😎Review https://bit.ly/3Rme3O3
😱8πŸ‘3πŸ‘Ž1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ SOTA ALERT: new Text-to-Video #AI πŸ”₯

πŸ‘‰#META unveils a novel Text-to-Video (T2V) generation #AI

😎Review https://bit.ly/3E1ZDzG
😎Project https://makeavideo.studio/
😎Paper makeavideo.studio/Make-A-Video.pdf
🀯9πŸ‘6😱1πŸ’©1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯DreamFusion: Text-to-3D via DiffusionπŸ”₯

πŸ‘‰DeepDream-like procedure to create #3D assets just from a given text

😎Review https://bit.ly/3BYY5nu
😎Paper arxiv.org/pdf/2209.14988.pdf
😎Project dreamfusion3d.github.io/gallery.html
🀯12πŸ‘5πŸ’©1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ§ͺ Light Field Neural Rendering πŸ§ͺ

πŸ‘‰Two-stage transformer capable of non-Lambertian effects (reflection, refraction, translucency)

😎Review https://bit.ly/3CpIFdm
😎Paper arxiv.org/pdf/2112.09687.pdf
😎Project light-field-neural-rendering.github.io
😎Code github.com/google-research/google-research/tree/master/light_field_neural_rendering
🀯14πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🦩Phenaki: Text-to(LOOONG)Video generation🦩

πŸ‘‰Phenaki is an #AI capable of realistic long video synthesis, given a sequence of textual open prompts

😎Review https://bit.ly/3RwUvXx
😎Project phenaki.video/index.h
😎Paper openreview.net/pdf?id=vOEXS39nOF
πŸ”₯7❀3πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ VToonify: Neural Portrait Style Transfer πŸ”₯

πŸ‘‰VToonify for portrait style transfer. Powered by DualStyleGAN backbone, now with #stablediffusion!

😎Review https://bit.ly/3M9wgNP
😎Demo https://t.co/8gXzF3IrpB
😎Paper arxiv.org/pdf/2209.11224.pdf
😎Project mmlab-ntu.com/project/vtoonify
😎Code github.com/williamyang1991/VToonify
πŸ‘22❀3🀯2πŸ”₯1πŸ‘1πŸ’©1
This media is not supported in your browser
VIEW IN TELEGRAM
🐒 Stable Diffusion for #Pokemon 🐒

πŸ‘‰Fine-tuning the stable diffusion to create a text-to-pokemon generation model

😎Review https://bit.ly/3C9qBTw
😎Tutorial https://lambdalabs.com/blog/how-to-fine-tune-stable-diffusion-how-we-made-the-text-to-pokemon-model-at-lambda/
❀8πŸ‘4
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ Imagen Video by #Google. SICK! πŸ”₯

πŸ‘‰Novel text-conditional video generation via cascade of video diffusion models 🀯

😎Review https://bit.ly/3SH2TVH
😎Project imagen.research.google/video/
😎Paper imagen.research.google/video/paper.pdf
🀯20πŸ”₯7πŸ‘1