SegNeXt: new SOTA in semantic segmentation
SOTA (by a large margin) on ADE20K, Cityscapes, COCO-Stuff, Pascal VOC, Pascal Context, and iSAID 🤯
Highlights:
- Novel tailored network architecture
- Spatial attention via multi-scale features
- Convolutional encoder beats transformers
- SOTA on several datasets (ADE20K, etc.)
More: https://bit.ly/3UrZhrH
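A rough illustration of the "spatial attention via multi-scale features" highlight: the sketch below builds an attention map by averaging smoothing filters of several sizes and uses that map to reweight the input feature map. It is a toy in plain Python, not SegNeXt's actual module. The mean filters stand in for learned convolutions, and the names `box_filter` and `multi_scale_attention`, as well as the kernel sizes, are assumptions made for this example.

```python
def box_filter(feat, k):
    """Mean filter of odd size k with zero padding (stand-in for a learned conv)."""
    h, w = len(feat), len(feat[0])
    r = k // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            total = 0.0
            for dy in range(-r, r + 1):
                for dx in range(-r, r + 1):
                    yy, xx = y + dy, x + dx
                    if 0 <= yy < h and 0 <= xx < w:
                        total += feat[yy][xx]
            out[y][x] = total / (k * k)
    return out


def multi_scale_attention(feat, kernel_sizes=(3, 5, 7)):
    """Average several filter scales into one map, then reweight the input with it."""
    h, w = len(feat), len(feat[0])
    branches = [box_filter(feat, k) for k in kernel_sizes]
    attn = [
        [sum(b[y][x] for b in branches) / len(branches) for x in range(w)]
        for y in range(h)
    ]
    # Element-wise product: the multi-scale context acts as a spatial attention map.
    return [[attn[y][x] * feat[y][x] for x in range(w)] for y in range(h)]
```

The output has the same shape as the input, so a block like this could in principle be dropped between two layers of an encoder.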
🦪StereoVoxelNet: real-time obstacle detection🦪
Novel deep neural approach that detects occupancy directly from stereo images
Highlights:
- Occupancy voxels via deep learning
- Real-time on Jetson TX2 (-98% CPU vs. SOTA)
- Optimization via octrees / sparse conv.
- Real-world indoor/outdoor stereo dataset
More: https://bit.ly/3BylAn3
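To make "occupancy voxels" and "octrees" concrete, here is a minimal sketch of the data structure only, not the paper's network. It assumes 3D points are already available (StereoVoxelNet itself predicts occupancy from stereo images), maps them to occupied voxel indices, and derives the next-coarser octree level. The function names `voxelize` and `coarsen` and the voxel size are hypothetical.

```python
import math


def voxelize(points, voxel_size, origin=(0.0, 0.0, 0.0)):
    """Return the set of integer voxel indices that contain at least one point."""
    occupied = set()
    for p in points:
        idx = tuple(math.floor((c - o) / voxel_size) for c, o in zip(p, origin))
        occupied.add(idx)
    return occupied


def coarsen(occupied):
    """One octree level up: a parent cell is occupied if any of its 8 children is."""
    return {(i // 2, j // 2, k // 2) for i, j, k in occupied}


# Usage: two nearby points share a voxel, a distant point gets its own.
points = [(0.1, 0.1, 0.1), (0.15, 0.12, 0.1), (0.9, 0.9, 0.9)]
fine = voxelize(points, voxel_size=0.25)
coarse = coarsen(fine)
```

Storing only the occupied indices is what keeps the representation sparse: empty space costs nothing, and coarse levels can be checked first before refining.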
NeRF-Factory: a NeRF collection
NeRF library reimplemented in PyTorch, with 7 popular models and 7 datasets
Highlights:
- NeRF: Project | Paper | Code
- NeRF++: Paper | Code
- DVGO: Project | Paper v1/v2 | Code
- Plenoxels: Project | Paper | Code
- Mip-NeRF: Project | Paper | Code
- Mip-NeRF360: Project | Paper | Code
- Ref-NeRF: Project | Paper | Code
More: https://bit.ly/3qUgmgC
🥶 Lumos by #Nvidia: Relighting Portrait 🥶
The new SOTA in relighting without requiring a light stage
Review https://bit.ly/3dCH9ej
Project deepimagination.cc/Lumos
Paper arxiv.org/pdf/2209.10510.pdf
Demo http://imaginaire.cc/Lumos/
SURF-GAN: NeRF -> StyleGAN
Editable portraits by injecting a NeRF prior into StyleGAN
Review https://bit.ly/3SohEw3
Project jgkwak95.github.io/surfgan
Paper arxiv.org/pdf/2207.10257.pdf
Code github.com/jgkwak95/SURF-GAN
#Google just announced "TensorStore"
Novel open-source C++ / #Python library for storing and manipulating high-dimensional data
Review https://bit.ly/3DLwbha
Project https://bit.ly/3C4T2TR
Code github.com/google/tensorstore
Motion Transformer for #selfdriving
The 1st-place solution for the 2022 #waymo "motion prediction" challenge
Review https://bit.ly/3f8G4LD
Paper arxiv.org/pdf/2209.10033.pdf
Code github.com/sshaoshuai/MTR
Image Synthesis @160+ FPS!
Super-fast, 3D-aware image synthesis with sparse voxels -> up to 167 FPS!
Review https://bit.ly/3r3ZNij
Paper arxiv.org/pdf/2206.07695.pdf
Project katjaschwarz.github.io/voxgraf
#Nvidia GET3D: #3D generative #AI
AI-generated textured 3D meshes with complex topology, rich geometry & hi-fi textures
Review https://bit.ly/3SgnT5h
Code github.com/nv-tlabs/GET3D
Project nv-tlabs.github.io/GET3D/
Paper nv-tlabs.github.io/GET3D/assets/paper.pdf
IDE-3D: source code is out!
Novel, photorealistic, 3D-aware facial generator: source code just released!
Review https://bit.ly/3BNrO2C
Project mrtornado24.github.io/IDE-3D/
Code github.com/MrTornado24/IDE-3D
Paper arxiv.org/pdf/2205.15517.pdf
Diffusion Model of Neural Checkpoints
Conditional diffusion model trained on millions of checkpoints of a given task/architecture 🤯
Review https://bit.ly/3SBR4Qb
Project www.wpeebles.com/Gpt
Code github.com/wpeebles/G.pt
Paper arxiv.org/pdf/2209.12892.pdf
Semantic VISOR dataset is out!
Segmenting hands / active objects in egocentric video (millions of masks)
Review https://bit.ly/3LOBLBv
Project epic-kitchens.github.io/VISOR/
Paper arxiv.org/pdf/2209.13064.pdf
Olympic Games in 2028?
In a few years, the fastest runner on earth will not be a human 🥶
Review https://bit.ly/3Rme3O3
SOTA ALERT: new Text-to-Video #AI
#META unveils a novel Text-to-Video (T2V) generation #AI
Review https://bit.ly/3E1ZDzG
Project https://makeavideo.studio/
Paper makeavideo.studio/Make-A-Video.pdf
DreamFusion: Text-to-3D via Diffusion
DeepDream-like procedure that creates #3D assets from just a text prompt
Review https://bit.ly/3BYY5nu
Paper arxiv.org/pdf/2209.14988.pdf
Project dreamfusion3d.github.io/gallery.html
🧪 Light Field Neural Rendering 🧪
Two-stage transformer that handles non-Lambertian effects (reflection, refraction, translucency)
Review https://bit.ly/3CpIFdm
Paper arxiv.org/pdf/2112.09687.pdf
Project light-field-neural-rendering.github.io
Code github.com/google-research/google-research/tree/master/light_field_neural_rendering
🦩Phenaki: Text-to-(LOOONG)-Video generation🦩
Phenaki is an #AI capable of realistic long video synthesis, given a sequence of open-ended text prompts
Review https://bit.ly/3RwUvXx
Project phenaki.video/index.h
Paper openreview.net/pdf?id=vOEXS39nOF
VToonify: Neural Portrait Style Transfer
VToonify for portrait style transfer. Powered by DualStyleGAN backbone, now with #stablediffusion!
Review https://bit.ly/3M9wgNP
Demo https://t.co/8gXzF3IrpB
Paper arxiv.org/pdf/2209.11224.pdf
Project mmlab-ntu.com/project/vtoonify
Code github.com/williamyang1991/VToonify
Stable Diffusion for #Pokemon
Fine-tuning Stable Diffusion to create a text-to-Pokemon generation model
Review https://bit.ly/3C9qBTw
Tutorial https://lambdalabs.com/blog/how-to-fine-tune-stable-diffusion-how-we-made-the-text-to-pokemon-model-at-lambda/
Imagen Video by #Google. SICK!
Novel text-conditional video generation via a cascade of video diffusion models 🤯
Review https://bit.ly/3SH2TVH
Project imagen.research.google/video/
Paper imagen.research.google/video/paper.pdf