AI with Papers - Artificial Intelligence & Deep Learning – Telegram
15.8K subscribers
146 photos
260 videos
14 files
1.36K links
All the AI, with papers. Fresh daily updates on #DeepLearning #MachineLearning #LLM & #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#AI #chatGPT
🏵️ TORAS: SOTA #AI for annotation 🏵️

👉TORAS: a web-based, AI-powered, cooperative annotation platform.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
SOTA AI tools -> significant speedup
"Recipes" to define how to annotate
Repo with folder structure for storage
Also on-prem for (commercial) firms

More: https://bit.ly/3L78YI2
🔥9🤯2👍1
💮MAXIM: Multi-Axis MLP for Vision💮

👉#Google open-sources MAXIM, a multi-axis MLP for low-level vision

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Denoising, deblurring, dehazing, etc
Multi-axis gated MLP, linear complexity
Cross gating block, separate features
SOTA results on several datasets!

More: https://bit.ly/3Dmp8LI
🔥121👎1
🔥 A Survey on Diffusion Models 🔥

👉A comprehensive review of denoising diffusion models in #computervision 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Overview of diffusion models
A hot trend in generative AI
A multi-perspective categorization
Current limitations / new directions

More: https://bit.ly/3RYG5zP
5👍3🔥1
🉐#AI finds where IG photos are taken🉐

👉Brilliant work by Depoorter, a Belgian artist who works on #privacy, #AI & #socialmedia

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Recorded open cameras for weeks
Scraped all #Instagram photos
Matching Instagram vs. footage

More: https://bit.ly/3eL5dfc
😱18👍13🥰2
🈯SAMURAI: in-the-wild Shape/Material🈯

👉#Google SAMURAI: shape, BRDF, per-image pose & illumination. Relightable #3D assets for #AR/#VR.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Parametrization for varying distances
Camera multiplex optimization
Posterior scaling of input images
Explicit meshes extraction with BRDF
Code/data soon available ->#NeurIPS

More: https://bit.ly/3BKWgf3
👍8🔥1
🟨 Lang<->Pics in 100+ Languages 🟨

👉#Google PaLI: unified lang-image #AI to perform tasks in 109 languages 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
PaLI: Pathways Lang & Image model
Answering, captioning, reasoning, etc
From Eng. to 109 lang. understanding
The new SOTA on several datasets

More: https://bit.ly/3QMslHC
🔥6👍1💯1
🍐PeRFception: Largest IR Dataset🍐

👉#Nvidia, a new frontier in data collection via Plenoxels: same info, -96.4% in size.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
POSTECH + NVIDIA + Caltech = 🤯
Size: -96.4% from original dataset!
2D/3D image/object class/semantic
Ready-to-use pipeline for implicit dataset

More: https://bit.ly/3eW9hJA
9❤‍🔥1👍1😍1
🐸 CHARL-E: Stable Diffusion in 1 click 🐸

👉CHARL-E packages Stable Diffusion into a simple app.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
No setup, dependencies, or internet
Images with 1-click on #macbook
Runs only on M1/M2 processors
Source code under MIT license

More: https://bit.ly/3xv2z3G
🔥11👍3❤‍🔥11
🍋YOLOPv2: Better Driving Perception🍋

👉YOLOPv2: simultaneous object detection, road segmentation & lane detection

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
E2E perception net with better backbone
Efficient ELAN for reasonable memory
Stability for adapting to scenarios
SOTA on BDD100K, +50% faster!
Source code under MIT license

More: https://bit.ly/3LvYGBh
🔥12
🍈SegNeXt: new SOTA in Semantic Seg.🍈

👉SOTA (by large margin) on ADE20K, Cityscapes, COCO-Stuff, Pascal VOC, Pascal Context, and iSAID 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Novel tailored network architecture
Spatial attention via multi-scale feats
Encoder + conv. better than transformers
SOTA on several datasets (ADE20K, etc.)

More: https://bit.ly/3UrZhrH
🔥9👍1
🦪StereoVoxelNet: RT Obstacle Detection🦪

👉Novel deep-learning approach that detects occupancy directly from stereo images

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Occupancy voxels via deep learning
RT on Jetson-TX2 (-98% CPU of SOTA)
Optimization via octrees / sparse conv.
Real-world stereo in/outdoor dataset

More: https://bit.ly/3BylAn3
👍10🥰1
🚜 NeRF-Factory: a NeRF collection 🚜

👉PyTorch-reimplemented NeRF library with 7 popular models/implementations & 7 datasets

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
NeRF: Project | Paper | Code
NeRF++: Paper | Code
DVGO: Project | Paper v1/v2 | Code
Plenoxels: Project | Paper | Code
Mip-NeRF: Project | Paper | Code
Mip-NeRF360: Project | Paper | Code
Ref-NeRF: Project | Paper | Code

More: https://bit.ly/3qUgmgC
👍7🤯1
🥶 Lumos by #Nvidia: Relighting Portrait 🥶

👉The new SOTA in relighting without requiring a light stage

😎Review https://bit.ly/3dCH9ej
😎Project deepimagination.cc/Lumos
😎Paper arxiv.org/pdf/2209.10510.pdf
😎Demo http://imaginaire.cc/Lumos/
11👍1
🍜 SURF-GAN: NeRF -> StyleGAN 🍜

👉 Editable portraits by injecting the NeRF's prior into StyleGAN

😎Review https://bit.ly/3SohEw3
😎Project jgkwak95.github.io/surfgan
😎Paper arxiv.org/pdf/2207.10257.pdf
😎Code github.com/jgkwak95/SURF-GAN
👍42❤‍🔥1
🔥#Google just announced "TensorStore"🔥

👉Novel open-source C++ / #Python library for storage/manipulation of high-dim data

😎Review https://bit.ly/3DLwbha
😎Project https://bit.ly/3C4T2TR
😎Code github.com/google/tensorstore
🔥14👍2
🦠 Motion Transformer for #selfdriving 🦠

👉The 1st place solution for 2022 #waymo "motion prediction" challenge

😎Review https://bit.ly/3f8G4LD
😎Paper arxiv.org/pdf/2209.10033.pdf
😎Code github.com/sshaoshuai/MTR
🔥17👍3
💹 Image Synthesis @160+ FPS! 💹

👉Super-fast, 3D-Aware Image Synthesis with Sparse Voxels -> up to 167 FPS!

😎Review https://bit.ly/3r3ZNij
😎Paper arxiv.org/pdf/2206.07695.pdf
😎Project katjaschwarz.github.io/voxgraf
👏3🤯2🔥1💯1
👛 #Nvidia GET3D: #3D generative #AI 👛

👉AI-generated textured 3D meshes with complex topology, rich geometry & hi-fi textures

😎Review https://bit.ly/3SgnT5h
😎Code github.com/nv-tlabs/GET3D
😎Project nv-tlabs.github.io/GET3D/
😎Paper nv-tlabs.github.io/GET3D/assets/paper.pdf
❤‍🔥7👍5
🔥🔥 IDE-3D: source code is out! 🔥🔥

👉Novel, photorealistic, 3D-aware facial generator: source code just released!

😎Review https://bit.ly/3BNrO2C
😎Project mrtornado24.github.io/IDE-3D/
😎Code github.com/MrTornado24/IDE-3D
😎Paper arxiv.org/pdf/2205.15517.pdf
🤯8👍5🔥3🤩3
🔥Diffusion Model of Neural Checkpoints🔥

👉Conditional diffusion model trained on millions of checkpoints of a given task/architecture 🤯

😎Review https://bit.ly/3SBR4Qb
😎Project www.wpeebles.com/Gpt
😎Code github.com/wpeebles/G.pt
😎Paper arxiv.org/pdf/2209.12892.pdf
🤯51