AI with Papers - Artificial Intelligence & Deep Learning – Telegram
15.8K subscribers
All the AI with papers. Fresh daily updates on #DeepLearning #MachineLearning #LLM & #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#AI #chatGPT

👄LVD: new SOTA for #3D human👄

👉Corona et al. unveil a novel method for fitting 3D human models

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Solution via neural field
Not sensitive to initialization
SOTA in shape from single pic
SOTA in fitting 3D scans

More: https://bit.ly/3Ng4lLr

🏳️‍🌈Deep Clustering on ImageNet & Co.🏳️‍🌈

👉World's first deep nonparametric clustering on large datasets such as ImageNet

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Deep clustering that infers the number of clusters
Loss: amortized inference in mixture models
Deep nonparametric clustering on ImageNet
Code and model available under MIT license

More: https://bit.ly/38p62rn

💥HQ-E²FGVI just released💥💥

👉Flow-Guided Video Inpainting through three trainable modules

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Flow, pixel-prop, content hallucination
Three stage-modules, jointly optimized
The new SOTA, promising efficiency
Code and Models under MIT license

More: https://bit.ly/3Ln0ICj

🪔 AvatarCLIP: Text-Driven Avatar 🪔

👉Zero-shot text-driven generation and animation of #3D avatars for the #metaverse

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
First text-driven synthesis
Shape, texture, and motion
Animation-ready, HQ texture/geometry
Zero-shot text-guided ref-based motion
Code and model under MIT license

More: https://bit.ly/3LjTWgB

🔥#AIwithPapers: we are 2,500!🔥

💙💛Only 2 Billion papers remaining on arXiv. The more we are, the faster we read💙💛

😈 Invite your friends -> https://news.1rj.ru/str/AI_DeepLearning

💥Podcasting AI & CV💥

👉🏼For people fluent in Italian: a 1-hour podcast in which I talk about AI, CV, startups and more (including this wonderful project).

More: https://bit.ly/38DtBwB

🔥Inpainting: new SOTA! INSANE🔥

👉Novel two-stream approach: inpainting at the next level!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
High-freq locally, low-freq globally
Local to global -> error correction
44% / 26% improvements FID/scores
Source code, more clips available

More: https://bit.ly/3ltIX9R

🔥Super-Human Crossword Solver🔥

👉Solving crosswords better than the best human solvers

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Crossword solving based on NNs
Q&A, structured decoding, local search
Wide domains with perfect accuracy
Large question-answer dataset

More: https://bit.ly/3a3zzqQ

🥸Imagen: far beyond DALL·E 2🥸

👉#Google: unprecedented photorealism and deep level of language understanding

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Dynamic thresh diffusion sampling
Efficient U-Net, efficient++ variant
DrawBench, a new text-to-image benchmark
The new SOTA, COCO FID of 7.27
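
The "dynamic thresh" highlight can be illustrated with a minimal sketch, assuming the scheme described in the Imagen paper: at each sampling step take a high percentile s of the absolute predicted pixel values, and if s > 1, clip the prediction to [-s, s] and divide by s. The function name, the flat-list input and the nearest-rank percentile are illustrative assumptions, not Google's implementation.

```python
import math


def dynamic_threshold(x0, percentile=99.5):
    """Rescale a predicted image so saturated pixels are pulled back into [-1, 1].

    x0 is a flat list of predicted pixel values (nominally in [-1, 1]).
    Hypothetical helper for illustration only.
    """
    # Sort the magnitudes so a percentile can be read off by rank.
    mags = sorted(abs(v) for v in x0)
    # Nearest-rank percentile (an assumption; any percentile estimator would do).
    rank = max(1, math.ceil(percentile / 100 * len(mags)))
    # Never shrink the range below 1: in-range predictions stay untouched.
    s = max(mags[rank - 1], 1.0)
    # Clip to [-s, s], then divide by s so the result lies in [-1, 1].
    return [max(-s, min(s, v)) / s for v in x0]


if __name__ == "__main__":
    print(dynamic_threshold([0.0, 2.0, -4.0], percentile=100))
```

Compared with static clipping to [-1, 1], the rescaling keeps the relative contrast between pixels instead of flattening everything that overshoots.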

More: https://bit.ly/3lVtkbz

🪤Tracking over SOTA detectors🪤

👉Lightweight Python lib for real-time 2D object tracking 💥

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Layer of tracking over SOTA detectors
Suitable for complex video processing
Source code under BSD 3-Clause
Maintained by Tryolabs team

More: https://bit.ly/3wKtGqg

🥷🏿 FCA: #3D Neural Camouflage 🥷🏿

👉#3D full-camouflage adversarial patch to fool neural detectors

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Attack via differentiable neural rendering
E2E physical adversarial attack
Envs, vehicles & detectors
Source code available!

More: https://bit.ly/38kKyfa

🍋 One-Shot Object Pose 🍋

👉A novel one-shot object pose estimator

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Visual localization pipeline for object pose
Handling novel objects without CAD model
Novel graph attention for 2D-3D matching
Large dataset for one-shot object pose

More: https://bit.ly/3MTogjJ

☄️STEVE: Slot-TransformEr for VidEos☄️

👉STEVE: unsupervised model for object-centric learning in videos

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Adoption of a slot decoder (SLATE)
SLATE with slot-level recurrence model
Complex and naturalistic videos
Significantly outperforms previous SOTA

More: https://bit.ly/3PNxxM3

🦔 CogVideo: insane text-to-clip 🦔

👉CogVideo: a 9B-parameter model, the world's first large-scale open-source text-to-video 😵

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Largest open-source T2C transformer
Finetuning of text-to-image model
Multi-frame-rate hierarchical training
From pretrained model CogView2

More: https://bit.ly/3Gzfl4n

🦄Time-Aware Neural Voxels🦄

👉TiNeuVox: "NeRF" with time-aware voxel features 😵

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Dynamic scene w/ optimizable structure
Temporal information in radiance net
Small/large motion w/ single-resolution features
192× faster than previous Hyper-NeRF

More: https://bit.ly/3wR4O08

🫐Neural Anomaly Detection by AWS🫐

👉Ultra-competitive inference and SOTA for both detection and localization

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Locally aggregated, mid-level patch features
Maximizing nominal information at test time
Reducing biases towards ImageNet classes
Image-level anomaly AUROC of up to 99.6%

More: https://bit.ly/3t7Ndjg

🛹 Project Skate from Google #AI 🛹

👉#AI tool to analyze the skateboarder's tricks in real-time

More: https://bit.ly/3zbQS3M

🧬Neural Text2Human Generation🧬

👉Text-driven neural human generation

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Full-body from a given human pose
Hierarchical texture-aware codebook
DeepFashion -> 44k Hi-Res images
Code and models available!

More: https://bit.ly/3Mdnpt0

🧨EfficientFormers: 1.6ms inference 🧨

👉Transformers as fast as MobileNet? Snap shows it on #iphone!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Low latency on mobile, high performance!
Revisiting the design of ViT through latency
New dimension-consistent design paradigm
EfficientFormers: a new ViT for mobile!

More: https://bit.ly/3MdgW15

🐢 Transformer-Based Sens-Fusion 🐢

👉Updating TransFuser (CVPR21): image + LiDAR representations with self-attention

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Existing approach can't handle traffic 😢
Novel multi-modal fusion transformer
The new SOTA in driving performance
Reducing avg collisions per km by 48%
Insights on current limitations of E2E

More: https://bit.ly/391dmd6