AI with Papers - Artificial Intelligence & Deep Learning – Telegram
AI with Papers - Artificial Intelligence & Deep Learning
15.8K subscribers
148 photos
260 videos
14 files
1.37K links
All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#AI #chatGPT
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🥛 Transparent Object Tracking 🥛

👉Trans2k: transparent object tracking dataset of 2,000+ sequences with 100,000+ images, annotated by bounding boxes & segmentation mask.

👉Review https://t.ly/mEI6O
👉Paper https://lnkd.in/dsudY3DB
👉Project https://lnkd.in/d48SSJJ3
👉TOB https://lnkd.in/dykBUNfC
🔥18🤯73👍2😱2👏1
💊💊 AGNOSTIC Object Counting 💊💊

👉PseCo: combining SAM to segment all possible objects as mask proposals & CLIP to classify proposals to obtain accurate object counts. The new SOTA in both few-shot/zero-shot object counting/detection.

👉Review https://t.ly/e4iza
👉Paper https://lnkd.in/dbzMXKWG
👉Repo https://lnkd.in/db9Q9Pse
🔥17👍5🥰1👏1
💥 Announcing #Py4Ai Conference💥

👉 Super proud to unveil #Py4AI, the newest conference dedicated to exploring the depths of Python & AI. Py4AI is a 1-day free event for Python and Artificial Intelligence developers.

𝐓𝐡𝐞 𝐟𝐢𝐫𝐬𝐭 𝐛𝐚𝐭𝐜𝐡 𝐨𝐟 𝐬𝐩𝐞𝐚𝐤𝐞𝐫𝐬:
🚀Merve Noyan | #HuggingFace 🤗
🚀Gabriele Lombardi | ARGO Vision
🚀Amanda Cercas Curry | Uni. Bocconi
🚀Piero Savastano | Cheshire Cat AI
🚀Francesco Zuppichini | Zurich Insurance
🚀Andrea Palladino, PhD | Sr. Data Scientist

👉 More: https://www.linkedin.com/posts/visionarynet_py4ai-py4ai-python-activity-7152928716988243968-pOUn?utm_source=share&utm_medium=member_desktop
👍10👏21🥰1🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
💃Timeline Text-Driven Humans💃

👉Novel challenge: timeline control for text-driven motion synthesis of 3D Humans.

👉Review https://t.ly/HLm-N
👉Paper https://lnkd.in/esaR_M_9
👉Project https://lnkd.in/epCZDvFW
👉Repo coming
🔥136👍4👏3🤩1
🫒 AlphaGeometry: Olympiad-level AI 🫒

👉 Theorem prover for Euclidean plane geometry that sidesteps the need for human demonstrations by
synthesizing millions of theorems and proofs across different levels of complexity 🤯

👉Review https://t.ly/2-Z7C
👉Paper https://lnkd.in/g3QkqwCE
👉Blog https://lnkd.in/ge-mpM7q
👉Repo https://lnkd.in/gHjwks_9
🤯20👍3🥰2🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🦠 XINC: Pixels to Neurons 🦠

👉eXplaining the Implicit Neural Canvas (XINC) from the University of Maryland, is a unified framework for explaining properties of INRs by examining the strength of each neuron’s contribution to each output pixel

👉Review https://t.ly/wwAmz
👉Paper arxiv.org/pdf/2401.10217.pdf
👉Project namithap10.github.io/xinc
👉Repo github.com/namithap10/xinc
🤯9👍3👏2🔥1
👽 One Model <-> All Segmentations 👽

👉 10+ different segmentation tasks in one framework, including image-level, video-level, interactive segmentation, & open-vocabulary segmentation. All in one!

👉Review https://t.ly/fywVz
👉Paper https://lnkd.in/dw3S4B74
👉Project https://lnkd.in/dzHT9v45
👉Repo https://lnkd.in/d6fDCnSp
🔥17👍52🥰1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
😻 GARField: Group Anything 😻

👉 GARField is a novel approach for decomposing #3D scenes into a hierarchy of semantically meaningful groups from posed image inputs.

👉Review https://t.ly/6Hkeq
👉Paper https://lnkd.in/d28mfRcZ
👉Project https://lnkd.in/dzYdRNKy
👉Repo (coming) https://lnkd.in/d2VeRJCS
👍83🥰1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥 Depth Anything: new SOTA 🔥

👉Depth Anything: the new SOTA in monocular depth estimation (MDE), trained with 1.5M labeled images and 62M+ unlabeled images jointly. It's the new SOTA!

👉Review https://t.ly/tCBwO
👉Paper https://lnkd.in/djx-9k2J
👉Project https://lnkd.in/dYetqZFa
👉Repo https://lnkd.in/d87CrUGv
👉Demo🤗 https://lnkd.in/dJhvKBep
🔥173🥰2🤩2
This media is not supported in your browser
VIEW IN TELEGRAM
🎭 ULTRA-Realistic Avatar 🎭

👉Novel 3D avatar with enhanced fidelity of geometry, and superior quality of physically based rendering (PBR) textures without unwanted lighting.

👉Review https://t.ly/B3BEu
👉Project https://lnkd.in/dkUQHFEV
👉Paper https://lnkd.in/dtEQxrBu
👉Code coming 🩷
💩175👍2🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥Lumiere: SOTA video-gen🔥

👉#Google unveils Lumiere: Space-Time Diffusion Model for Realistic Video Generation. It's the new SOTA, tasks: Text-to-Video, Video Stylization, Cinemagraphs & Video Inpainting.

👉Review https://t.ly/nalJR
👉Paper https://lnkd.in/d-PvrGjT
👉Project https://t.ly/gK8hz
🔥184👍3👏2🤩2🥰1🤯1💩1
This media is not supported in your browser
VIEW IN TELEGRAM
🧪 SUPIR: SOTA restoration 🧪

👉SUPIR is the new SOTA in image restoration; suitable for restoration of blurry objects, defining the material texture of objects, and adjusting restoration based on high-level semantics

👉Review https://t.ly/wgObH
👉Project https://supir.xpixel.group/
👉Paper https://lnkd.in/dZPYcUuq
👉Demo coming 🩷 but no code announced :(
8🔥4🥰1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🫧 SAM + Open Models 🫧

👉Grounded SAM (w/ DINO) as an open-set detector to combine with SAM. It can seamlessly integrate with other Open-World models to accomplish more intricate visual tasks.

👉Review https://t.ly/FwasQ
👉Paper arxiv.org/pdf/2401.14159.pdf
👉Code github.com/IDEA-Research/Grounded-Segment-Anything
🔥9👏2👍1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
👢"Virtual Try-All" by #Amazon 👢

👉#Amazon announces ”Diffuse to Choose”: diffusion-based image-conditioned inpainting for VTON. Virtually place any e-commerce item in any setting.

👉Review https://t.ly/at07Y
👉Paper https://lnkd.in/dxR7nGtd
👉Project diffuse2choose.github.io/
15👍7🤯4🔥1🥰1
This media is not supported in your browser
VIEW IN TELEGRAM
🦩 WildRGB-D: Objects in the Wild 🦩

👉#NVIDIA unveils a novel RGB-D object dataset captured in the wild: ~8500 recorded objects, ~20,000 RGBD videos, 46 categories with corresponding masks and 3D point clouds.

👉Review https://t.ly/WCqVz
👉Data github.com/wildrgbd/wildrgbd
👉Paper arxiv.org/pdf/2401.12592.pdf
👉Project wildrgbd.github.io/
👍93🔥2👏1🤩1😍1
This media is not supported in your browser
VIEW IN TELEGRAM
🌋EasyVolcap: Accelerating Neural Volumetric🌋

👉Novel #PyTorch library for accelerating neural video:volumetric video capturing, reconstruction & rendering

👉Review https://t.ly/8BISl
👉Paper arxiv.org/pdf/2312.06575.pdf
👉Code github.com/zju3dv/EasyVolcap
🔥10👍21🥰1👏1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🐙 Rock-Track announced! 🐙

👉Rock-Track: the evolution of Poly-MOT, the previous SOTA in 3D MOT Tracking-By-Detection framework.

👉Review https://t.ly/hC0ak
👉Repo, coming: https://lnkd.in/dtDkPwCC
👉Paper coming
👍4👏4🔥21🥰1
🧠350+ Free #AI Courses by #Google🧠

👉350+ free courses from #Google to become professional in #AI & #Cloud. The full catalog (900+) includes a variety of activity: videos, documents, labs, coding, and quizzes. 15+ supported languages. No excuse.

𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈
𝐈𝐧𝐭𝐫𝐨 𝐭𝐨 𝐋𝐋𝐌𝐬
𝐂𝐕 𝐰𝐢𝐭𝐡 𝐓𝐅
𝐃𝐚𝐭𝐚, 𝐌𝐋, 𝐀𝐈
𝐑𝐞𝐬𝐩𝐨𝐧𝐬𝐢𝐛𝐥𝐞 𝐀𝐈

👉Review: https://t.ly/517Dr
👉Full list: https://www.cloudskillsboost.google/catalog?page=1
13👍3👏2🍾2🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
🍋 Diffutoon: new SOTA video 🍋

👉Diffutoon is a cartoon shading approach, aiming to transform photorealistic videos in anime styles. It can handle exceptionally high resolutions and rapid motions. Source code released!

👉Review https://t.ly/sim2O
👉Paper https://lnkd.in/dPcSnAUu
👉Code https://lnkd.in/d9B_dGrf
👉Project https://lnkd.in/dpcsJcX2
🔥193🤯3👍1🥰1🤩1💩1🍾1
🥓 RANSAC -> PARSAC (neural) 🥓

👉Neural PARSAC: estimating multiple vanishing points (V), fundamental matrices (F) or homographies (H) at the speed of light! Source Code released 💙

👉Review https://t.ly/r9ngg
👉Paper https://lnkd.in/dadQ4Qec
👉Code https://lnkd.in/dYp6gADd
14👍31🥰1👏1