AI with Papers - Artificial Intelligence & Deep Learning – Telegram
AI with Papers - Artificial Intelligence & Deep Learning
15.8K subscribers
148 photos
260 videos
14 files
1.37K links
All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#AI #chatGPT
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🈚 Seeing Through Occlusions 🈚

👉Novel NSF to see through occlusions, reflection suppression & shadow removal.

👉Review https://t.ly/5jcIG
👉Project https://light.princeton.edu/publication/nsf
👉Paper https://arxiv.org/pdf/2312.14235.pdf
👉Repo https://github.com/princeton-computational-imaging/NSF
11🤯7🔥3🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
👻 Avatar Behind Occlusions 👻

👉Neural rendering for occluded in-the-wild mono-videos. Decoupling scenes in occlusion, human, and background.

👉Review https://t.ly/8q__B
👉Paper https://arxiv.org/pdf/2401.00431.pdf
👉Project https://cs.stanford.edu/~xtiange/projects/wild2avatar
🔥113👏1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🕍 En3D: Generative 3D Humans 🕍

👉#Alibaba unveils En3D: generative scheme for sculpting HQ 3D human avatars. Zero-shot 3D generative scheme capable of producing visually realistic, geometrically accurate and content-wise diverse 3D humans without relying on pre-existing 3D or 2D asset.

👉Review https://t.ly/nGmDK
👉Project menyifang.github.io/projects/En3D/index.html
👉Paper https://arxiv.org/pdf/2401.01173.pdf
👉Repo (soon?) https://github.com/menyifang/En3D
🤯53🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
🐤 MagicVideo-V2 announced! 🐤

👉#Bytedance announces a novel multi-stage pipeline capable of generating high-aesthetic videos from textual denoscription

👉Review https://t.ly/zIq4v
👉Project https://lnkd.in/dKUrJPJd
👉Paper https://lnkd.in/dixnN-kU
🔥71👍1🥰1💩1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥 #6D Foundation Pose 🔥

👉#Nvidia unveils FoundationPose, a novel (and unified) foundation model for 6D object pose estimation and tracking.

👉Review https://t.ly/HGd4h
👉Project https://lnkd.in/dPcnBKWm
👉Paper https://lnkd.in/dixn_iHZ
👉Code coming 🩷
🔥125👏1🤯1
🃏ReplaceAnything: demo is out!🃏

👉ReplaceAnything: ultra-high quality content replacement. The ultimate #AI solution for human, clothing & background replacement to change the e-commerce experience for vendors.

👉Review https://t.ly/FMyvf
👉Project https://lnkd.in/dcyZvP2b
👉ModelScope https://lnkd.in/dU4x4nE6
👉Hugging Face https://lnkd.in/dn3uXWgd
👉Empty report https://lnkd.in/dcuGXd6c
👉Paper coming?
11👍3👏2😍1
This media is not supported in your browser
VIEW IN TELEGRAM
🥛 Transparent Object Tracking 🥛

👉Trans2k: transparent object tracking dataset of 2,000+ sequences with 100,000+ images, annotated by bounding boxes & segmentation mask.

👉Review https://t.ly/mEI6O
👉Paper https://lnkd.in/dsudY3DB
👉Project https://lnkd.in/d48SSJJ3
👉TOB https://lnkd.in/dykBUNfC
🔥18🤯73👍2😱2👏1
💊💊 AGNOSTIC Object Counting 💊💊

👉PseCo: combining SAM to segment all possible objects as mask proposals & CLIP to classify proposals to obtain accurate object counts. The new SOTA in both few-shot/zero-shot object counting/detection.

👉Review https://t.ly/e4iza
👉Paper https://lnkd.in/dbzMXKWG
👉Repo https://lnkd.in/db9Q9Pse
🔥17👍5🥰1👏1
💥 Announcing #Py4Ai Conference💥

👉 Super proud to unveil #Py4AI, the newest conference dedicated to exploring the depths of Python & AI. Py4AI is a 1-day free event for Python and Artificial Intelligence developers.

𝐓𝐡𝐞 𝐟𝐢𝐫𝐬𝐭 𝐛𝐚𝐭𝐜𝐡 𝐨𝐟 𝐬𝐩𝐞𝐚𝐤𝐞𝐫𝐬:
🚀Merve Noyan | #HuggingFace 🤗
🚀Gabriele Lombardi | ARGO Vision
🚀Amanda Cercas Curry | Uni. Bocconi
🚀Piero Savastano | Cheshire Cat AI
🚀Francesco Zuppichini | Zurich Insurance
🚀Andrea Palladino, PhD | Sr. Data Scientist

👉 More: https://www.linkedin.com/posts/visionarynet_py4ai-py4ai-python-activity-7152928716988243968-pOUn?utm_source=share&utm_medium=member_desktop
👍10👏21🥰1🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
💃Timeline Text-Driven Humans💃

👉Novel challenge: timeline control for text-driven motion synthesis of 3D Humans.

👉Review https://t.ly/HLm-N
👉Paper https://lnkd.in/esaR_M_9
👉Project https://lnkd.in/epCZDvFW
👉Repo coming
🔥136👍4👏3🤩1
🫒 AlphaGeometry: Olympiad-level AI 🫒

👉 Theorem prover for Euclidean plane geometry that sidesteps the need for human demonstrations by
synthesizing millions of theorems and proofs across different levels of complexity 🤯

👉Review https://t.ly/2-Z7C
👉Paper https://lnkd.in/g3QkqwCE
👉Blog https://lnkd.in/ge-mpM7q
👉Repo https://lnkd.in/gHjwks_9
🤯20👍3🥰2🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🦠 XINC: Pixels to Neurons 🦠

👉eXplaining the Implicit Neural Canvas (XINC) from the University of Maryland, is a unified framework for explaining properties of INRs by examining the strength of each neuron’s contribution to each output pixel

👉Review https://t.ly/wwAmz
👉Paper arxiv.org/pdf/2401.10217.pdf
👉Project namithap10.github.io/xinc
👉Repo github.com/namithap10/xinc
🤯9👍3👏2🔥1
👽 One Model <-> All Segmentations 👽

👉 10+ different segmentation tasks in one framework, including image-level, video-level, interactive segmentation, & open-vocabulary segmentation. All in one!

👉Review https://t.ly/fywVz
👉Paper https://lnkd.in/dw3S4B74
👉Project https://lnkd.in/dzHT9v45
👉Repo https://lnkd.in/d6fDCnSp
🔥17👍52🥰1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
😻 GARField: Group Anything 😻

👉 GARField is a novel approach for decomposing #3D scenes into a hierarchy of semantically meaningful groups from posed image inputs.

👉Review https://t.ly/6Hkeq
👉Paper https://lnkd.in/d28mfRcZ
👉Project https://lnkd.in/dzYdRNKy
👉Repo (coming) https://lnkd.in/d2VeRJCS
👍83🥰1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥 Depth Anything: new SOTA 🔥

👉Depth Anything: the new SOTA in monocular depth estimation (MDE), trained with 1.5M labeled images and 62M+ unlabeled images jointly. It's the new SOTA!

👉Review https://t.ly/tCBwO
👉Paper https://lnkd.in/djx-9k2J
👉Project https://lnkd.in/dYetqZFa
👉Repo https://lnkd.in/d87CrUGv
👉Demo🤗 https://lnkd.in/dJhvKBep
🔥173🥰2🤩2
This media is not supported in your browser
VIEW IN TELEGRAM
🎭 ULTRA-Realistic Avatar 🎭

👉Novel 3D avatar with enhanced fidelity of geometry, and superior quality of physically based rendering (PBR) textures without unwanted lighting.

👉Review https://t.ly/B3BEu
👉Project https://lnkd.in/dkUQHFEV
👉Paper https://lnkd.in/dtEQxrBu
👉Code coming 🩷
💩175👍2🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥Lumiere: SOTA video-gen🔥

👉#Google unveils Lumiere: Space-Time Diffusion Model for Realistic Video Generation. It's the new SOTA, tasks: Text-to-Video, Video Stylization, Cinemagraphs & Video Inpainting.

👉Review https://t.ly/nalJR
👉Paper https://lnkd.in/d-PvrGjT
👉Project https://t.ly/gK8hz
🔥184👍3👏2🤩2🥰1🤯1💩1
This media is not supported in your browser
VIEW IN TELEGRAM
🧪 SUPIR: SOTA restoration 🧪

👉SUPIR is the new SOTA in image restoration; suitable for restoration of blurry objects, defining the material texture of objects, and adjusting restoration based on high-level semantics

👉Review https://t.ly/wgObH
👉Project https://supir.xpixel.group/
👉Paper https://lnkd.in/dZPYcUuq
👉Demo coming 🩷 but no code announced :(
8🔥4🥰1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🫧 SAM + Open Models 🫧

👉Grounded SAM (w/ DINO) as an open-set detector to combine with SAM. It can seamlessly integrate with other Open-World models to accomplish more intricate visual tasks.

👉Review https://t.ly/FwasQ
👉Paper arxiv.org/pdf/2401.14159.pdf
👉Code github.com/IDEA-Research/Grounded-Segment-Anything
🔥9👏2👍1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
👢"Virtual Try-All" by #Amazon 👢

👉#Amazon announces ”Diffuse to Choose”: diffusion-based image-conditioned inpainting for VTON. Virtually place any e-commerce item in any setting.

👉Review https://t.ly/at07Y
👉Paper https://lnkd.in/dxR7nGtd
👉Project diffuse2choose.github.io/
15👍7🤯4🔥1🥰1