This media is not supported in your browser
VIEW IN TELEGRAM
💬 Expanding Language-Image Pretrained Models for General Video Recognition by Microsoft.
Video-specific prompting scheme, which leverages video content information for generating discriminative textual prompts.
Github: https://github.com/microsoft/VideoX/tree/master/X-CLIP
Paper: https://arxiv.org/abs/2208.02816v1
Dataset: https://paperswithcode.com/dataset/ucf101
@ai_machinelearning_big_data
Video-specific prompting scheme, which leverages video content information for generating discriminative textual prompts.
Github: https://github.com/microsoft/VideoX/tree/master/X-CLIP
Paper: https://arxiv.org/abs/2208.02816v1
Dataset: https://paperswithcode.com/dataset/ucf101
@ai_machinelearning_big_data
👍8❤3🔥3
This media is not supported in your browser
VIEW IN TELEGRAM
➡️ Prompt Tuning for Generative Multimodal Pretrained Models
Github: https://github.com/ofa-sys/ofa
Paper: https://arxiv.org/abs/2208.02532v1
Dataset: https://paperswithcode.com/dataset/snli-ve
Demo: https://huggingface.co/spaces/OFA-Sys/OFA-Generic_Interface
@ai_machinelearning_big_data
Github: https://github.com/ofa-sys/ofa
Paper: https://arxiv.org/abs/2208.02532v1
Dataset: https://paperswithcode.com/dataset/snli-ve
Demo: https://huggingface.co/spaces/OFA-Sys/OFA-Generic_Interface
@ai_machinelearning_big_data
🔥14🎉3👍2
🚀 @machinelearning_interview - в Канале собраны все возможные вопросы и ответы с собеседований по Аналитике данных и Машинному обучению. Канал от Data Analytics.
Материалы канала реально помогут подготовиться к data science собеседованию.
👉Перейти
Материалы канала реально помогут подготовиться к data science собеседованию.
👉Перейти
👍11❤1
💻 P2P: Tuning Pre-trained Image Models for Point Cloud Analysis with Point-to-Pixel Prompting
Github: https://github.com/wangzy22/P2P
Paper: https://arxiv.org/abs/2208.02812v1
Dataset: https://paperswithcode.com/dataset/imagenet
Model: https://shapenet.cs.stanford.edu/media/modelnet40_normal_resampled.zip
@ai_machinelearning_big_data
Github: https://github.com/wangzy22/P2P
Paper: https://arxiv.org/abs/2208.02812v1
Dataset: https://paperswithcode.com/dataset/imagenet
Model: https://shapenet.cs.stanford.edu/media/modelnet40_normal_resampled.zip
@ai_machinelearning_big_data
👍11🔥2
🧩 Per-Clip Video Object Segmentation
Progressive matching mechanism for efficient information-passing within a clip.
Github: https://github.com/pkyong95/PCVOS
Paper: https://arxiv.org/abs/2208.01924v1
Dataset: https://paperswithcode.com/dataset/davis
Video: https://youtu.be/6QATHDwrUx0
@ai_machinelearning_big_data
Progressive matching mechanism for efficient information-passing within a clip.
Github: https://github.com/pkyong95/PCVOS
Paper: https://arxiv.org/abs/2208.01924v1
Dataset: https://paperswithcode.com/dataset/davis
Video: https://youtu.be/6QATHDwrUx0
@ai_machinelearning_big_data
👍13
Эволюция машинного обучения и проблема «черного ящика»
Какие бенчмарки используются в ML?
Благодаря чему произошел прорыв в технологиях машинного обучения?
Как метод LIME помогает интерпретировать алгоритмы ИИ?
Почему нейронки по-прежнему работают не так, как хотелось бы ученым?
Ученые Yandex Research рассказали, как они исследуют логику нейросетей и почему людям до сих пор сложно прочитать их «мысли».
Какие бенчмарки используются в ML?
Благодаря чему произошел прорыв в технологиях машинного обучения?
Как метод LIME помогает интерпретировать алгоритмы ИИ?
Почему нейронки по-прежнему работают не так, как хотелось бы ученым?
Ученые Yandex Research рассказали, как они исследуют логику нейросетей и почему людям до сих пор сложно прочитать их «мысли».
nplus1.ru
Внимание, черный ящик
Как и зачем исследовать логику нейросетей
👍21🔥5❤1
🧠 Free AI for Beginners Course
Artificial Intelligence for Beginners
MindMap
Denoscription
@ai_machinelearning_big_data
Artificial Intelligence for Beginners
MindMap
Denoscription
@ai_machinelearning_big_data
🔥16❤5👍3😢2
🐚 Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers
Github: https://github.com/lkeab/BCNet
Paper: https://arxiv.org/abs/2208.04438v1
Dataset: https://paperswithcode.com/dataset/bdd100k
Video: https://www.youtube.com/watch?v=iHlGJppJGiQ
@ai_machinelearning_big_data
Github: https://github.com/lkeab/BCNet
Paper: https://arxiv.org/abs/2208.04438v1
Dataset: https://paperswithcode.com/dataset/bdd100k
Video: https://www.youtube.com/watch?v=iHlGJppJGiQ
@ai_machinelearning_big_data
👍8🔥2
📲LAMDA-SSL: Semi-Supervised Learning in Python
Github: https://github.com/ygzwqzd/lamda-ssl
Paper: https://arxiv.org/pdf/2208.04610.pdf
Docs: https://ygzwqzd.github.io/LAMDA-SSL
@ai_machinelearning_big_data
Github: https://github.com/ygzwqzd/lamda-ssl
Paper: https://arxiv.org/pdf/2208.04610.pdf
Docs: https://ygzwqzd.github.io/LAMDA-SSL
@ai_machinelearning_big_data
👍13🔥2👎1
🎼 ROC: A New Paradigm for Lyric-to-Melody Generation
Muzic is a research project on AI music that empowers music understanding and generation with deep learning and artificial intelligence.
Github: https://github.com/microsoft/muzic
Paper: https://arxiv.org/abs/2208.05697v1
Project: https://www.microsoft.com/en-us/research/project/ai-music/
@ai_machinelearning_big_data
Muzic is a research project on AI music that empowers music understanding and generation with deep learning and artificial intelligence.
Github: https://github.com/microsoft/muzic
Paper: https://arxiv.org/abs/2208.05697v1
Project: https://www.microsoft.com/en-us/research/project/ai-music/
@ai_machinelearning_big_data
👍17🔥6❤2
🗣 Speech Enhancement and Dereverberation with Diffusion-based Generative Models
Github: https://github.com/sp-uhh/sgmse
Paper: https://arxiv.org/abs/2208.05830v1
Pretrained checkpoints: https://drive.google.com/drive/folders/1CSnkhUSoiv3RG0xg7WEcVapyLuwDaLbe?usp=sharing
@ai_machinelearning_big_data
Github: https://github.com/sp-uhh/sgmse
Paper: https://arxiv.org/abs/2208.05830v1
Pretrained checkpoints: https://drive.google.com/drive/folders/1CSnkhUSoiv3RG0xg7WEcVapyLuwDaLbe?usp=sharing
@ai_machinelearning_big_data
🔥10👍3
🧔 StyleFaceV - Official PyTorch Implementation
StyleFaceV produces high-fidelity identity-preserving face videos with vivid movements
Github: https://github.com/arthur-qiu/stylefacev
Project: http://haonanqiu.com/projects/StyleFaceV.html
Video: https://youtu.be/BZNLcD04-Fc
Paper: https://arxiv.org/abs/2208.07862v1
Dataset: https://paperswithcode.com/dataset/faceforensics-1
@ai_machinelearning_big_data
StyleFaceV produces high-fidelity identity-preserving face videos with vivid movements
Github: https://github.com/arthur-qiu/stylefacev
Project: http://haonanqiu.com/projects/StyleFaceV.html
Video: https://youtu.be/BZNLcD04-Fc
Paper: https://arxiv.org/abs/2208.07862v1
Dataset: https://paperswithcode.com/dataset/faceforensics-1
@ai_machinelearning_big_data
🔥12👍3
🎆 Unifying Visual Perception by Dispersible Points Learning
Conceptually simple, flexible, and universal visual perception head for variant visual task
Github: https://github.com/sense-x/unihead
Paper: https://arxiv.org/abs/2208.08630v1
Model: https://drive.google.com/file/d/1TwFCog_PMd1HWA7s-s9pN2F_fgyMyR3x/view
Datasets: https://paperswithcode.com/dataset/imagenet
@ai_machinelearning_big_data
Conceptually simple, flexible, and universal visual perception head for variant visual task
Github: https://github.com/sense-x/unihead
Paper: https://arxiv.org/abs/2208.08630v1
Model: https://drive.google.com/file/d/1TwFCog_PMd1HWA7s-s9pN2F_fgyMyR3x/view
Datasets: https://paperswithcode.com/dataset/imagenet
@ai_machinelearning_big_data
👍12🔥3
⚙️ Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise
Github: https://github.com/arpitbansal297/cold-diffusion-models
Paper: https://arxiv.org/abs/2208.09392v1
Cold-Diffusion: https://arxiv.org/abs/2208.09392
Datasets: https://paperswithcode.com/dataset/celeba
@ai_machinelearning_big_data
Github: https://github.com/arpitbansal297/cold-diffusion-models
Paper: https://arxiv.org/abs/2208.09392v1
Cold-Diffusion: https://arxiv.org/abs/2208.09392
Datasets: https://paperswithcode.com/dataset/celeba
@ai_machinelearning_big_data
👍10🐳4🔥3
🔥 Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Masked "language" modeling on images (Imglish), texts (English), and image-text pairs ("parallel sentences") in a unified manner.
Github: https://github.com/microsoft/unilm/tree/master/beit
Paper: https://arxiv.org/abs/2208.10442v1
Datasets: https://paperswithcode.com/dataset/visual-genome
@ai_machinelearning_big_data
Masked "language" modeling on images (Imglish), texts (English), and image-text pairs ("parallel sentences") in a unified manner.
Github: https://github.com/microsoft/unilm/tree/master/beit
Paper: https://arxiv.org/abs/2208.10442v1
Datasets: https://paperswithcode.com/dataset/visual-genome
@ai_machinelearning_big_data
🔥12👍6😱4👏1
✅ Awesome-Dataset-Distillation
Github: https://github.com/Guang000/Awesome-Dataset-Distillation
Awesome Computer Vision: https://github.com/jbhuang0604/awesome-computer-vision
Paper: https://arxiv.org/abs/2208.11311v1
Datasets: https://paperswithcode.com/dataset/cifar-10
@ai_machinelearning_big_data
Github: https://github.com/Guang000/Awesome-Dataset-Distillation
Awesome Computer Vision: https://github.com/jbhuang0604/awesome-computer-vision
Paper: https://arxiv.org/abs/2208.11311v1
Datasets: https://paperswithcode.com/dataset/cifar-10
@ai_machinelearning_big_data
👍20🔥5❤3
🥇 The Complete Data Science Study Roadmap
➡️ Read
🎲 Statistics Fundamental by Josh Starmer
👨🎓 CS229 machine learning Stanford
@ai_machinelearning_big_data
➡️ Read
🎲 Statistics Fundamental by Josh Starmer
👨🎓 CS229 machine learning Stanford
@ai_machinelearning_big_data
👍19🔥2🕊1
⭐️ YOLOX-PAI: An Improved YOLOX Version by PAI
Github: https://github.com/alibaba/EasyCV
Paper: https://arxiv.org/abs/2208.13040v1
Datasets: https://paperswithcode.com/dataset/coco
@ai_machinelearning_big_data
Github: https://github.com/alibaba/EasyCV
Paper: https://arxiv.org/abs/2208.13040v1
Datasets: https://paperswithcode.com/dataset/coco
@ai_machinelearning_big_data
👍10🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
🔋 Self-Supervised Pyramid Representation Learning
for Multi-Label Visual Analysis and Beyond
Github: https://github.com/wesleyhsieh0806/ss-prl
Paper: https://arxiv.org/abs/2208.14439v1
Datasets: https://github.com/wesleyhsieh0806/ss-prl#books-prepare-dataset
Downstream: http://host.robots.ox.ac.uk/pascal/VOC/
@ai_machinelearning_big_data
for Multi-Label Visual Analysis and Beyond
Github: https://github.com/wesleyhsieh0806/ss-prl
Paper: https://arxiv.org/abs/2208.14439v1
Datasets: https://github.com/wesleyhsieh0806/ss-prl#books-prepare-dataset
Downstream: http://host.robots.ox.ac.uk/pascal/VOC/
@ai_machinelearning_big_data
🔥7👍3
🖼 PyTorch Image Quality (PIQ) is a collection of measures and metrics for image quality assessment.
$ pip install piq
Github: https://github.com/photosynthesis-team/piq
Paper: https://arxiv.org/abs/2208.14818v1
Docs: https://piq.readthedocs.io.
Datasets: https://paperswithcode.com/dataset/kadid-10k
@ai_machinelearning_big_data
$ pip install piq
Github: https://github.com/photosynthesis-team/piq
Paper: https://arxiv.org/abs/2208.14818v1
Docs: https://piq.readthedocs.io.
Datasets: https://paperswithcode.com/dataset/kadid-10k
@ai_machinelearning_big_data
🔥12👍8🤩1