UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling
Paper: https://arxiv.org/pdf/2408.04810v1.pdf
Code: https://github.com/facebookresearch/unibench
Datasets: https://www.cs.toronto.edu/~kriz/cifar.html
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.04810v1.pdf
Code: https://github.com/facebookresearch/unibench
Datasets: https://www.cs.toronto.edu/~kriz/cifar.html
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
Paper: https://arxiv.org/pdf/2408.02900v1.pdf
Code: https://github.com/UCSC-VLAA/MedTrinity-25M
Datasets: https://yunfeixie233.github.io/MedTrinity-25M/
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.02900v1.pdf
Code: https://github.com/UCSC-VLAA/MedTrinity-25M
Datasets: https://yunfeixie233.github.io/MedTrinity-25M/
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Paper: https://arxiv.org/pdf/2408.08152v1.pdf
Code: https://github.com/deepseek-ai/deepseek-prover-v1.5
Datasets: https://github.com/zhangir-azerbayev/ProofNet
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.08152v1.pdf
Code: https://github.com/deepseek-ai/deepseek-prover-v1.5
Datasets: https://github.com/zhangir-azerbayev/ProofNet
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
Rethinking Medical Anomaly Detection in Brain MRI: An Image Quality Assessment Perspective
Paper: https://arxiv.org/pdf/2408.08228v1.pdf
Code: https://github.com/zx-pan/medanomalydetection-iqa
Datasets: http://braintumorsegmentation.org/ || https://brain-development.org/ixi-dataset/
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.08228v1.pdf
Code: https://github.com/zx-pan/medanomalydetection-iqa
Datasets: http://braintumorsegmentation.org/ || https://brain-development.org/ixi-dataset/
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
HAIR: Hypernetworks-based All-in-One Image Restoration
Paper: https://arxiv.org/pdf/2408.08091v1.pdf
Code: https://github.com/toummHus/HAIR
Datasets: https://www2.eecs.berkeley.edu/Research/Projects/CS/vision/bsds/ ||https://seungjunnah.github.io/Datasets/gopro
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.08091v1.pdf
Code: https://github.com/toummHus/HAIR
Datasets: https://www2.eecs.berkeley.edu/Research/Projects/CS/vision/bsds/ ||https://seungjunnah.github.io/Datasets/gopro
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
MambaMIM: Pre-training Mamba with State Space Token-interpolation
Paper: https://arxiv.org/pdf/2408.08070v1.pdf
Code: https://github.com/fenghetan9/mambamim
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.08070v1.pdf
Code: https://github.com/fenghetan9/mambamim
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
ChatGPT Telegram Bot: GPT-4o. Fast. No daily limits.
Group Chat support (/help_group_chat to get instructions) Voice message recognition Code highlighting. Write code with AI!
15 special chat modes:
👩🏼🎓 Assistant,
👩🏼💻 Code Assistant,
👩🎨 Artist,
🧠 Psychologist,
🚀 Elon Musk and other
Group Chat support (/help_group_chat to get instructions) Voice message recognition Code highlighting. Write code with AI!
15 special chat modes:
👩🏼🎓 Assistant,
👩🏼💻 Code Assistant,
👩🎨 Artist,
🧠 Psychologist,
🚀 Elon Musk and other
BadMerging: Backdoor Attacks Against Model Merging
Paper: https://arxiv.org/pdf/2408.07362v1.pdf
Code: https://github.com/jzhang538/badmerging
Datasets: ImageNet ||CIFAR-100 || SVHN || EuroSAT
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.07362v1.pdf
Code: https://github.com/jzhang538/badmerging
Datasets: ImageNet ||CIFAR-100 || SVHN || EuroSAT
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
❤1👍1
ML Research Hub
BadMerging: Backdoor Attacks Against Model Merging Paper: https://arxiv.org/pdf/2408.07362v1.pdf Code: https://github.com/jzhang538/badmerging Datasets: ImageNet ||CIFAR-100 || SVHN || EuroSAT https://news.1rj.ru/str/DataScienceT ⭐️
If the articles we publish have changed your life, then kindly interact with the books using Telegram stars ⭐️
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper: https://arxiv.org/pdf/2408.06292v2.pdf
Code: https://github.com/sakanaai/ai-scientist
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.06292v2.pdf
Code: https://github.com/sakanaai/ai-scientist
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
ControlNeXt: Powerful and Efficient Control for Image and Video Generation
Paper: https://arxiv.org/pdf/2408.06070v2.pdf
Code: https://github.com/dvlab-research/controlnext
Datasets: https://laion.ai/blog/laion-5b/
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.06070v2.pdf
Code: https://github.com/dvlab-research/controlnext
Datasets: https://laion.ai/blog/laion-5b/
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
2e8dd625_3527_4643_86b6_3b96575881a9.gif
24 MB
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Paper: https://arxiv.org/pdf/2407.20183v1.pdf
Code: https://github.com/internlm/mindsearch
Datasets: HotpotQA || Bamboogle
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2407.20183v1.pdf
Code: https://github.com/internlm/mindsearch
Datasets: HotpotQA || Bamboogle
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
This media is not supported in your browser
VIEW IN TELEGRAM
MixTex: Unambiguous Recognition Should Not Rely Solely on Real Data
Paper: https://arxiv.org/pdf/2406.17148v2.pdf
Code: https://github.com/RQLuo/MixTeX
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2406.17148v2.pdf
Code: https://github.com/RQLuo/MixTeX
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
👍1
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Paper: https://arxiv.org/pdf/2408.02657v1.pdf
Code: https://github.com/alpha-vllm/lumina-mgpt || https://github.com/alpha-vllm/lumina-t2x
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.02657v1.pdf
Code: https://github.com/alpha-vllm/lumina-mgpt || https://github.com/alpha-vllm/lumina-t2x
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
👍2
Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving
Paper: https://arxiv.org/pdf/2406.03877v2.pdf
Code: https://github.com/Thinklab-SJTU/Bench2Drive
Datasets: nuScenes || CARLA
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2406.03877v2.pdf
Code: https://github.com/Thinklab-SJTU/Bench2Drive
Datasets: nuScenes || CARLA
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
This media is not supported in your browser
VIEW IN TELEGRAM
EasySpider: A No-Code Visual System for Crawling the Web
Paper: https://dl.acm.org/doi/pdf/10.1145/3543873.3587345
Code: https://github.com/NaiboWang/EasySpider
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://dl.acm.org/doi/pdf/10.1145/3543873.3587345
Code: https://github.com/NaiboWang/EasySpider
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
RepViT: Revisiting Mobile CNN From ViT Perspective
Papers: https://arxiv.org/pdf/2307.09283v8.pdf
http://openaccess.thecvf.com//content/CVPR2024/papers/Wang_RepViT_Revisiting_Mobile_CNN_From_ViT_Perspective_CVPR_2024_paper.PDF
Codes: https://github.com/rwightman/pytorch-image-models
https://github.com/jameslahm/RepViT
https://github.com/leondgarse/keras_cv_attention_models/tree/main/keras_cv_attention_models/repvit
https://github.com/2023-MindSpore-4/Code10/tree/main/VIT
Datasets: ImageNet || MS COCO || ADE20K
https://news.1rj.ru/str/DataScienceT⭐️
Papers: https://arxiv.org/pdf/2307.09283v8.pdf
http://openaccess.thecvf.com//content/CVPR2024/papers/Wang_RepViT_Revisiting_Mobile_CNN_From_ViT_Perspective_CVPR_2024_paper.PDF
Codes: https://github.com/rwightman/pytorch-image-models
https://github.com/jameslahm/RepViT
https://github.com/leondgarse/keras_cv_attention_models/tree/main/keras_cv_attention_models/repvit
https://github.com/2023-MindSpore-4/Code10/tree/main/VIT
Datasets: ImageNet || MS COCO || ADE20K
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
👍4
AudioLM: a Language Modeling Approach to Audio Generation
Paper: https://arxiv.org/pdf/2209.03143v2.pdf
Codes: https://github.com/suno-ai/bark
https://github.com/plachtaa/vall-e-x
https://github.com/serp-ai/bark-with-voice-clone
https://github.com/lucidrains/audiolm-pytorch
https://github.com/RoganInglis/AudioLM
Datasets: LibriSpeech || Libri-Light
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2209.03143v2.pdf
Codes: https://github.com/suno-ai/bark
https://github.com/plachtaa/vall-e-x
https://github.com/serp-ai/bark-with-voice-clone
https://github.com/lucidrains/audiolm-pytorch
https://github.com/RoganInglis/AudioLM
Datasets: LibriSpeech || Libri-Light
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
👍4
Perturb-and-Compare Approach for Detecting Out-of-Distribution Samples in Constrained Access Environments
Paper: https://arxiv.org/pdf/2408.10107v1.pdf
Code: https://github.com/hy18284/mixdiff
Datasets: CIFAR-10 || CIFAR-100 || Tiny ImageNet
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2408.10107v1.pdf
Code: https://github.com/hy18284/mixdiff
Datasets: CIFAR-10 || CIFAR-100 || Tiny ImageNet
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models
Paper: https://arxiv.org/pdf/2407.07895v2.pdf
Code: https://github.com/LLaVA-VL/LLaVA-NeXT
Datasets: ALFRED - ActivityNet-QA - NExT-QA
https://news.1rj.ru/str/DataScienceT⭐️
Paper: https://arxiv.org/pdf/2407.07895v2.pdf
Code: https://github.com/LLaVA-VL/LLaVA-NeXT
Datasets: ALFRED - ActivityNet-QA - NExT-QA
https://news.1rj.ru/str/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
👍4❤1