ML Research Hub
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
🔹 Title: PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning

🔹 Publication Date: Published on Oct 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13809
• PDF: https://arxiv.org/pdf/2510.13809
• Project Page: https://sihuiji.github.io/PhysMaster-Page/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Hard2Verify: A Step-Level Verification Benchmark for Open-Ended Frontier Math

🔹 Publication Date: Published on Oct 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13744
• PDF: https://arxiv.org/pdf/2510.13744
• GitHub: https://github.com/SalesforceAIResearch/Hard2Verify

🔹 Datasets citing this paper:
https://huggingface.co/datasets/Salesforce/Hard2Verify

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: InteractiveOmni: A Unified Omni-modal Model for Audio-Visual Multi-turn Dialogue

🔹 Publication Date: Published on Oct 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13747
• PDF: https://arxiv.org/pdf/2510.13747

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning

🔹 Publication Date: Published on Oct 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13515
• PDF: https://arxiv.org/pdf/2510.13515
• Project Page: https://garygutc.github.io/UniME-v2/
• GitHub: https://github.com/GaryGuTC/UniME-v2

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs

🔹 Publication Date: Published on Oct 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.04767
• PDF: https://arxiv.org/pdf/2510.04767
• Project Page: https://parallelbench.github.io
• GitHub: https://github.com/furiosa-ai/ParallelBench

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.07944
• PDF: https://arxiv.org/pdf/2510.07944
• Project Page: https://sensetime-fvg.github.io/CVD-STORM/
• GitHub: https://github.com/SenseTime-FVG/OpenDWM

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Revisiting Model Interpolation for Efficient Reasoning

🔹 Publication Date: Published on Oct 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.10977
• PDF: https://arxiv.org/pdf/2510.10977
• GitHub: https://github.com/wutaiqiang/MI

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs

🔹 Publication Date: Published on Oct 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13795
• PDF: https://arxiv.org/pdf/2510.13795

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: NOSA: Native and Offloadable Sparse Attention

🔹 Publication Date: Published on Oct 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13602
• PDF: https://arxiv.org/pdf/2510.13602

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving

🔹 Publication Date: Published on Oct 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.12560
• PDF: https://arxiv.org/pdf/2510.12560
• Project Page: https://seu-zxj.github.io/CoIRL-AD/
• GitHub: https://github.com/SEU-zxj/CoIRL-AD

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: MTSQL-R1: Towards Long-Horizon Multi-Turn Text-to-SQL via Agentic Training

🔹 Publication Date: Published on Oct 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.12831
• PDF: https://arxiv.org/pdf/2510.12831
• GitHub: https://github.com/taichengguo/MTSQL-R1

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: HyperAgent: Leveraging Hypergraphs for Topology Optimization in Multi-Agent Communication

🔹 Publication Date: Published on Oct 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.10611
• PDF: https://arxiv.org/pdf/2510.10611

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: GraphTracer: Graph-Guided Failure Tracing in LLM Agents for Robust Multi-Turn Deep Search

🔹 Publication Date: Published on Oct 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.10581
• PDF: https://arxiv.org/pdf/2510.10581

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy

🔹 Publication Date: Published on Oct 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13778
• PDF: https://arxiv.org/pdf/2510.13778
• Project Page: https://internrobotics.github.io/internvla-m1.github.io/
• GitHub: https://github.com/InternRobotics/InternVLA-M1

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

🔹 Publication Date: Published on Oct 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13759
• PDF: https://arxiv.org/pdf/2510.13759

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

🔹 Publication Date: Published on Oct 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13626
• PDF: https://arxiv.org/pdf/2510.13626
• Project Page: https://sylvestf.github.io/LIBERO-plus/
• GitHub: https://github.com/sylvestf/LIBERO-plus

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE

🔹 Publication Date: Published on Oct 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13344
• PDF: https://arxiv.org/pdf/2510.13344
• GitHub: https://github.com/HITsz-TMG/Uni-MoE/blob/master/UniMoE-Audio

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Direct Multi-Token Decoding

🔹 Publication Date: Published on Oct 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.11958
• PDF: https://arxiv.org/pdf/2510.11958

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

🔹 Publication Date: Published on Oct 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13554
• PDF: https://arxiv.org/pdf/2510.13554

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: FG-CLIP 2: A Bilingual Fine-grained Vision-Language Alignment Model

🔹 Publication Date: Published on Oct 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.10921
• PDF: https://arxiv.org/pdf/2510.10921
• Project Page: https://360cvgroup.github.io/FG-CLIP/

🔹 Datasets citing this paper:
https://huggingface.co/datasets/qihoo360/DCI-CN
https://huggingface.co/datasets/qihoo360/BoxClass-CN
https://huggingface.co/datasets/qihoo360/LIT-CN
https://huggingface.co/datasets/qihoo360/DOCCI-CN

🔹 Spaces citing this paper:
https://huggingface.co/spaces/qihoo360/FG-CLIP2-Retrieval-demo
https://huggingface.co/spaces/qihoo360/FG-CLIP2-Densefeature-demo
==================================

🔹 Title: X-VLA: Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model

🔹 Publication Date: Published on Oct 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.10274
• PDF: https://arxiv.org/pdf/2510.10274
• Project Page: https://thu-air-dream.github.io/X-VLA/
• GitHub: https://github.com/2toinf/X-VLA.git

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================
