NEW BOT Телеграм, страница

ML Research Hub

🔹 Title: Training Vision-Language Process Reward Models for Test-Time Scaling in Multimodal Reasoning: Key Insights and Lessons Learned

🔹 Publication Date: Published on Sep 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.23250
• PDF: https://arxiv.org/pdf/2509.23250
• Github: https://github.com/theogbrand/vlprm

🔹 Datasets citing this paper:
• https://huggingface.co/datasets/ob11/VL-PRM-Evaluation-Results
• https://huggingface.co/datasets/ob11/VL-PRM300K
• https://huggingface.co/datasets/ob11/VL-PRM300K-train

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

184 views02:00

Explore Data Science

ML Research Hub

🔹 Title: GEM: A Gym for Agentic LLMs

🔹 Publication Date: Published on Oct 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.01051
• PDF: https://arxiv.org/pdf/2510.01051
• Github: https://github.com/axon-rl/gem

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

227 views03:00

Explore Data Science

ML Research Hub

🔹 Title: Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum

🔹 Publication Date: Published on Oct 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.00526
• PDF: https://arxiv.org/pdf/2510.00526
• Github: https://github.com/GaotangLi/Beyond-Log-Likelihood

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

146 views03:01

Explore Data Science

ML Research Hub

🔹 Title: On Predictability of Reinforcement Learning Dynamics for Large Language Models

🔹 Publication Date: Published on Oct 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.00553
• PDF: https://arxiv.org/pdf/2510.00553

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

153 views03:01

Explore Data Science

ML Research Hub

🔹 Title: GUI-KV: Efficient GUI Agents via KV Cache with Spatio-Temporal Awareness

🔹 Publication Date: Published on Oct 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.00536
• PDF: https://arxiv.org/pdf/2510.00536

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

133 views03:01

Explore Data Science

ML Research Hub

🔹 Title: JoyAgent-JDGenie: Technical Report on the GAIA

🔹 Publication Date: Published on Oct 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.00510
• PDF: https://arxiv.org/pdf/2510.00510
• Github: https://github.com/jd-opensource/joyagent-jdgenie

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

177 views03:01

Explore Data Science

ML Research Hub

🔹 Title: VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators

🔹 Publication Date: Published on Oct 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.00406
• PDF: https://arxiv.org/pdf/2510.00406
• Project Page: https://vla-rft.github.io/
• Github: https://github.com/OpenHelix-Team/VLA-RFT

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

223 views03:01

Explore Data Science

ML Research Hub

🔹 Title: Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution

🔹 Publication Date: Published on Sep 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.25301
• PDF: https://arxiv.org/pdf/2509.25301
• Github: https://github.com/OPPO-PersonalAI/Flash-Searcher

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

177 views04:01

Explore Data Science

ML Research Hub

🔹 Title: Boolean Satisfiability via Imitation Learning

🔹 Publication Date: Published on Sep 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.25411
• PDF: https://arxiv.org/pdf/2509.25411
• Github: https://github.com/zewei-Zhang/ImitSAT

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

163 views04:01

Explore Data Science

ML Research Hub

🔹 Title: An Empirical Study of Testing Practices in Open Source AI Agent Frameworks and Agentic Applications

🔹 Publication Date: Published on Sep 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.19185
• PDF: https://arxiv.org/pdf/2509.19185

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

205 views04:01

Explore Data Science

ML Research Hub

🔹 Title: DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

🔹 Publication Date: Published on Sep 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/pdf/2509.25454
• PDF: https://arxiv.org/pdf/2509.25454

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

225 views05:02

Explore Data Science

ML Research Hub

🔹 Title: Infusing Theory of Mind into Socially Intelligent LLM Agents

🔹 Publication Date: Published on Sep 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.22887
• PDF: https://arxiv.org/pdf/2509.22887

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

❤1

221 views05:02

Explore Data Science

ML Research Hub

🔹 Title: Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

🔹 Publication Date: Published on Sep 30

🔹 Paper Links:
• arXiv Page: https://www.arxiv.org/abs/2509.25849
• PDF: https://arxiv.org/pdf/2509.25849

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

❤1

214 views05:02

Explore Data Science

ML Research Hub

🔹 Title: Making, not Taking, the Best of N

🔹 Publication Date: Published on Oct 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.00931
• PDF: https://arxiv.org/pdf/2510.00931

🔹 Datasets citing this paper:
• https://huggingface.co/datasets/CohereLabs/fusion-synth-data-geofactx
• https://huggingface.co/datasets/CohereLabs/fusion-pairwise-evals-test-time-scaling
• https://huggingface.co/datasets/CohereLabs/fusion-pairwise-evals-finetuned
• https://huggingface.co/datasets/CohereLabs/fusion-synth-data-ufb

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

❤1

204 views06:02

Explore Data Science

ML Research Hub

🔹 Title: BroRL: Scaling Reinforcement Learning via Broadened Exploration

🔹 Publication Date: Published on Oct 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.01180
• PDF: https://arxiv.org/pdf/2510.01180

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

❤2

211 views06:02

Explore Data Science

ML Research Hub

🔹 Title: ACON: Optimizing Context Compression for Long-horizon LLM Agents

🔹 Publication Date: Published on Oct 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.00615
• PDF: https://arxiv.org/pdf/2510.00615

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

205 views07:02

Explore Data Science

ML Research Hub

🔹 Title: Eliciting Secret Knowledge from Language Models

🔹 Publication Date: Published on Oct 1

🔹 Paper Links:
• arXiv Page: https://huggingface.co/collections/bcywinski/eliciting-secret-knowledge-from-language-models-68de1a49ae6fa034e5c105ff
• PDF: https://arxiv.org/pdf/2510.01070
• Github: https://github.com/cywinski/eliciting-secret-knowledge

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

❤1

211 views07:02

Explore Data Science

ML Research Hub

🔹 Title: ReSWD: ReSTIR'd, not shaken. Combining Reservoir Sampling and Sliced Wasserstein Distance for Variance Reduction

🔹 Publication Date: Published on Oct 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.01061
• PDF: https://arxiv.org/pdf/2510.01061
• Project Page: https://reservoirswd.github.io/
• Github: https://reservoirswd.github.io/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

216 views07:02

Explore Data Science

ML Research Hub

🔹 Title: CurES: From Gradient Analysis to Efficient Curriculum Learning for Reasoning LLMs

🔹 Publication Date: Published on Oct 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.01037
• PDF: https://arxiv.org/pdf/2510.01037
• Github: https://github.com/ZexuSun/CurES

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

209 views08:03

Explore Data Science

ML Research Hub

🔹 Title: VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs

🔹 Publication Date: Published on Sep 30

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.25916
• PDF: https://arxiv.org/pdf/2509.25916

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

236 views08:03

Explore Data Science

ML Research Hub

🔹 Title: Hyperdimensional Probe: Decoding LLM Representations via Vector Symbolic Architectures

🔹 Publication Date: Published on Sep 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.25045
• PDF: https://arxiv.org/pdf/2509.25045
• Github: https://github.com/Ipazia-AI/hyperprobe

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

297 views08:03

Explore Data Science

About

Blog

Apps

Platform