ML Research Hub – Telegram
ML Research Hub
32.7K subscribers
4.09K photos
237 videos
23 files
4.41K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
🤖🧠 Reinforcement Learning for Large Language Models: A Complete Guide from Foundations to Frontiers Arun Shankar, AI Engineer at Google

🗓️ 27 Oct 2025
📚 AI News & Trends

Artificial Intelligence is evolving rapidly and at the center of this evolution is Reinforcement Learning (RL), the science of teaching machines to make better decisions through experience and feedback. In “Reinforcement Learning for Large Language Models: A Complete Guide from Foundations to Frontiers”, Arun Shankar, an Applied AI Engineer at Google presents one of the ...

#ReinforcementLearning #LargeLanguageModels #ArtificialIntelligence #MachineLearning #AIEngineer #Google
🔹 Title: VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting

🔹 Publication Date: Published on Oct 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.21817
• PDF: https://arxiv.org/pdf/2510.21817
• Project Page: https://lxysl.github.io/VITA-E/
• Github: https://github.com/Tencent/VITA/tree/VITA-E

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Language Server CLI Empowers Language Agents with Process Rewards

🔹 Publication Date: Published on Oct 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.22907
• PDF: https://arxiv.org/pdf/2510.22907
• Github: https://yifanzhang-pro.github.io/lanser-cli

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences

🔹 Publication Date: Published on Oct 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.23451
• PDF: https://arxiv.org/pdf/2510.23451
• Github: https://github.com/HongbangYuan/OmniReward

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: MARS-M: When Variance Reduction Meets Matrices

🔹 Publication Date: Published on Oct 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.21800
• PDF: https://arxiv.org/pdf/2510.21800
• Project Page: https://github.com/AGI-Arena/MARS/tree/main/MARS_M
• Github: https://github.com/AGI-Arena/MARS/tree/main/MARS_M

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

🔹 Publication Date: Published on Oct 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.23607
• PDF: https://arxiv.org/pdf/2510.23607
• Project Page: https://pointcept.github.io/Concerto/
• Github: https://github.com/Pointcept/Pointcept

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: FARMER: Flow AutoRegressive Transformer over Pixels

🔹 Publication Date: Published on Oct 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.23588
• PDF: https://arxiv.org/pdf/2510.23588

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction

🔹 Publication Date: Published on Oct 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.22706
• PDF: https://arxiv.org/pdf/2510.22706
• Github: https://github.com/lifuguan/IGGT_official

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: E^2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker

🔹 Publication Date: Published on Oct 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.22733
• PDF: https://arxiv.org/pdf/2510.22733
• Project Page: https://alibaba-nlp.github.io/E2Rank/
• Github: https://alibaba-nlp.github.io/E2Rank

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: PixelRefer: A Unified Framework for Spatio-Temporal Object Referring with Arbitrary Granularity

🔹 Publication Date: Published on Oct 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.23603
• PDF: https://arxiv.org/pdf/2510.23603
• Project Page: https://circleradon.github.io/PixelRefer/
• Github: https://github.com/alibaba-damo-academy/PixelRefer

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: LimRank: Less is More for Reasoning-Intensive Information Reranking

🔹 Publication Date: Published on Oct 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.23544
• PDF: https://arxiv.org/pdf/2510.23544

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: RobotArena infty: Scalable Robot Benchmarking via Real-to-Sim Translation

🔹 Publication Date: Published on Oct 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.23571
• PDF: https://arxiv.org/pdf/2510.23571

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Distilled Decoding 2: One-step Sampling of Image Auto-regressive Models with Conditional Score Distillation

🔹 Publication Date: Published on Oct 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.21003
• PDF: https://arxiv.org/pdf/2510.21003
• Github: https://imagination-research.github.io/distilled-decoding/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: PRISM-Bench: A Benchmark of Puzzle-Based Visual Tasks with CoT Error Detection

🔹 Publication Date: Published on Oct 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.23594
• PDF: https://arxiv.org/pdf/2510.23594
• Github: https://github.com/JornyWan/PRISM-Bench

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: ReCode: Unify Plan and Action for Universal Granularity Control

🔹 Publication Date: Published on Oct 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.23564
• PDF: https://arxiv.org/pdf/2510.23564

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Knocking-Heads Attention

🔹 Publication Date: Published on Oct 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.23052
• PDF: https://arxiv.org/pdf/2510.23052

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: LightBagel: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation

🔹 Publication Date: Published on Oct 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.22946
• PDF: https://arxiv.org/pdf/2510.22946
• Project Page: https://ucsc-vlaa.github.io/LightBagel/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: LongCat-Video Technical Report

🔹 Publication Date: Published on Oct 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.22200
• PDF: https://arxiv.org/pdf/2510.22200
• Github: https://github.com/meituan-longcat/LongCat-Video

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation

🔹 Publication Date: Published on Oct 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.23581
• PDF: https://arxiv.org/pdf/2510.23581
• Project Page: https://lookahead-anchoring.github.io/
• Github: https://lookahead-anchoring.github.io/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Track, Inpaint, Resplat: Subject-driven 3D and 4D Generation with Progressive Texture Infilling

🔹 Publication Date: Published on Oct 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.23605
• PDF: https://arxiv.org/pdf/2510.23605

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: VoMP: Predicting Volumetric Mechanical Property Fields

🔹 Publication Date: Published on Oct 27

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.22975
• PDF: https://arxiv.org/pdf/2510.22975
• Project Page: https://research.nvidia.com/labs/sil/projects/vomp

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT