✨IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction
📝 Summary:
IterResearch improves long-horizon reasoning by reformulating it as a Markov Decision Process with strategic workspace reconstruction. This novel paradigm overcomes context suffocation, achieving substantial performance gains and unprecedented interaction scaling, and also serves as an effective ...
🔹 Publication Date: Published on Nov 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.07327
• PDF: https://arxiv.org/pdf/2511.07327
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#ReinforcementLearning #AI #MachineLearning #AIagents #MDP
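The core idea above can be sketched in a few lines: instead of appending every observation to an ever-growing context, each round folds new evidence into a compact, bounded workspace so the next decision depends only on that reconstructed state. The names below are illustrative, not the paper's actual API.

```python
# Hypothetical sketch of Markovian workspace reconstruction vs. the naive
# append-only context (function names are assumptions for illustration).

def append_history(history, observation):
    """Naive long-horizon loop: the context grows without bound."""
    return history + [observation]

def reconstruct_workspace(report, observation, max_items=3):
    """Markovian alternative: fold the new observation into a bounded
    report, so the next step conditions only on this compact state."""
    return (report + [observation])[-max_items:]  # keep recent evidence only

history, report = [], []
for step in range(10):
    obs = f"finding-{step}"
    history = append_history(history, obs)       # grows linearly: 10 items
    report = reconstruct_workspace(report, obs)  # stays bounded: 3 items

print(len(history))  # 10
print(len(report))   # 3
```

The bounded workspace is what lets the number of interactions scale without the context "suffocating" the model.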
✨MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs
📝 Summary:
MVU-Eval is a new comprehensive benchmark for evaluating Multi-Video Understanding in Multimodal Large Language Models. It addresses a critical gap in existing single-video benchmarks and reveals significant performance limitations in current MLLMs for multi-video scenarios.
🔹 Publication Date: Published on Nov 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.07250
• PDF: https://arxiv.org/pdf/2511.07250
• Project Page: https://huggingface.co/datasets/MVU-Eval-Team/MVU-Eval-Data
• Github: https://github.com/NJU-LINK/MVU-Eval
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#MLLMs #VideoUnderstanding #AI #Benchmarking #ComputerVision
✨The Station: An Open-World Environment for AI-Driven Discovery
📝 Summary:
The Station is an open-world multi-agent AI environment enabling autonomous scientific discovery. Agents engage in full scientific journeys, achieving state-of-the-art results across diverse benchmarks. This new paradigm fosters emergent behaviors and novel method development, moving beyond rigid...
🔹 Publication Date: Published on Nov 9
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.06309
• PDF: https://arxiv.org/pdf/2511.06309
• Github: https://github.com/dualverse-ai/station
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #MultiAgentSystems #ScientificDiscovery #OpenWorldAI #AutonomousAI
✨Robot Learning from a Physical World Model
📝 Summary:
PhysWorld enables robots to learn accurate manipulation from AI-generated videos by integrating video generation with physical world modeling. This approach grounds visual guidance into physically executable actions, eliminating the need for real robot data.
🔹 Publication Date: Published on Nov 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.07416
• PDF: https://arxiv.org/pdf/2511.07416
• Project Page: https://pointscoder.github.io/PhysWorld_Web/
• Github: https://github.com/PointsCoder/OpenReal2Sim
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#RobotLearning #Robotics #AI #PhysicalModeling #MachineLearning
✨DigiData: Training and Evaluating General-Purpose Mobile Control Agents
📝 Summary:
DigiData provides a diverse, high-quality dataset for training mobile control agents with complex goals from app feature exploration. DigiData-Bench offers dynamic AI-powered evaluation protocols, improving agent assessment beyond common metrics.
🔹 Publication Date: Published on Nov 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.07413
• PDF: https://arxiv.org/pdf/2511.07413
• Project Page: https://facebookresearch.github.io/DigiData
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#MobileAgents #ArtificialIntelligence #MachineLearning #Datasets #AgentTraining
✨SWE-fficiency: Can Language Models Optimize Real-World Repositories on Real Workloads?
📝 Summary:
SWE-fficiency is a new benchmark evaluating how language models optimize real-world software repositories for performance on actual workloads. Agents must identify bottlenecks and generate correct code patches matching expert speedup. Current agents significantly underperform, struggling with loc...
🔹 Publication Date: Published on Nov 8
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.06090
• PDF: https://arxiv.org/pdf/2511.06090
• Project Page: https://swefficiency.com/
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#LLM #SoftwareOptimization #PerformanceTuning #AIagents #Benchmarking
✨LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs
📝 Summary:
LUT-LLM is an FPGA accelerator for LLM inference that leverages on-chip memory to shift computation from arithmetic to memory-based operations via table lookups. This innovative approach achieves 1.66x lower latency than AMD MI210 and 1.72x higher energy efficiency than NVIDIA A100 for a 1.7B LLM.
🔹 Publication Date: Published on Nov 9
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.06174
• PDF: https://arxiv.org/pdf/2511.06174
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#LLM #FPGA #AI #DeepLearning #AIHardware
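The trick of shifting computation from arithmetic to memory can be illustrated with a toy lookup-table matrix-vector product: with weights quantized to a small codebook, every product w*x can be read from a precomputed table instead of multiplied. This is only a sketch in the spirit of LUT-style inference, not the paper's actual FPGA kernel.

```python
import numpy as np

# Toy lookup-table matvec: 2-bit quantized weights mean each w*x product
# comes from a small precomputed table (a gather), not a multiplier.
rng = np.random.default_rng(0)
codebook = np.array([-1.0, -0.5, 0.5, 1.0])   # 4 possible weight values
codes = rng.integers(0, 4, size=(8, 16))      # quantized weight matrix
x = rng.standard_normal(16)                   # activation vector

# Precompute table[c, j] = codebook[c] * x[j] once per activation vector.
table = codebook[:, None] * x[None, :]

# The "matmul" is now pure gathering and summation, no weight multiplies.
y_lut = table[codes, np.arange(16)].sum(axis=1)

# Reference result with explicit multiplies.
y_ref = (codebook[codes] * x).sum(axis=1)
print(np.allclose(y_lut, y_ref))  # True
```

On an FPGA the table lives in on-chip memory, which is what trades arithmetic units for memory bandwidth.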
✨DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation
📝 Summary:
This study develops a two-stage reinforcement learning method for competitive code generation. It uses tailored data curation and a hard-focus curriculum, achieving state-of-the-art performance on competitive programming benchmarks.
🔹 Publication Date: Published on Nov 9
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.06307
• PDF: https://arxiv.org/pdf/2511.06307
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#ReinforcementLearning #CodeGeneration #DataCuration #MachineLearning #AIResearch
✨SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization
📝 Summary:
SofT-GRPO is a novel algorithm that enhances soft-thinking in LLMs by integrating Gumbel noise and Gumbel-Softmax. This method successfully reinforces soft-thinking policies, enabling LLMs to outperform discrete-token reinforcement learning approaches, especially on complex tasks.
🔹 Publication Date: Published on Nov 9
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.06411
• PDF: https://arxiv.org/pdf/2511.06411
🔹 Models citing this paper:
• https://huggingface.co/zz1358m/SofT-GRPO-master
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#LLM #ReinforcementLearning #AI #MachineLearning #DeepLearning
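The Gumbel-Softmax reparameterization that SofT-GRPO builds on is easy to sketch: perturb the logits with Gumbel noise, then take a temperature softmax to get a differentiable "soft token" over the vocabulary. This is the standard trick only, not the paper's training code.

```python
import numpy as np

def gumbel_softmax(logits, tau=1.0, rng=None):
    """Differentiable soft sample from a categorical distribution:
    add Gumbel(0, 1) noise to the logits, then softmax at temperature tau."""
    rng = rng or np.random.default_rng()
    u = rng.uniform(1e-9, 1.0, size=logits.shape)
    g = -np.log(-np.log(u))          # Gumbel(0, 1) noise via inverse CDF
    y = (logits + g) / tau
    y = np.exp(y - y.max())          # numerically stable softmax
    return y / y.sum()

probs = gumbel_softmax(np.array([2.0, 1.0, 0.1]), tau=0.5,
                       rng=np.random.default_rng(0))
print(round(probs.sum(), 6))  # 1.0 -- a valid "soft token" distribution
```

Lowering `tau` pushes the soft sample toward a one-hot (discrete) token, while keeping the sampling step differentiable for policy-gradient updates.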
✨Diffusion-SDPO: Safeguarded Direct Preference Optimization for Diffusion Models
📝 Summary:
Diffusion-SDPO improves text-to-image quality by fixing a flaw in standard DPO where preferred output error can increase. It uses a safeguarded update to adaptively scale the loser gradient, ensuring the preferred output's error never increases. This leads to consistent quality gains across bench...
🔹 Publication Date: Published on Nov 5
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.03317
• PDF: https://arxiv.org/pdf/2511.03317
• Github: https://github.com/AIDC-AI/Diffusion-SDPO
🔹 Models citing this paper:
• https://huggingface.co/AIDC-AI/Diffusion-SDPO
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#DiffusionModels #DPO #TextToImage #GenerativeAI #AI
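The safeguard described above can be sketched abstractly: when the loser-gradient term would push the update in a direction that raises the winner's loss, shrink it until the combined step is still a descent direction for the winner. The scaling rule below is an illustrative assumption, not the paper's exact formula.

```python
import numpy as np

def safeguarded_update(g_winner, g_loser, lr=0.1):
    """Preference update that never increases the winner's loss to first
    order: if the naive DPO-style direction misaligns with g_winner,
    rescale the loser term until alignment is restored."""
    d = g_winner - g_loser                  # naive preference direction
    if np.dot(d, g_winner) < 0:             # step would raise winner loss
        lam = np.dot(g_winner, g_winner) / max(np.dot(g_loser, g_winner), 1e-12)
        d = g_winner - lam * g_loser        # shrunken loser contribution
    return -lr * d                          # parameter step

g_w = np.array([1.0, 0.0])
g_l = np.array([3.0, 0.0])                  # loser gradient overwhelms winner
step = safeguarded_update(g_w, g_l)
print(np.dot(-step, g_w) >= 0)              # True: winner loss non-increasing
```

The first-order change in the winner's loss along the step is `dot(step, g_winner)`, so enforcing `dot(d, g_winner) >= 0` is exactly the "preferred output's error never increases" guarantee.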
✨VADER: Towards Causal Video Anomaly Understanding with Relation-Aware Large Language Models
📝 Summary:
VADER is an LLM framework enhancing video anomaly understanding. It integrates keyframe object relations and visual cues to provide detailed, causally grounded descriptions and robust question answering, advancing explainable anomaly analysis.
🔹 Publication Date: Published on Nov 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.07299
• PDF: https://arxiv.org/pdf/2511.07299
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#LLM #VideoAnalytics #AnomalyDetection #Causality #ExplainableAI
✨MPJudge: Towards Perceptual Assessment of Music-Induced Paintings
📝 Summary:
MPJudge is a new framework for assessing music-induced paintings. It integrates music features into a visual encoder using a modulation-based fusion mechanism, outperforming existing emotion models by directly modeling perceptual coherence. It also identifies music-relevant regions better.
🔹 Publication Date: Published on Nov 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.07137
• PDF: https://arxiv.org/pdf/2511.07137
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#MusicAndArt #ComputerVision #MachineLearning #DeepLearning #MultimodalAI
✨Do LLMs Feel? Teaching Emotion Recognition with Prompts, Retrieval, and Curriculum Learning
📝 Summary:
PRC-Emo is a new framework that significantly improves LLMs' emotion recognition in conversations. It combines prompt engineering, demonstration retrieval, and curriculum learning, achieving state-of-the-art results on benchmark datasets.
🔹 Publication Date: Published on Nov 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.07061
• PDF: https://arxiv.org/pdf/2511.07061
• Github: https://github.com/LiXinran6/PRC-Emo
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#LLM #EmotionRecognition #NLP #AIResearch #MachineLearning
✨10 Open Challenges Steering the Future of Vision-Language-Action Models
📝 Summary:
This paper identifies 10 principal challenges in vision-language-action (VLA) models, including multimodality, reasoning, and safety. It also explores emerging trends like spatial understanding and data synthesis. The goal is to accelerate VLA model development and wider acceptance.
🔹 Publication Date: Published on Nov 8
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.05936
• PDF: https://arxiv.org/pdf/2511.05936
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#VLA #AI #MachineLearning #ComputerVision #NLP
✨Qwen-Image Technical Report
📝 Summary:
Qwen-Image is an image generation model that significantly advances complex text rendering through a comprehensive data pipeline and progressive training across languages. It also improves precise image editing via a dual-encoding mechanism and multi-task training for enhanced consistency and vis...
🔹 Publication Date: Published on Aug 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.02324
• PDF: https://arxiv.org/pdf/2508.02324
• Github: https://github.com/QwenLM/Qwen-Image
🔹 Models citing this paper:
• https://huggingface.co/Qwen/Qwen-Image
• https://huggingface.co/Qwen/Qwen-Image-Edit
• https://huggingface.co/Qwen/Qwen-Image-Edit-2509
✨ Spaces citing this paper:
• https://huggingface.co/spaces/linoyts/Qwen-Image-Edit-Angles
• https://huggingface.co/spaces/tori29umai/Qwen-Image-2509-MultipleAngles
• https://huggingface.co/spaces/linoyts/Qwen-Image-Edit-next-scene
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#ImageGeneration #AI #DeepLearning #ComputerVision #TextToImage
✨Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads
📝 Summary:
This paper introduces lightweight UHeads, transformer-based uncertainty quantification heads, to efficiently verify LLM reasoning steps. UHeads estimate uncertainty from the LLM's internal states, outperforming larger verification models while being scalable and effective across various domains.
🔹 Publication Date: Published on Nov 9
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.06209
• PDF: https://arxiv.org/pdf/2511.06209
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#LLM #AI #MachineLearning #UncertaintyQuantification #ModelVerification
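The idea of probing a model's internal states for step correctness can be illustrated with a toy probe: train a tiny classifier on frozen "hidden states" to predict whether a reasoning step is verified correct. This is purely illustrative of the concept; the paper's UHeads are small transformer heads, not the logistic probe below.

```python
import numpy as np

# Toy uncertainty probe on synthetic "hidden states" (all data is fabricated
# for illustration): a logistic-regression head scores step correctness.
rng = np.random.default_rng(0)
H = rng.standard_normal((200, 16))        # hidden states of 200 steps
w_true = rng.standard_normal(16)
y = (H @ w_true > 0).astype(float)        # 1 = step verified correct

w = np.zeros(16)                          # the probe's weights
for _ in range(500):
    p = 1 / (1 + np.exp(-(H @ w)))        # predicted correctness probability
    w -= 0.1 * H.T @ (p - y) / len(y)     # averaged gradient step

acc = (((1 / (1 + np.exp(-(H @ w)))) > 0.5) == y).mean()
print(acc)
```

Because the probe is orders of magnitude smaller than a separate verifier model, scoring every reasoning step stays cheap.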
✨Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models
📝 Summary:
Omni-AVSR is a unified audio-visual LLM that efficiently supports ASR, VSR, and AVSR. It uses multi-granularity training and parameter-efficient adaptation to achieve high accuracy while significantly reducing resource use compared to separate models.
🔹 Publication Date: Published on Nov 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.07253
• PDF: https://arxiv.org/pdf/2511.07253
• Project Page: https://umbertocappellazzo.github.io/Omni-AVSR
• Github: https://github.com/umbertocappellazzo/Omni-AVSR
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#SpeechRecognition #LLM #MultimodalAI #DeepLearning #AIResearch
✨Ariadne: A Controllable Framework for Probing and Extending VLM Reasoning Boundaries
📝 Summary:
Ariadne is a framework using synthetic mazes and RLVR to enhance VLM visual-centric spatial reasoning. It expanded VLM capabilities, raising accuracy from 0 percent to over 50 percent, and significantly improved zero-shot generalization on real-world benchmarks.
🔹 Publication Date: Published on Nov 1
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.00710
• PDF: https://arxiv.org/pdf/2511.00710
• Project Page: https://mingheshen.github.io/Ariadne/
🔹 Models citing this paper:
• https://huggingface.co/KOKKKOKK/Ariadne
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#VLM #AI #MachineLearning #ComputerVision #SpatialReasoning
✨Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation
📝 Summary:
Ovi is a unified audio-video generation model using twin-DiT modules with blockwise cross-modal fusion. This innovative design ensures natural synchronization and high-quality multimodal outputs, simplifying previous multi-stage approaches.
🔹 Publication Date: Published on Sep 30
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.01284
• PDF: https://arxiv.org/pdf/2510.01284
• Project Page: https://aaxwaz.github.io/Ovi
• Github: https://github.com/character-ai/Ovi
🔹 Models citing this paper:
• https://huggingface.co/chetwinlow1/Ovi
• https://huggingface.co/rkfg/Ovi-fp8_quantized
✨ Spaces citing this paper:
• https://huggingface.co/spaces/akhaliq/Ovi
• https://huggingface.co/spaces/deddytoyota/Ovi
• https://huggingface.co/spaces/alexnasa/Ovi-ZEROGPU
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AudioVideoGeneration #MultimodalAI #DeepLearning #CrossModalFusion #AIResearch