ML Research Hub

✨RelayLLM: Efficient Reasoning via Collaborative Decoding

📝 Summary:
RelayLLM enables efficient collaborative reasoning between small and large language models through token-level dynamic invocation, achieving high accuracy with minimal computational overhead.

🔹 Publication Date: Published on Jan 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.05167
• PDF: https://arxiv.org/pdf/2601.05167
• Github: https://github.com/Chengsong-Huang/RelayLLM

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

✨VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

📝 Summary:
VideoAuto-R1 framework employs a reason-when-necessary strategy for video understanding, using a Thinking Once, Answering Twice training paradigm with verifiable rewards and confidence-based reasoning...

🔹 Publication Date: Published on Jan 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.05175
• PDF: https://arxiv.org/pdf/2601.05175
• Project Page: https://ivul-kaust.github.io/projects/videoauto-r1/
• Github: https://github.com/IVUL-KAUST/VideoAuto-R1/

✨ Spaces citing this paper:
• https://huggingface.co/spaces/sming256/VideoAuto-R1_Demo

#AI #DataScience #MachineLearning #HuggingFace #Research

✨RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation

📝 Summary:
Collecting diverse robot manipulation data is challenging. This paper introduces visual identity prompting, using exemplar images to guide diffusion models for generating multi-view, temporally coherent data. This augmented data improves robot policy performance in both simulation and real-world ...

🔹 Publication Date: Published on Jan 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.05241
• PDF: https://arxiv.org/pdf/2601.05241
• Project Page: https://robovip.github.io/RoboVIP/

#Robotics #AI #GenerativeAI #ComputerVision #MachineLearning

✨AT^2PO: Agentic Turn-based Policy Optimization via Tree Search

📝 Summary:
AT^2PO is a framework for multi-turn agentic reinforcement learning. It uses a turn-level tree search with entropy-guided expansion and turn-wise credit assignment. This improves exploration, reward propagation, and policy optimization, achieving state-of-the-art results.

🔹 Publication Date: Published on Jan 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.04767
• PDF: https://arxiv.org/pdf/2601.04767
• Github: https://github.com/zzfoutofspace/ATPO

#ReinforcementLearning #AgenticAI #TreeSearch #PolicyOptimization #ArtificialIntelligence

✨Few Tokens Matter: Entropy Guided Attacks on Vision-Language Models

📝 Summary:
Attacking a few high-entropy tokens in VLMs significantly degrades outputs with reduced budgets. These selective attacks efficiently create harmful outputs and transfer across architectures, exposing new VLM safety weaknesses.
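
To make the idea of "high-entropy tokens" concrete, here is a minimal sketch that ranks decoding positions by the Shannon entropy of their next-token distribution and keeps the top few. It is an illustration only, not the paper's attack: the function names, the toy distributions, and the choice of k are all assumptions.

```python
import math

def shannon_entropy(probs):
    """Entropy (in nats) of a single next-token distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def top_entropy_positions(distributions, k):
    """Return the indices of the k highest-entropy positions, highest first."""
    scored = [(shannon_entropy(d), i) for i, d in enumerate(distributions)]
    # Sort by descending entropy; break ties by position for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [i for _, i in scored[:k]]

# Toy next-token distributions over a 4-token vocabulary (hypothetical values).
toy = [
    [0.97, 0.01, 0.01, 0.01],  # confident prediction, low entropy
    [0.25, 0.25, 0.25, 0.25],  # uniform, maximal entropy
    [0.70, 0.10, 0.10, 0.10],
    [0.40, 0.30, 0.20, 0.10],
]
selected = top_entropy_positions(toy, k=2)
print(selected)
```

A selective attack in this spirit would spend its perturbation budget only on the selected positions rather than on every token.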

🔹 Publication Date: Published on Dec 26, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.21815
• PDF: https://arxiv.org/pdf/2512.21815

#VLMs #AISafety #AdversarialAI #MachineLearning #AIResearch

✨VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control

📝 Summary:
VerseCrafter is a 4D video world model enabling unified control over camera and object dynamics. It uses a novel 4D Geometric Control representation with 3D Gaussian trajectories for high-fidelity video generation. An automatic data engine addresses training data scarcity.

🔹 Publication Date: Published on Jan 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.05138
• PDF: https://arxiv.org/pdf/2601.05138
• Project Page: https://sixiaozheng.github.io/VerseCrafter_page/

🔹 Models citing this paper:
• https://huggingface.co/TencentARC/VerseCrafter

#AI #DataScience #MachineLearning #HuggingFace #Research

✨Agent-as-a-Judge

📝 Summary:
Large language models face limitations in evaluating complex, multi-step tasks, prompting the development of agent-based evaluation systems that utilize planning, tool-augmented verification, and mult...

🔹 Publication Date: Published on Jan 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.05111
• PDF: https://arxiv.org/pdf/2601.05111
• Github: https://github.com/ModalityDance/Awesome-Agent-as-a-Judge

#AI #DataScience #MachineLearning #HuggingFace #Research

✨The Illusion of Specialization: Unveiling the Domain-Invariant "Standing Committee" in Mixture-of-Experts Models

📝 Summary:
Mixture of Experts models exhibit a Standing Committee of experts that consistently dominates routing across domains, challenging the assumption of widespread specialization. This reveals a strong structural bias toward centralized computation, limiting effective specialization.
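
One simple way to picture a domain-invariant "Standing Committee" is as the set of experts that rank among the most heavily routed in every domain. The sketch below computes that intersection from per-domain routing counts. It is a hypothetical illustration, not the paper's analysis: the function names, the top-k criterion, and the routing numbers are assumptions.

```python
def top_k_experts(load, k):
    """Indices of the k experts with the highest routing load."""
    ranked = sorted(range(len(load)), key=lambda e: (-load[e], e))
    return set(ranked[:k])

def standing_committee(load_by_domain, k):
    """Experts that rank in the top k for every domain."""
    top_sets = [top_k_experts(load, k) for load in load_by_domain.values()]
    return sorted(set.intersection(*top_sets))

# Hypothetical routing counts: tokens sent to each of 6 experts, per domain.
routing = {
    "code":    [900, 850, 120, 400, 60, 70],
    "math":    [880, 910, 90, 50, 420, 30],
    "biology": [870, 800, 410, 80, 40, 100],
}
committee = standing_committee(routing, k=3)
print(committee)
```

In this toy data, experts 0 and 1 dominate all three domains, while the third slot changes per domain, which is the kind of pattern the summary describes.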

🔹 Publication Date: Published on Jan 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03425
• PDF: https://arxiv.org/pdf/2601.03425

#MixtureOfExperts #DeepLearning #MachineLearning #AISpecialization #NeuralNetworks

✨Plenoptic Video Generation

📝 Summary:
PlenopticDreamer addresses multi-view video re-rendering inconsistency by synchronizing generative hallucinations. It uses an autoregressive model with camera-guided retrieval to ensure spatio-temporal coherence, achieving state-of-the-art results with high fidelity.

🔹 Publication Date: Published on Jan 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.05239
• PDF: https://arxiv.org/pdf/2601.05239
• Project Page: https://research.nvidia.com/labs/dir/plenopticdreamer/

#PlenopticVideo #GenerativeAI #VideoGeneration #ComputerVision #DeepLearning

✨CoV: Chain-of-View Prompting for Spatial Reasoning

📝 Summary:
Chain-of-View (CoV) prompting helps vision-language models improve spatial reasoning in 3D embodied question answering. It actively selects question-aligned views and iteratively adjusts camera positions to gather context, significantly boosting performance without additional training.

🔹 Publication Date: Published on Jan 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.05172
• PDF: https://arxiv.org/pdf/2601.05172
• Github: https://github.com/ziplab/CoV

#SpatialReasoning #VisionLanguageModels #PromptEngineering #EmbodiedAI #AIResearch

✨DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

📝 Summary:
DiffCoT reformulates chain-of-thought reasoning as an iterative denoising process using diffusion principles, enabling unified generation and correction of intermediate steps while maintaining causal ...

🔹 Publication Date: Published on Jan 7

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03559
• PDF: https://arxiv.org/pdf/2601.03559

#AI #DataScience #MachineLearning #HuggingFace #Research

✨Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing

📝 Summary:
Re-Align addresses the gap between understanding and generation in in-context image generation and editing through structured reasoning-guided alignment and reinforcement learning training.

🔹 Publication Date: Published on Jan 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.05124
• PDF: https://arxiv.org/pdf/2601.05124
• Project Page: https://hrz2000.github.io/realign/
• Github: https://github.com/hrz2000/realign

#AI #DataScience #MachineLearning #HuggingFace #Research

✨One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling

📝 Summary:
This paper introduces polymath learning, demonstrating that a single, carefully designed training sample can significantly boost language model reasoning across multiple scientific disciplines. This sample engineering approach outperforms training with larger datasets, emphasizing quality over qu...

🔹 Publication Date: Published on Jan 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03111
• PDF: https://arxiv.org/pdf/2601.03111

#AI #MachineLearning #LLM #DataEfficiency #SampleEngineering

✨DocDancer: Towards Agentic Document-Grounded Information Seeking

📝 Summary:
DocDancer is an end-to-end trained open-source document question answering agent that formulates the task as an information-seeking problem and uses a tool-driven framework with exploration and synthe...

🔹 Publication Date: Published on Jan 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.05163
• PDF: https://arxiv.org/pdf/2601.05163

#AI #DataScience #MachineLearning #HuggingFace #Research

✨Multi-Scale Local Speculative Decoding for Image Generation

📝 Summary:
Multi-Scale Local Speculative Decoding accelerates autoregressive image generation through multi-resolution drafting and spatially informed verification while maintaining semantic quality and perceptu...

🔹 Publication Date: Published on Jan 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.05149
• PDF: https://arxiv.org/pdf/2601.05149
• Project Page: https://qualcomm-ai-research.github.io/mulo-sd-webpage/

#AI #DataScience #MachineLearning #HuggingFace #Research

✨PyramidalWan: On Making Pretrained Video Model Pyramidal for Efficient Inference

📝 Summary:
Pyramidal diffusion models reduce computational cost through hierarchical resolution processing, with pretrained models converted via low-cost fine-tuning maintaining output quality while enabling eff...

🔹 Publication Date: Published on Jan 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.04792
• PDF: https://arxiv.org/pdf/2601.04792
• Project Page: https://qualcomm-ai-research.github.io/PyramidalWan

#AI #DataScience #MachineLearning #HuggingFace #Research

✨ProFuse: Efficient Cross-View Context Fusion for Open-Vocabulary 3D Gaussian Splatting

📝 Summary:
ProFuse enhances 3D scene understanding by integrating semantic information into 3D Gaussian Splatting through efficient context-aware processing and pre-registration phases.

🔹 Publication Date: Published on Jan 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.04754
• PDF: https://arxiv.org/pdf/2601.04754
• Project Page: https://chiou1203.github.io/ProFuse/

#AI #DataScience #MachineLearning #HuggingFace #Research

✨Scaling Behavior Cloning Improves Causal Reasoning: An Open Model for Real-Time Video Game Playing

📝 Summary:
Behavior cloning demonstrates improved performance and causal reasoning through scaling model size and training data, achieving human-level gameplay in 3D video games.

🔹 Publication Date: Published on Jan 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.04575
• PDF: https://arxiv.org/pdf/2601.04575
• Project Page: https://elefant-ai.github.io/open-p2p/
• Github: https://github.com/elefant-ai/open-p2p

🔹 Models citing this paper:
• https://huggingface.co/elefantai/open-p2p

✨ Datasets citing this paper:
• https://huggingface.co/datasets/elefantai/p2p-full-data
• https://huggingface.co/datasets/elefantai/p2p-toy-examples

#AI #DataScience #MachineLearning #HuggingFace #Research

✨ReHyAt: Recurrent Hybrid Attention for Video Diffusion Transformers

📝 Summary:
ReHyAt presents a recurrent hybrid attention mechanism, merging softmax fidelity with linear efficiency. This enables scalable, high-quality video generation by reducing computational cost from quadratic to linear, with significantly lower training costs.

🔹 Publication Date: Published on Jan 7

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.04342
• PDF: https://arxiv.org/pdf/2601.04342
• Project Page: https://qualcomm-ai-research.github.io/rehyat

#AI #DataScience #MachineLearning #HuggingFace #Research

✨Guardians of the Hair: Rescuing Soft Boundaries in Depth, Stereo, and Novel Views

📝 Summary:
HairGuard is a framework for recovering fine-grained soft boundary details in 3D vision tasks through specialized depth refinement and view synthesis techniques.

🔹 Publication Date: Published on Jan 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03362
• PDF: https://arxiv.org/pdf/2601.03362

#AI #DataScience #MachineLearning #HuggingFace #Research

✨Towards Open-Vocabulary Industrial Defect Understanding with a Large-Scale Multimodal Dataset

📝 Summary:
A large-scale industrial multimodal defect dataset with 1 million image-text pairs enables efficient foundation model adaptation for manufacturing quality inspection and generation tasks.

🔹 Publication Date: Published on Dec 30, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.24160
• PDF: https://arxiv.org/pdf/2512.24160
• Project Page: https://ninaneon.github.io/projectpage/
• Github: https://github.com/NinaNeon/IMDD-1M-Towards-Open-Vocabulary-Industrial-Defect-

#AI #DataScience #MachineLearning #HuggingFace #Research