NEW BOT Телеграм, страница

ML Research Hub

✨AR-Omni: A Unified Autoregressive Model for Any-to-Any Generation

📝 Summary:
AR-Omni is a unified autoregressive model for any-to-any multimodal generation using a single Transformer. It generates text images and streaming speech without relying on expert components. The model addresses key challenges like modality imbalance and achieves strong real-time quality.

🔹 Publication Date: Published on Jan 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17761
• PDF: https://arxiv.org/pdf/2601.17761
• Project Page: https://modalitydance.github.io/AR-Omni
• Github: https://modalitydance.github.io/AR-Omni

🔹 Models citing this paper:
• https://huggingface.co/ModalityDance/AR-Omni-Pretrain-v0.1
• https://huggingface.co/ModalityDance/AR-Omni-Chat-v0.1

✨ Datasets citing this paper:
• https://huggingface.co/datasets/ModalityDance/AR-Omni-Instruct-v0.1

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

arXiv.org

AR-Omni: A Unified Autoregressive Model for Any-to-Any Generation

Real-world perception and interaction are inherently multimodal, encompassing not only language but also vision and speech, which motivates the development of "Omni" MLLMs that support both...

98 views04:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts

📝 Summary:
Imbalanced expert routing in Mixture-of-Experts models leads to computational inefficiencies in expert parallelism, which are addressed by a dynamic rerouting algorithm that balances workload and redu...

🔹 Publication Date: Published on Jan 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17111
• PDF: https://arxiv.org/pdf/2601.17111

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

97 views04:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨DRPG (Decompose, Retrieve, Plan, Generate): An Agentic Framework for Academic Rebuttal

📝 Summary:
An agentic framework for automatic academic rebuttal generation that decomposes reviews, retrieves evidence, plans rebuttal strategies, and generates persuasive responses with human-level performance ...

🔹 Publication Date: Published on Jan 26

🔹 Paper Links:
• arXiv Page: https://huggingface.co/collections/HakHan/drpg-rebuttalagent
• PDF: https://arxiv.org/pdf/2601.18081
• Github: https://github.com/ulab-uiuc/DRPG-RebuttalAgent/tree/master

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

❤1

113 views04:02

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨iFSQ: Improving FSQ for Image Generation with 1 Line of Code

📝 Summary:
Finite Scalar Quantization with improved activation mapping enables unified modeling of discrete and continuous image generation approaches, revealing optimal representation balance and performance ch...

🔹 Publication Date: Published on Jan 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17124
• PDF: https://arxiv.org/pdf/2601.17124
• Github: https://github.com/Tencent-Hunyuan/iFSQ

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

98 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

0:25

This media is not supported in your browser

VIEW IN TELEGRAM

✨Self-Refining Video Sampling

📝 Summary:
Self-refining video sampling improves motion coherence and physics alignment by using a pre-trained video generator as its own denoising autoencoder for iterative refinement with uncertainty-aware reg...

🔹 Publication Date: Published on Jan 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18577
• PDF: https://arxiv.org/pdf/2601.18577
• Project Page: https://agwmon.github.io/self-refine-video/
• Github: https://github.com/agwmon/self-refine-video

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

88 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨CGPT: Cluster-Guided Partial Tables with LLM-Generated Supervision for Table Retrieval

📝 Summary:
CGPT improves table retrieval by using LLM-generated synthetic queries for contrastive fine-tuning of embedding models through semantically diverse partial table construction. AI-generated summary Gen...

🔹 Publication Date: Published on Jan 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15849
• PDF: https://arxiv.org/pdf/2601.15849
• Github: https://github.com/yumeow0122/CGPT

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

86 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨RouteMoA: Dynamic Routing without Pre-Inference Boosts Efficient Mixture-of-Agents

📝 Summary:
RouteMoA reduces computational costs and latency in mixture-of-agents frameworks by using dynamic routing with lightweight scoring and judgment mechanisms. AI-generated summary Mixture-of-Agents (MoA)...

🔹 Publication Date: Published on Jan 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18130
• PDF: https://arxiv.org/pdf/2601.18130

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

76 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨SkyReels-V3 Technique Report

📝 Summary:
SkyReels-V3 is a unified multimodal video generation model that supports reference image-to-video, video-to-video extension, and audio-guided video generation through diffusion Transformers and in-con...

🔹 Publication Date: Published on Jan 24

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17323
• PDF: https://arxiv.org/pdf/2601.17323

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

82 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨UI Remix: Supporting UI Design Through Interactive Example Retrieval and Remixing

📝 Summary:
UI Remix is an interactive system that supports mobile UI design through example-driven workflows using a multimodal retrieval-augmented generation model, enabling iterative design adaptation with sou...

🔹 Publication Date: Published on Jan 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18759
• PDF: https://arxiv.org/pdf/2601.18759

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

96 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨SAGE: Steerable Agentic Data Generation for Deep Search with Execution Feedback

📝 Summary:
Deep search agents trained on synthetic question-answer pairs generated through an iterative agent-based pipeline demonstrate improved performance and adaptability across different search environments...

🔹 Publication Date: Published on Jan 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18202
• PDF: https://arxiv.org/pdf/2601.18202

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

118 views05:03

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Agentic Very Long Video Understanding

📝 Summary:
An agentic framework using entity scene graphs enables long-horizon video understanding with structured search, temporal reasoning, and cross-modal capabilities for extended visual and audio interpret...

🔹 Publication Date: Published on Jan 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18157
• PDF: https://arxiv.org/pdf/2601.18157

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

129 views05:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment

📝 Summary:
Meta Reward Modeling reformulates personalized reward modeling as a meta-learning problem to enable efficient adaptation to individual users with limited feedback. AI-generated summary Alignment of La...

🔹 Publication Date: Published on Jan 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18731
• PDF: https://arxiv.org/pdf/2601.18731
• Github: https://github.com/ModalityDance/MRM

🔹 Models citing this paper:
• https://huggingface.co/ModalityDance/MRM-Reddit150-V2
• https://huggingface.co/ModalityDance/MRM-Reddit100-V2
• https://huggingface.co/ModalityDance/MRM-Reddit150-V1

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

❤2

115 views06:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨daVinci-Dev: Agent-native Mid-training for Software Engineering

📝 Summary:
This paper introduces agentic mid-training for LLMs, bridging static data and dynamic development environments. Using agent-native data with contextually and environmentally native trajectories, it outperforms prior work on SWE-Bench Verified with fewer tokens.

🔹 Publication Date: Published on Jan 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18418
• PDF: https://arxiv.org/pdf/2601.18418
• Github: https://github.com/GAIR-NLP/daVinci-Dev

🔹 Models citing this paper:
• https://huggingface.co/GAIR/daVinci-Dev-72B
• https://huggingface.co/GAIR/daVinci-Dev-32B-MT
• https://huggingface.co/GAIR/daVinci-Dev-72B-MT

✨ Datasets citing this paper:
• https://huggingface.co/datasets/GAIR/daVinci-Dev

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

85 views07:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents

📝 Summary:
Research examines factors influencing out-of-domain performance in reinforcement learning agents, identifying state information richness and planning complexity as key determinants, while proposing a ...

🔹 Publication Date: Published on Jan 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18217
• PDF: https://arxiv.org/pdf/2601.18217

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

102 views07:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability

📝 Summary:
A self-improvement framework enables pretrained language models to generate automated curricula for solving previously unsolvable problems by leveraging latent knowledge and meta-reinforcement learnin...

🔹 Publication Date: Published on Jan 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18778
• PDF: https://arxiv.org/pdf/2601.18778
• Github: https://ssundaram21.github.io/soar/

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

144 views07:04

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Diffusion In Diffusion: Reclaiming Global Coherence in Semi-Autoregressive Diffusion

📝 Summary:
A 'draft-then-refine' framework for discrete diffusion language models that recovers global contextual understanding while maintaining semi-autoregressive efficiency through block diffusion and global...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13599
• PDF: https://arxiv.org/pdf/2601.13599
• Project Page: https://noah-dllm.github.io/
• Github: https://noah-dllm.github.io/

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

171 views07:05

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨The Side Effects of Being Smart: Safety Risks in MLLMs' Multi-Image Reasoning

📝 Summary:
A new benchmark, MIR-SafetyBench, reveals advanced MLLMs are more vulnerable to safety risks in multi-image reasoning. These models often give superficial safe responses, with unsafe generations showing lower attention entropy, indicating an over-focus on task completion.

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14127
• PDF: https://arxiv.org/pdf/2601.14127
• Github: https://github.com/thu-coai/MIR-SafetyBench

✨ Datasets citing this paper:
• https://huggingface.co/datasets/thu-coai/MIR-SafetyBench

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AIsafety #MLLMs #MultiModalAI #ResponsibleAI #AIresearch

❤1

147 views09:05

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨End-to-End Joint ASR and Speaker Role Diarization with Child-Adult Interactions

📝 Summary:
This paper presents a unified end-to-end framework extending Whisper for joint ASR and child-adult speaker role diarization. It significantly improves trannoscription accuracy and scalability by preventing error propagation, achieving lower word error rates and competitive diarization performance.

🔹 Publication Date: Published on Jan 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17640
• PDF: https://arxiv.org/pdf/2601.17640
• Github: https://github.com/usc-sail/joint-asr-diarization-child-adult

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#ASR #SpeakerDiarization #SpeechProcessing #DeepLearning #ChildAdultInteraction

93 views10:05

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues

📝 Summary:
PingPong is a new human-authored benchmark for natural, multi-party code-switching dialogues, including trilingual conversations. It offers greater structural diversity than machine-generated data. Evaluations show current language models struggle with code-switched inputs, emphasizing the need f...

🔹 Publication Date: Published on Jan 24

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17277
• PDF: https://arxiv.org/pdf/2601.17277

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#CodeSwitching #NLP #DialogueSystems #MultilingualAI #LLMs

94 views10:05

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Less Is More -- Until It Breaks: Security Pitfalls of Vision Token Compression in Large Vision-Language Models

📝 Summary:
Visual token compression degrades LVLM robustness via unstable token importance ranking. This causes critical information loss, creating vulnerabilities only under compression. An attack exploits this, revealing an efficiency-security trade-off.

🔹 Publication Date: Published on Jan 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.12042
• PDF: https://arxiv.org/pdf/2601.12042

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#LVLM #AIsecurity #VisionAI #ModelRobustness #DeepLearning

❤1

113 views10:06

✨ Explore Data Science 📝 Write your paper

ML Research Hub

✨Plug-and-Play Benchmarking of Reinforcement Learning Algorithms for Large-Scale Flow Control

📝 Summary:
FluidGym presents a standalone, fully differentiable reinforcement learning benchmark for active flow control that operates without external CFD solvers and supports standardized evaluation protocols....

🔹 Publication Date: Published on Jan 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15015v1
• PDF: https://arxiv.org/pdf/2601.15015
• Github: https://github.com/safe-autonomous-systems/fluidgym

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research

136 views10:06

✨ Explore Data Science 📝 Write your paper

About

Blog

Apps

Platform