✨AR-Omni: A Unified Autoregressive Model for Any-to-Any Generation
📝 Summary:
AR-Omni is a unified autoregressive model for any-to-any multimodal generation using a single Transformer. It generates text images and streaming speech without relying on expert components. The model addresses key challenges like modality imbalance and achieves strong real-time quality.
🔹 Publication Date: Published on Jan 25
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17761
• PDF: https://arxiv.org/pdf/2601.17761
• Project Page: https://modalitydance.github.io/AR-Omni
• Github: https://modalitydance.github.io/AR-Omni
🔹 Models citing this paper:
• https://huggingface.co/ModalityDance/AR-Omni-Pretrain-v0.1
• https://huggingface.co/ModalityDance/AR-Omni-Chat-v0.1
✨ Datasets citing this paper:
• https://huggingface.co/datasets/ModalityDance/AR-Omni-Instruct-v0.1
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
AR-Omni is a unified autoregressive model for any-to-any multimodal generation using a single Transformer. It generates text images and streaming speech without relying on expert components. The model addresses key challenges like modality imbalance and achieves strong real-time quality.
🔹 Publication Date: Published on Jan 25
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17761
• PDF: https://arxiv.org/pdf/2601.17761
• Project Page: https://modalitydance.github.io/AR-Omni
• Github: https://modalitydance.github.io/AR-Omni
🔹 Models citing this paper:
• https://huggingface.co/ModalityDance/AR-Omni-Pretrain-v0.1
• https://huggingface.co/ModalityDance/AR-Omni-Chat-v0.1
✨ Datasets citing this paper:
• https://huggingface.co/datasets/ModalityDance/AR-Omni-Instruct-v0.1
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
arXiv.org
AR-Omni: A Unified Autoregressive Model for Any-to-Any Generation
Real-world perception and interaction are inherently multimodal, encompassing not only language but also vision and speech, which motivates the development of "Omni" MLLMs that support both...
✨Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts
📝 Summary:
Imbalanced expert routing in Mixture-of-Experts models leads to computational inefficiencies in expert parallelism, which are addressed by a dynamic rerouting algorithm that balances workload and redu...
🔹 Publication Date: Published on Jan 23
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17111
• PDF: https://arxiv.org/pdf/2601.17111
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Imbalanced expert routing in Mixture-of-Experts models leads to computational inefficiencies in expert parallelism, which are addressed by a dynamic rerouting algorithm that balances workload and redu...
🔹 Publication Date: Published on Jan 23
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17111
• PDF: https://arxiv.org/pdf/2601.17111
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨DRPG (Decompose, Retrieve, Plan, Generate): An Agentic Framework for Academic Rebuttal
📝 Summary:
An agentic framework for automatic academic rebuttal generation that decomposes reviews, retrieves evidence, plans rebuttal strategies, and generates persuasive responses with human-level performance ...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://huggingface.co/collections/HakHan/drpg-rebuttalagent
• PDF: https://arxiv.org/pdf/2601.18081
• Github: https://github.com/ulab-uiuc/DRPG-RebuttalAgent/tree/master
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
An agentic framework for automatic academic rebuttal generation that decomposes reviews, retrieves evidence, plans rebuttal strategies, and generates persuasive responses with human-level performance ...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://huggingface.co/collections/HakHan/drpg-rebuttalagent
• PDF: https://arxiv.org/pdf/2601.18081
• Github: https://github.com/ulab-uiuc/DRPG-RebuttalAgent/tree/master
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
❤1
✨iFSQ: Improving FSQ for Image Generation with 1 Line of Code
📝 Summary:
Finite Scalar Quantization with improved activation mapping enables unified modeling of discrete and continuous image generation approaches, revealing optimal representation balance and performance ch...
🔹 Publication Date: Published on Jan 23
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17124
• PDF: https://arxiv.org/pdf/2601.17124
• Github: https://github.com/Tencent-Hunyuan/iFSQ
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Finite Scalar Quantization with improved activation mapping enables unified modeling of discrete and continuous image generation approaches, revealing optimal representation balance and performance ch...
🔹 Publication Date: Published on Jan 23
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17124
• PDF: https://arxiv.org/pdf/2601.17124
• Github: https://github.com/Tencent-Hunyuan/iFSQ
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
✨Self-Refining Video Sampling
📝 Summary:
Self-refining video sampling improves motion coherence and physics alignment by using a pre-trained video generator as its own denoising autoencoder for iterative refinement with uncertainty-aware reg...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18577
• PDF: https://arxiv.org/pdf/2601.18577
• Project Page: https://agwmon.github.io/self-refine-video/
• Github: https://github.com/agwmon/self-refine-video
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Self-refining video sampling improves motion coherence and physics alignment by using a pre-trained video generator as its own denoising autoencoder for iterative refinement with uncertainty-aware reg...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18577
• PDF: https://arxiv.org/pdf/2601.18577
• Project Page: https://agwmon.github.io/self-refine-video/
• Github: https://github.com/agwmon/self-refine-video
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨CGPT: Cluster-Guided Partial Tables with LLM-Generated Supervision for Table Retrieval
📝 Summary:
CGPT improves table retrieval by using LLM-generated synthetic queries for contrastive fine-tuning of embedding models through semantically diverse partial table construction. AI-generated summary Gen...
🔹 Publication Date: Published on Jan 22
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15849
• PDF: https://arxiv.org/pdf/2601.15849
• Github: https://github.com/yumeow0122/CGPT
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
CGPT improves table retrieval by using LLM-generated synthetic queries for contrastive fine-tuning of embedding models through semantically diverse partial table construction. AI-generated summary Gen...
🔹 Publication Date: Published on Jan 22
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15849
• PDF: https://arxiv.org/pdf/2601.15849
• Github: https://github.com/yumeow0122/CGPT
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨RouteMoA: Dynamic Routing without Pre-Inference Boosts Efficient Mixture-of-Agents
📝 Summary:
RouteMoA reduces computational costs and latency in mixture-of-agents frameworks by using dynamic routing with lightweight scoring and judgment mechanisms. AI-generated summary Mixture-of-Agents (MoA)...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18130
• PDF: https://arxiv.org/pdf/2601.18130
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
RouteMoA reduces computational costs and latency in mixture-of-agents frameworks by using dynamic routing with lightweight scoring and judgment mechanisms. AI-generated summary Mixture-of-Agents (MoA)...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18130
• PDF: https://arxiv.org/pdf/2601.18130
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨SkyReels-V3 Technique Report
📝 Summary:
SkyReels-V3 is a unified multimodal video generation model that supports reference image-to-video, video-to-video extension, and audio-guided video generation through diffusion Transformers and in-con...
🔹 Publication Date: Published on Jan 24
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17323
• PDF: https://arxiv.org/pdf/2601.17323
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
SkyReels-V3 is a unified multimodal video generation model that supports reference image-to-video, video-to-video extension, and audio-guided video generation through diffusion Transformers and in-con...
🔹 Publication Date: Published on Jan 24
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17323
• PDF: https://arxiv.org/pdf/2601.17323
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨UI Remix: Supporting UI Design Through Interactive Example Retrieval and Remixing
📝 Summary:
UI Remix is an interactive system that supports mobile UI design through example-driven workflows using a multimodal retrieval-augmented generation model, enabling iterative design adaptation with sou...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18759
• PDF: https://arxiv.org/pdf/2601.18759
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
UI Remix is an interactive system that supports mobile UI design through example-driven workflows using a multimodal retrieval-augmented generation model, enabling iterative design adaptation with sou...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18759
• PDF: https://arxiv.org/pdf/2601.18759
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨SAGE: Steerable Agentic Data Generation for Deep Search with Execution Feedback
📝 Summary:
Deep search agents trained on synthetic question-answer pairs generated through an iterative agent-based pipeline demonstrate improved performance and adaptability across different search environments...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18202
• PDF: https://arxiv.org/pdf/2601.18202
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Deep search agents trained on synthetic question-answer pairs generated through an iterative agent-based pipeline demonstrate improved performance and adaptability across different search environments...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18202
• PDF: https://arxiv.org/pdf/2601.18202
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Agentic Very Long Video Understanding
📝 Summary:
An agentic framework using entity scene graphs enables long-horizon video understanding with structured search, temporal reasoning, and cross-modal capabilities for extended visual and audio interpret...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18157
• PDF: https://arxiv.org/pdf/2601.18157
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
An agentic framework using entity scene graphs enables long-horizon video understanding with structured search, temporal reasoning, and cross-modal capabilities for extended visual and audio interpret...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18157
• PDF: https://arxiv.org/pdf/2601.18157
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment
📝 Summary:
Meta Reward Modeling reformulates personalized reward modeling as a meta-learning problem to enable efficient adaptation to individual users with limited feedback. AI-generated summary Alignment of La...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18731
• PDF: https://arxiv.org/pdf/2601.18731
• Github: https://github.com/ModalityDance/MRM
🔹 Models citing this paper:
• https://huggingface.co/ModalityDance/MRM-Reddit150-V2
• https://huggingface.co/ModalityDance/MRM-Reddit100-V2
• https://huggingface.co/ModalityDance/MRM-Reddit150-V1
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Meta Reward Modeling reformulates personalized reward modeling as a meta-learning problem to enable efficient adaptation to individual users with limited feedback. AI-generated summary Alignment of La...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18731
• PDF: https://arxiv.org/pdf/2601.18731
• Github: https://github.com/ModalityDance/MRM
🔹 Models citing this paper:
• https://huggingface.co/ModalityDance/MRM-Reddit150-V2
• https://huggingface.co/ModalityDance/MRM-Reddit100-V2
• https://huggingface.co/ModalityDance/MRM-Reddit150-V1
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
❤2
✨daVinci-Dev: Agent-native Mid-training for Software Engineering
📝 Summary:
This paper introduces agentic mid-training for LLMs, bridging static data and dynamic development environments. Using agent-native data with contextually and environmentally native trajectories, it outperforms prior work on SWE-Bench Verified with fewer tokens.
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18418
• PDF: https://arxiv.org/pdf/2601.18418
• Github: https://github.com/GAIR-NLP/daVinci-Dev
🔹 Models citing this paper:
• https://huggingface.co/GAIR/daVinci-Dev-72B
• https://huggingface.co/GAIR/daVinci-Dev-32B-MT
• https://huggingface.co/GAIR/daVinci-Dev-72B-MT
✨ Datasets citing this paper:
• https://huggingface.co/datasets/GAIR/daVinci-Dev
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
This paper introduces agentic mid-training for LLMs, bridging static data and dynamic development environments. Using agent-native data with contextually and environmentally native trajectories, it outperforms prior work on SWE-Bench Verified with fewer tokens.
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18418
• PDF: https://arxiv.org/pdf/2601.18418
• Github: https://github.com/GAIR-NLP/daVinci-Dev
🔹 Models citing this paper:
• https://huggingface.co/GAIR/daVinci-Dev-72B
• https://huggingface.co/GAIR/daVinci-Dev-32B-MT
• https://huggingface.co/GAIR/daVinci-Dev-72B-MT
✨ Datasets citing this paper:
• https://huggingface.co/datasets/GAIR/daVinci-Dev
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents
📝 Summary:
Research examines factors influencing out-of-domain performance in reinforcement learning agents, identifying state information richness and planning complexity as key determinants, while proposing a ...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18217
• PDF: https://arxiv.org/pdf/2601.18217
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Research examines factors influencing out-of-domain performance in reinforcement learning agents, identifying state information richness and planning complexity as key determinants, while proposing a ...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18217
• PDF: https://arxiv.org/pdf/2601.18217
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability
📝 Summary:
A self-improvement framework enables pretrained language models to generate automated curricula for solving previously unsolvable problems by leveraging latent knowledge and meta-reinforcement learnin...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18778
• PDF: https://arxiv.org/pdf/2601.18778
• Github: https://ssundaram21.github.io/soar/
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
A self-improvement framework enables pretrained language models to generate automated curricula for solving previously unsolvable problems by leveraging latent knowledge and meta-reinforcement learnin...
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18778
• PDF: https://arxiv.org/pdf/2601.18778
• Github: https://ssundaram21.github.io/soar/
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Diffusion In Diffusion: Reclaiming Global Coherence in Semi-Autoregressive Diffusion
📝 Summary:
A 'draft-then-refine' framework for discrete diffusion language models that recovers global contextual understanding while maintaining semi-autoregressive efficiency through block diffusion and global...
🔹 Publication Date: Published on Jan 20
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13599
• PDF: https://arxiv.org/pdf/2601.13599
• Project Page: https://noah-dllm.github.io/
• Github: https://noah-dllm.github.io/
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
A 'draft-then-refine' framework for discrete diffusion language models that recovers global contextual understanding while maintaining semi-autoregressive efficiency through block diffusion and global...
🔹 Publication Date: Published on Jan 20
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13599
• PDF: https://arxiv.org/pdf/2601.13599
• Project Page: https://noah-dllm.github.io/
• Github: https://noah-dllm.github.io/
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨The Side Effects of Being Smart: Safety Risks in MLLMs' Multi-Image Reasoning
📝 Summary:
A new benchmark, MIR-SafetyBench, reveals advanced MLLMs are more vulnerable to safety risks in multi-image reasoning. These models often give superficial safe responses, with unsafe generations showing lower attention entropy, indicating an over-focus on task completion.
🔹 Publication Date: Published on Jan 20
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14127
• PDF: https://arxiv.org/pdf/2601.14127
• Github: https://github.com/thu-coai/MIR-SafetyBench
✨ Datasets citing this paper:
• https://huggingface.co/datasets/thu-coai/MIR-SafetyBench
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AIsafety #MLLMs #MultiModalAI #ResponsibleAI #AIresearch
📝 Summary:
A new benchmark, MIR-SafetyBench, reveals advanced MLLMs are more vulnerable to safety risks in multi-image reasoning. These models often give superficial safe responses, with unsafe generations showing lower attention entropy, indicating an over-focus on task completion.
🔹 Publication Date: Published on Jan 20
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14127
• PDF: https://arxiv.org/pdf/2601.14127
• Github: https://github.com/thu-coai/MIR-SafetyBench
✨ Datasets citing this paper:
• https://huggingface.co/datasets/thu-coai/MIR-SafetyBench
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AIsafety #MLLMs #MultiModalAI #ResponsibleAI #AIresearch
❤1
✨End-to-End Joint ASR and Speaker Role Diarization with Child-Adult Interactions
📝 Summary:
This paper presents a unified end-to-end framework extending Whisper for joint ASR and child-adult speaker role diarization. It significantly improves trannoscription accuracy and scalability by preventing error propagation, achieving lower word error rates and competitive diarization performance.
🔹 Publication Date: Published on Jan 25
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17640
• PDF: https://arxiv.org/pdf/2601.17640
• Github: https://github.com/usc-sail/joint-asr-diarization-child-adult
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#ASR #SpeakerDiarization #SpeechProcessing #DeepLearning #ChildAdultInteraction
📝 Summary:
This paper presents a unified end-to-end framework extending Whisper for joint ASR and child-adult speaker role diarization. It significantly improves trannoscription accuracy and scalability by preventing error propagation, achieving lower word error rates and competitive diarization performance.
🔹 Publication Date: Published on Jan 25
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17640
• PDF: https://arxiv.org/pdf/2601.17640
• Github: https://github.com/usc-sail/joint-asr-diarization-child-adult
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#ASR #SpeakerDiarization #SpeechProcessing #DeepLearning #ChildAdultInteraction
✨PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues
📝 Summary:
PingPong is a new human-authored benchmark for natural, multi-party code-switching dialogues, including trilingual conversations. It offers greater structural diversity than machine-generated data. Evaluations show current language models struggle with code-switched inputs, emphasizing the need f...
🔹 Publication Date: Published on Jan 24
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17277
• PDF: https://arxiv.org/pdf/2601.17277
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#CodeSwitching #NLP #DialogueSystems #MultilingualAI #LLMs
📝 Summary:
PingPong is a new human-authored benchmark for natural, multi-party code-switching dialogues, including trilingual conversations. It offers greater structural diversity than machine-generated data. Evaluations show current language models struggle with code-switched inputs, emphasizing the need f...
🔹 Publication Date: Published on Jan 24
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.17277
• PDF: https://arxiv.org/pdf/2601.17277
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#CodeSwitching #NLP #DialogueSystems #MultilingualAI #LLMs
✨Less Is More -- Until It Breaks: Security Pitfalls of Vision Token Compression in Large Vision-Language Models
📝 Summary:
Visual token compression degrades LVLM robustness via unstable token importance ranking. This causes critical information loss, creating vulnerabilities only under compression. An attack exploits this, revealing an efficiency-security trade-off.
🔹 Publication Date: Published on Jan 17
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.12042
• PDF: https://arxiv.org/pdf/2601.12042
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#LVLM #AIsecurity #VisionAI #ModelRobustness #DeepLearning
📝 Summary:
Visual token compression degrades LVLM robustness via unstable token importance ranking. This causes critical information loss, creating vulnerabilities only under compression. An attack exploits this, revealing an efficiency-security trade-off.
🔹 Publication Date: Published on Jan 17
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.12042
• PDF: https://arxiv.org/pdf/2601.12042
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#LVLM #AIsecurity #VisionAI #ModelRobustness #DeepLearning
❤1
✨Plug-and-Play Benchmarking of Reinforcement Learning Algorithms for Large-Scale Flow Control
📝 Summary:
FluidGym presents a standalone, fully differentiable reinforcement learning benchmark for active flow control that operates without external CFD solvers and supports standardized evaluation protocols....
🔹 Publication Date: Published on Jan 21
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15015v1
• PDF: https://arxiv.org/pdf/2601.15015
• Github: https://github.com/safe-autonomous-systems/fluidgym
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
FluidGym presents a standalone, fully differentiable reinforcement learning benchmark for active flow control that operates without external CFD solvers and supports standardized evaluation protocols....
🔹 Publication Date: Published on Jan 21
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15015v1
• PDF: https://arxiv.org/pdf/2601.15015
• Github: https://github.com/safe-autonomous-systems/fluidgym
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research