ML Research Hub – Telegram
ML Research Hub
32.7K subscribers
4.01K photos
229 videos
23 files
4.32K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Walking the Tightrope of LLMs for Software Development: A Practitioners' Perspective

📝 Summary:
This study investigated software developers' perspectives on Large Language Models, identifying benefits like improved workflow and entrepreneurship, alongside risks to personal well-being and reputation. It highlights key trade-offs and best practices for adopting LLMs in software development.

🔹 Publication Date: Published on Nov 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.06428
• PDF: https://arxiv.org/pdf/2511.06428

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LLMs #SoftwareDevelopment #AIinDevelopment #DeveloperExperience #TechResearch
Adaptive Multi-Agent Response Refinement in Conversational Systems

📝 Summary:
This paper presents a multi-agent framework for refining conversational responses across factuality, personalization, and coherence. It employs dynamic agent coordination, outperforming single LLM approaches on challenging conversational datasets.

🔹 Publication Date: Published on Nov 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.08319
• PDF: https://arxiv.org/pdf/2511.08319

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#MultiAgentSystems #ConversationalAI #LLMs #NLP #AIResearch
KLASS: KL-Guided Fast Inference in Masked Diffusion Models

📝 Summary:
KLASS accelerates masked diffusion model inference by using KL divergence to identify stable, high-confidence predictions. It unmasks multiple tokens per iteration, significantly speeding up generation and improving quality across text, image, and molecular tasks.

🔹 Publication Date: Published on Nov 7

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.05664
• PDF: https://arxiv.org/pdf/2511.05664
• Github: https://github.com/shkim0116/KLASS

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#DiffusionModels #GenerativeAI #MachineLearning #AIResearch #ModelAcceleration
1
The Path Not Taken: RLVR Provably Learns Off the Principals

📝 Summary:
RLVR learns by modifying parameters off principal directions in low-curvature subspaces, appearing sparse due to optimization bias. This distinct optimization regime contrasts with SFT, meaning SFT-era fine-tuning methods are flawed for RLVR.

🔹 Publication Date: Published on Nov 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.08567
• PDF: https://arxiv.org/pdf/2511.08567

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#RLVR #MachineLearning #Optimization #DeepLearning #AIResearch
🔥1
Wasm: A Pipeline for Constructing Structured Arabic Interleaved Multimodal Corpora

📝 Summary:
Wasm is a pipeline creating a new structured Arabic multimodal dataset from Common Crawl. It preserves document structure and supports both text-only and multimodal pre-training, addressing the lack of high-quality Arabic datasets.

🔹 Publication Date: Published on Nov 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.07080
• PDF: https://arxiv.org/pdf/2511.07080

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#ArabicNLP #MultimodalAI #DatasetCreation #Corpora #DataScience
1
BiCA: Effective Biomedical Dense Retrieval with Citation-Aware Hard Negatives

📝 Summary:
BiCA improves biomedical dense retrieval by using citation links as hard negatives. This leverages document structure to enhance performance with minimal fine-tuning, enabling data-efficient domain adaptation.

🔹 Publication Date: Published on Nov 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.08029
• PDF: https://arxiv.org/pdf/2511.08029
• Github: https://github.com/NiravBhattLab/BiCA

🔹 Models citing this paper:
https://huggingface.co/bisectgroup/BiCA-small
https://huggingface.co/bisectgroup/BiCA-base

Datasets citing this paper:
https://huggingface.co/datasets/bisectgroup/2hop-citation-graphs
https://huggingface.co/datasets/bisectgroup/hard-negatives-traversal

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#BiomedicalAI #DenseRetrieval #NLP #MachineLearning #InformationRetrieval
FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution

📝 Summary:
FlashVSR introduces the first real-time, one-step streaming diffusion framework for video super-resolution. It addresses high latency and computation through innovations like distillation, sparse attention, and a tiny decoder. FlashVSR achieves state-of-the-art performance with up to 12x speedup.

🔹 Publication Date: Published on Oct 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.12747
• PDF: https://arxiv.org/pdf/2510.12747
• Project Page: https://zhuang2002.github.io/FlashVSR/
• Github: https://github.com/OpenImagingLab/FlashVSR

🔹 Models citing this paper:
https://huggingface.co/JunhaoZhuang/FlashVSR
https://huggingface.co/JunhaoZhuang/FlashVSR-v1.1

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#FlashVSR #VideoSuperResolution #RealTimeAI #DiffusionModels #ComputerVision
🔥1
Beyond English: Toward Inclusive and Scalable Multilingual Machine Translation with LLMs

📝 Summary:
LMT introduces new multilingual translation models covering 60 languages, centered on Chinese and English. It uses Strategic Downsampling and Parallel Multilingual Prompting to improve translation quality and cross-lingual transfer, achieving state-of-the-art performance.

🔹 Publication Date: Published on Nov 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.07003
• PDF: https://arxiv.org/pdf/2511.07003
• Project Page: https://github.com/NiuTrans/LMT
• Github: https://github.com/NiuTrans/LMT

🔹 Models citing this paper:
https://huggingface.co/NiuTrans/LMT-60-1.7B
https://huggingface.co/NiuTrans/LMT-60-0.6B-Base
https://huggingface.co/NiuTrans/LMT-60-0.6B

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#MultilingualTranslation #LLMs #MachineTranslation #NLP #AI
🔥1
Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation

📝 Summary:
Ming-UniAudio introduces a unified speech LLM and tokenizer for joint understanding, generation, and instruction-based free-form editing. It overcomes token representation issues, achieves state-of-the-art results, and establishes a new benchmark for editing.

🔹 Publication Date: Published on Oct 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.05516
• PDF: https://arxiv.org/pdf/2511.05516
• Project Page: https://xqacmer.github.io/Ming-Unitok-Audio.github.io/
• Github: https://github.com/inclusionAI/Ming-UniAudio

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#SpeechLLM #AI #NLP #GenerativeAI #MachineLearning
Intelligence per Watt: Measuring Intelligence Efficiency of Local AI

📝 Summary:
Intelligence per Watt IPW, accuracy per watt, is proposed to measure local AI efficiency. Local small LMs accurately answer 88.7% of queries, showing 5.3x IPW improvement and outperforming cloud accelerators. This proves local inference can redistribute demand from centralized cloud infrastructure.

🔹 Publication Date: Published on Nov 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.07885
• PDF: https://arxiv.org/pdf/2511.07885

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #LocalAI #EnergyEfficiency #LLM #EdgeComputing
DynaAct: Large Language Model Reasoning with Dynamic Action Spaces

📝 Summary:
DynaAct is a framework that uses large language models to automatically construct a compact action space for sequential decision-making. This method enhances reasoning performance and efficiency by selecting optimal actions based on utility and diversity. Experiments show significant improvements...

🔹 Publication Date: Published on Nov 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.08043
• PDF: https://arxiv.org/pdf/2511.08043
• Github: https://github.com/zhaoxlpku/DynaAct

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LLM #ArtificialIntelligence #MachineLearning #Reasoning #DecisionMaking
👍1
This media is not supported in your browser
VIEW IN TELEGRAM
Optimizing Diversity and Quality through Base-Aligned Model Collaboration

📝 Summary:
BACo is a token-level collaboration framework for LLMs. It dynamically combines a base model with its aligned counterpart to improve both output diversity and quality during inference. BACo consistently outperforms baselines, achieving significant joint improvement.

🔹 Publication Date: Published on Nov 7

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.05650
• PDF: https://arxiv.org/pdf/2511.05650

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LLMs #AI #MachineLearning #NLP #ModelCollaboration
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces

📝 Summary:
FilmAgent is an LLM-based multi-agent framework that automates end-to-end virtual film production, covering noscriptwriting, cinematography, and actor positioning. Human evaluations show it outperforms baselines, proving multi-agent collaboration is feasible for filmmaking.

🔹 Publication Date: Published on Jan 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2501.12909
• PDF: https://huggingface.co/papers/2501.11233
• Project Page: https://filmagent.github.io/
• Github: https://filmagent.github.io/

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #LLM #VirtualProduction #MultiAgentSystems #Filmmaking
1
🤖🧠 Nanobrowser: The Open-Source AI Web Automation Tool Changing How We Browse

🗓️ 12 Nov 2025
📚 AI News & Trends

The rise of artificial intelligence has redefined how we interact with the web, transforming routine browsing into a space for automation and productivity. Among the most exciting innovations in this field is Nanobrowser, an open-source AI-powered web automation tool designed to run directly inside your browser. Developed as a free alternative to OpenAI Operator, Nanobrowser ...

#Nanobrowser #AIWebAutomation #OpenSourceTools #BrowserAI #ProductivityTech #AIAutomation
Beyond Fact Retrieval: Episodic Memory for RAG with Generative Semantic Workspaces

📝 Summary:
The Generative Semantic Workspace GSW enhances LLMs for long-context reasoning and episodic memory. This neuro-inspired framework builds structured representations of evolving situations, outperforming RAG baselines by 20% and reducing context tokens by 51%. GSW provides human-like episodic memor...

🔹 Publication Date: Published on Nov 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.07587
• PDF: https://arxiv.org/pdf/2511.07587

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LLMs #RAG #EpisodicMemory #GenerativeAI #NeuroAI
🤖🧠 Claude-Flow v2.7: The Next Generation of Enterprise AI Orchestration

🗓️ 12 Nov 2025
📚 AI News & Trends

Artificial intelligence is rapidly transforming software development, research and enterprise workflows. As AI models become increasingly complex, managing, coordinating and optimizing them efficiently has become a critical challenge. Enter Claude-Flow v2.7, an advanced AI orchestration platform that blends multi-agent intelligence, persistent memory and swarm-based coordination to deliver enterprise-level automation and reasoning at scale. Developed by ...

#ClaudeFlow #EnterpriseAI #AIOrchestration #MultiAgentSystems #AIAutomation #PersistentMemory
🤖🧠 Bytebot: The Future of AI Desktop Automation

🗓️ 12 Nov 2025
📚 AI News & Trends

In the era of rapid digital transformation, automation is the driving force behind business efficiency and innovation. While most AI agents are limited to browsers or APIs, a groundbreaking open-source project called Bytebot has redefined what AI can achieve. Bytebot introduces a self-hosted AI desktop agent — a virtual computer that performs complex, multi-step tasks ...

#Bytebot #AIDesktopAutomation #SelfHostedAI #OpenSourceAI #AIAgents #TaskAutomation
1
TimeSearch-R: Adaptive Temporal Search for Long-Form Video Understanding via Self-Verification Reinforcement Learning

📝 Summary:
TimeSearch-R improves long-form video understanding by optimizing temporal search with reinforcement learning. It uses GRPO-CSV to verify searched frame completeness, leading to improved reasoning. This achieves state-of-the-art performance on multiple video benchmarks.

🔹 Publication Date: Published on Nov 7

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.05489
• PDF: https://arxiv.org/pdf/2511.05489
• Github: https://github.com/Time-Search/TimeSearch-R

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#VideoUnderstanding #ReinforcementLearning #DeepLearning #AIResearch #ComputerVision
TiDAR: Think in Diffusion, Talk in Autoregression

📝 Summary:
TiDAR is a hybrid diffusion-autoregressive model achieving high throughput and AR-level quality. It drafts tokens with diffusion and samples autoregressively in a single pass, outperforming existing methods and delivering 4.71x to 5.91x faster generation.

🔹 Publication Date: Published on Nov 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.08923
• PDF: https://arxiv.org/pdf/2511.08923

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #MachineLearning #DiffusionModels #AutoregressiveModels #GenerativeAI
This media is not supported in your browser
VIEW IN TELEGRAM
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

📝 Summary:
Lumine introduces an open recipe for generalist agents in 3D open worlds. This vision-language model-based agent processes pixels to perform complex, hours-long missions with human efficiency and demonstrates strong zero-shot generalization across diverse games like Genshin Impact and Honkai Star...

🔹 Publication Date: Published on Nov 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.08892
• PDF: https://arxiv.org/pdf/2511.08892
• Project Page: https://www.lumine-ai.org/

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#GeneralistAI #VisionLanguageModel #3DWorlds #AIagents #GamingAI
This media is not supported in your browser
VIEW IN TELEGRAM
Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising

📝 Summary:
Time-to-Move TTM is a training-free framework for precise motion and appearance controlled video generation using I2V diffusion models. It employs crude reference animations as motion cues and introduces dual-clock denoising for flexible alignment, outperforming training-based methods.

🔹 Publication Date: Published on Nov 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.08633
• PDF: https://arxiv.org/pdf/2511.08633
• Project Page: https://time-to-move.github.io/
• Github: https://github.com/time-to-move/TTM

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#VideoGeneration #DiffusionModels #GenerativeAI #MotionControl #ComputerVision