ML Research Hub – Telegram
ML Research Hub
32.7K subscribers
4.03K photos
230 videos
23 files
4.34K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
🤖🧠 Open WebUI: The Most Powerful Self-Hosted AI Platform for Local and Private LLMs

🗓️ 09 Nov 2025
📚 AI News & Trends

In the rapidly evolving landscape of artificial intelligence, the ability to run large language models securely and efficiently has become a major priority for developers, enterprises and privacy-focused users. While cloud-based AI services are convenient, they rely heavily on remote servers, internet access and third-party control. This is where Open WebUI stands out as a ...

#OpenWebUI #SelfHostedAI #PrivateLLMs #LocalAI #AISecurity #OpenSourcePlatform
🤖🧠 Generative AI for Beginners: A Complete Guide to Microsoft’s Free Course

🗓️ 09 Nov 2025
📚 AI News & Trends

Generative AI has rapidly shifted from an emerging technology to a foundation of modern digital innovation. From automated writing assistants and AI chatbots to image generators and intelligent search engines, generative AI is transforming industries and shaping the future of work. Whether you are a student, a budding developer or a technology enthusiast, learning generative ...

#GenerativeAI #BeginnersGuide #MicrosoftAI #FreeCourse #AIEducation #DigitalInnovation
Media is too big
VIEW IN TELEGRAM
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development

📝 Summary:
ComfyUI-Copilot is an LLM and multi-agent system that enhances ComfyUI's usability. It provides intelligent recommendations and automated one-click workflow construction, lowering entry barriers for beginners and boosting efficiency for experienced users.

🔹 Publication Date: Published on Jun 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2506.05010
• PDF: https://arxiv.org/pdf/2506.05010
• Project Page: https://x.com/wangly0229/status/1923515826713526583
• Github: https://github.com/AIDC-AI/ComfyUI-Copilot

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LLM #MultiAgent #ComfyUI #AI #WorkflowAutomation
OpenVoice: Versatile Instant Voice Cloning

📝 Summary:
OpenVoice is a versatile voice cloning method using a short audio clip. It provides flexible control over voice styles and achieves zero-shot cross-lingual cloning for new languages without extensive training data. It is also highly efficient.

🔹 Publication Date: Published on Dec 3, 2023

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2312.01479
• PDF: https://arxiv.org/pdf/2312.01479
• Github: https://github.com/myshell-ai/openvoice

🔹 Models citing this paper:
https://huggingface.co/rsxdalv/OpenVoiceV2
https://huggingface.co/ameerazam08/Udiff
https://huggingface.co/flopml/OpenVoice-v2

Datasets citing this paper:
https://huggingface.co/datasets/tsinghua-ee/QualiSpeech
https://huggingface.co/datasets/dlxjj/Openvoice
https://huggingface.co/datasets/Pendrokar/open_tts_tracker

Spaces citing this paper:
https://huggingface.co/spaces/Russell1123213123/testOpenVoice
https://huggingface.co/spaces/gauthamk28/gauthamk28_voice
https://huggingface.co/spaces/blayks07/OpenVoice-main

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#VoiceCloning #AIResearch #SpeechSynthesis #ZeroShotLearning #CrossLingualAI
Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

📝 Summary:
LLMs struggle to authentically role-play villains due to safety alignment, showing a monotonic decline in fidelity as character morality decreases. The Moral RolePlay benchmark reveals models struggle with traits like deceit and manipulation, highlighting a tension between model safety and creati...

🔹 Publication Date: Published on Nov 7

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.04962
• PDF: https://arxiv.org/pdf/2511.04962
• Github: https://github.com/Tencent/DigitalHuman/tree/main/RolePlay_Villain

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LLM #AI #AISafety #RolePlaying #NLP
Visual Spatial Tuning

📝 Summary:
Visual Spatial Tuning VST is a framework that progressively trains Vision-Language Models VLMs using specialized datasets VST-P for spatial perception and VST-R for reasoning. VST achieves state-of-the-art results on spatial benchmarks without harming general VLM capabilities, leading to more phy...

🔹 Publication Date: Published on Nov 7

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.05491
• PDF: https://arxiv.org/pdf/2511.05491
• Project Page: https://yangr116.github.io/vst_project/
• Github: https://github.com/Yangr116/VST

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#VisionLanguageModels #SpatialAI #ComputerVision #DeepLearning #AIResearch
Dense Motion Captioning

📝 Summary:
The paper introduces Dense Motion Captioning, a new task for 3D human motion understanding. It presents CompMo, a large dataset with complex, temporally annotated motions, and DEMO, a model combining a language model with a motion adapter to generate detailed, grounded captions.

🔹 Publication Date: Published on Nov 7

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.05369
• PDF: https://arxiv.org/pdf/2511.05369
• Project Page: https://xusy2333.com/demo/
• Github: https://github.com/41xu/DEMO

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#MotionCaptioning #3DMotion #ComputerVision #LanguageModels #AIResearch
DeepEyesV2: Toward Agentic Multimodal Model

📝 Summary:
DeepEyesV2 is an agentic multimodal model that uses a two-stage training pipeline for robust tool integration. This method, combining a cold-start stage and reinforcement learning, effectively enables task-adaptive tool invocation for real-world reasoning tasks.

🔹 Publication Date: Published on Nov 7

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.05271
• PDF: https://arxiv.org/pdf/2511.05271
• Project Page: https://visual-agent.github.io/
• Github: https://github.com/Visual-Agent/DeepEyes

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#MultimodalAI #AgenticAI #ReinforcementLearning #DeepLearning #AIResearch
Towards Mitigating Hallucinations in Large Vision-Language Models by Refining Textual Embeddings

📝 Summary:
Large Vision-Language Models suffer from language bias leading to hallucinations. Our method refines textual embeddings by integrating average-pooled visual features. This simple approach improves visual grounding and reduces hallucinations.

🔹 Publication Date: Published on Nov 7

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.05017
• PDF: https://arxiv.org/pdf/2511.05017

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#VisionLanguageModels #AIHallucinations #VisualGrounding #DeepLearning #NLP
Jailbreaking in the Haystack

📝 Summary:
NINJA is a new jailbreak method for long-context LMs. It appends benign content to harmful goals, exploiting goal positioning. This significantly increases attack success rates, revealing fundamental vulnerabilities in modern models.

🔹 Publication Date: Published on Nov 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.04707
• PDF: https://arxiv.org/pdf/2511.04707
• Project Page: https://ar-forum.github.io/ninjaattackweb/
• Github: https://github.com/AR-FORUM/NINJA_Attack

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LLM #Jailbreaking #AISafety #AI #Cybersecurity
🤖🧠 DeepAgent: A New Era of General AI Reasoning and Scalable Tool-Use Intelligence

🗓️ 09 Nov 2025
📚 AI News & Trends

Artificial intelligence has rapidly progressed from simple assistants to advanced reasoning systems capable of complex problem-solving. As tasks demand more autonomy, adaptability and real-world interaction, the AI field has entered the era of intelligent agent systems. These agents are expected not just to answer questions, but to think, plan, search, act and interact across digital ...

#GeneralAI #ArtificialIntelligence #AIReasoning #IntelligentAgents #ScalableAI #ToolUseAI
Part II: ROLL Flash -- Accelerating RLVR and Agentic Training with Asynchrony

📝 Summary:
ROLL Flash enhances LLM RL post-training using asynchronous methods. It employs fine-grained parallelism and rollout-train decoupling to boost resource use and scalability. This achieves up to 2.72x speedup while matching synchronous training performance.

🔹 Publication Date: Published on Oct 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.11345
• PDF: https://arxiv.org/pdf/2510.11345
• Github: https://github.com/alibaba/ROLL

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LLM #ReinforcementLearning #AsynchronousAI #DeepLearning #AIResearch
🤖🧠 PokeeResearch: Advancing Deep Research with AI and Web-Integrated Intelligence

🗓️ 09 Nov 2025
📚 AI News & Trends

In the modern information era, the ability to research fast, accurately and at scale has become a competitive advantage for businesses, researchers, analysts and developers. As online data expands exponentially, traditional search engines and manual research workflows are no longer sufficient to gather reliable insights efficiently. This need has fueled the rise of AI research ...

#AIResearch #DeepResearch #WebIntelligence #ArtificialIntelligence #ResearchAutomation #DataAnalysis
🤖🧠 Pico-Banana-400K: The Breakthrough Dataset Advancing Text-Guided Image Editing

🗓️ 09 Nov 2025
📚 AI News & Trends

Text-guided image editing has rapidly evolved with powerful multimodal models capable of transforming images using simple natural-language instructions. These models can change object colors, modify lighting, add accessories, adjust backgrounds or even convert real photographs into artistic styles. However, the progress of research has been limited by one crucial bottleneck: the lack of large-scale, high-quality, ...

#TextGuidedEditing #MultimodalAI #ImageEditing #AIResearch #ComputerVision #DeepLearning
Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library

📝 Summary:
ROLL is an efficient, scalable, and user-friendly library for large-scale reinforcement learning optimization. It features a simplified architecture, parallel training, flexible sample management, and resource mapping for developers and researchers.

🔹 Publication Date: Published on Jun 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2506.06122
• PDF: https://arxiv.org/pdf/2506.06122
• Github: https://github.com/alibaba/roll

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#ReinforcementLearning #MachineLearning #LargeScaleAI #Optimization #AIResearch
🤖🧠 Concerto: How Joint 2D-3D Self-Supervised Learning Is Redefining Spatial Intelligence

🗓️ 09 Nov 2025
📚 AI News & Trends

The world of artificial intelligence is rapidly evolving and self-supervised learning has become a driving force behind breakthroughs in computer vision and 3D scene understanding. Traditional supervised learning relies heavily on labeled datasets which are expensive and time-consuming to produce. Self-supervised learning, on the other hand, extracts meaningful patterns without manual labels allowing models to ...

#SelfSupervisedLearning #ComputerVision #3DSceneUnderstanding #SpatialIntelligence #AIResearch #DeepLearning
CritiCal: Can Critique Help LLM Uncertainty or Confidence Calibration?

📝 Summary:
CritiCal, a novel training method using natural language critiques, significantly improves LLM confidence calibration. This method outperforms other approaches, including GPT-4o, enhancing reliability and generalization across tasks.

🔹 Publication Date: Published on Oct 28

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.24505
• PDF: https://arxiv.org/pdf/2510.24505

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LLM #ConfidenceCalibration #MachineLearning #NLP #AIResearch
HAFixAgent: History-Aware Automated Program Repair Agent

📝 Summary:
HAFixAgent enhances automated program repair for complex multi-hunk bugs by incorporating repository history. It significantly improves bug-fixing effectiveness over existing agent-based systems while maintaining efficiency. This offers a practical approach for history-aware agentic APR.

🔹 Publication Date: Published on Nov 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.01047
• PDF: https://arxiv.org/pdf/2511.01047

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AutomatedProgramRepair #SoftwareEngineering #AI #BugFixing #CodeRepair