ML Research Hub – Telegram
ML Research Hub
32.6K subscribers
3.9K photos
210 videos
23 files
4.18K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Universal Reasoning Model

📝 Summary:
The Universal Reasoning Model URM enhances Universal Transformers with short convolution and truncated backpropagation. This approach substantially improves reasoning performance on ARC-AGI tasks, achieving state-of-the-art results.

🔹 Publication Date: Published on Dec 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.14693
• PDF: https://arxiv.org/pdf/2512.14693
• Github: https://github.com/zitian-gao/URM

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
VABench: A Comprehensive Benchmark for Audio-Video Generation

📝 Summary:
VABench is a benchmark framework for evaluating audio-video generation models, covering text-to-audio-video, image-to-audio-video, and stereo audio-video tasks with 15 evaluation dimensions. AI-genera...

🔹 Publication Date: Published on Dec 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.09299
• PDF: https://arxiv.org/pdf/2512.09299
• Github: https://github.com/tanABCC/VABench

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning

📝 Summary:
G2RL, a gradient-guided reinforcement learning framework, enhances exploration in large language models by leveraging the model's own update geometry, leading to improved performance on various reason...

🔹 Publication Date: Published on Dec 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.15687
• PDF: https://arxiv.org/pdf/2512.15687

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?

📝 Summary:
A benchmark evaluates the performance of vision-language models on understanding long-context information compressed into dense visual representations, revealing significant limitations in capturing l...

🔹 Publication Date: Published on Dec 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.15649
• PDF: https://arxiv.org/pdf/2512.15649
• Github: https://github.com/Moenupa/VTCBench

Datasets citing this paper:
https://huggingface.co/datasets/MLLM-CL/VTCBench

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
SCOPE: Prompt Evolution for Enhancing Agent Effectiveness

📝 Summary:
SCOPE enhances LLM agents' context management through prompt evolution, improving task success rates in dynamic environments without human intervention. AI-generated summary Large Language Model (LLM)...

🔹 Publication Date: Published on Dec 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.15374
• PDF: https://arxiv.org/pdf/2512.15374
• Github: https://github.com/JarvisPei/SCOPE

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Simultaneous Tactile-Visual Perception for Learning Multimodal Robot Manipulation

📝 Summary:
TacThru-UMI, a system combining a TacThru sensor with a Transformer-based Diffusion Policy, achieves superior performance in robotic manipulation tasks by integrating simultaneous multimodal perceptio...

🔹 Publication Date: Published on Dec 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.09851
• PDF: https://arxiv.org/pdf/2512.09851
• Project Page: https://tacthru.yuyang.li/
• Github: https://github.com/YuyangLee/TacThru

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
DEER: Draft with Diffusion, Verify with Autoregressive Models

📝 Summary:
DEER is a novel speculative decoding framework that uses diffusion large language models for drafting, overcoming limitations of autoregressive drafters. It achieves significantly longer draft acceptance lengths and much faster LLM decoding speeds, outperforming existing methods like EAGLE-3.

🔹 Publication Date: Published on Dec 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.15176
• PDF: https://arxiv.org/pdf/2512.15176

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning

📝 Summary:
Skyra, a specialized multimodal large language model, detects and explains visual artifacts in AI-generated videos using a novel dataset and two-stage training strategy, outperforming existing methods...

🔹 Publication Date: Published on Dec 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.15693
• PDF: https://arxiv.org/pdf/2512.15693
• Project Page: https://joeleelyf.github.io/Skyra/
• Github: https://github.com/JoeLeelyf/Skyra

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

📝 Summary:
Jacobi Forcing is a progressive distillation method that enables efficient parallel decoding of transformer-based models while maintaining performance, significantly reducing inference latency. AI-gen...

🔹 Publication Date: Published on Dec 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.14681
• PDF: https://arxiv.org/pdf/2512.14681
• Github: https://github.com/hao-ai-lab/JacobiForcing

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models

📝 Summary:
DiffusionVL, a family of diffusion vision language models derived from autoregressive models through fine-tuning, achieves performance improvements and faster inference speeds compared to existing mod...

🔹 Publication Date: Published on Dec 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.15713
• PDF: https://arxiv.org/pdf/2512.15713

🔹 Models citing this paper:
https://huggingface.co/hustvl/DiffusionVL-Qwen2.5VL-3B
https://huggingface.co/hustvl/DiffusionVL-Qwen2.5VL-7B

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

📝 Summary:
Qwen-Image-Layered decomposes images into semantically disentangled RGBA layers using a diffusion model, enabling independent editing of each layer and improving decomposition quality and consistency....

🔹 Publication Date: Published on Dec 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.15603
• PDF: https://arxiv.org/pdf/2512.15603

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Step-GUI Technical Report

📝 Summary:
A self-evolving training pipeline with the Calibrated Step Reward System and GUI-MCP protocol improve GUI automation efficiency, accuracy, and privacy in real-world scenarios. AI-generated summary Rec...

🔹 Publication Date: Published on Dec 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.15431
• PDF: https://arxiv.org/pdf/2512.15431

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Robust and Calibrated Detection of Authentic Multimedia Content

📝 Summary:
A resynthesis framework enhances deepfake detection by verifying authenticity with low false positive rates and robustness against efficient adversaries, supporting multiple modalities. AI-generated s...

🔹 Publication Date: Published on Dec 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.15182
• PDF: https://arxiv.org/pdf/2512.15182

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets

📝 Summary:
Nano Banana Pro excels in subjective visual quality across low-level vision tasks without fine-tuning but struggles with traditional reference-based quantitative metrics due to generative model stocha...

🔹 Publication Date: Published on Dec 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.15110
• PDF: https://arxiv.org/pdf/2512.15110
• Project Page: https://lowlevelbanana.github.io/

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning

📝 Summary:
The paper proposes SAGE, a multi-turn reasoning system for video that mimics human behavior, using synthetic data and reinforcement learning to improve performance on long videos. AI-generated summary...

🔹 Publication Date: Published on Dec 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.13874
• PDF: https://arxiv.org/pdf/2512.13874
• Project Page: https://praeclarumjj3.github.io/sage/
• Github: https://github.com/allenai/SAGE

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
In Pursuit of Pixel Supervision for Visual Pre-training

📝 Summary:
Pixio, an enhanced masked autoencoder, demonstrates competitive performance across various downstream tasks using pixel-space self-supervised learning, outperforming latent-space approaches. AI-genera...

🔹 Publication Date: Published on Dec 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.15715
• PDF: https://arxiv.org/pdf/2512.15715
• Project Page: https://github.com/facebookresearch/pixio
• Github: https://github.com/facebookresearch/pixio

🔹 Models citing this paper:
https://huggingface.co/facebook/pixio-vitb16
https://huggingface.co/facebook/pixio-vitl16
https://huggingface.co/facebook/pixio-vit1b16

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
End-to-End Training for Autoregressive Video Diffusion via Self-Resampling

📝 Summary:
Resampling Forcing is a teacher-free framework to train autoregressive video diffusion models. It uses self-resampling to simulate inference errors and history routing for efficient long video generation. This approach improves temporal consistency and achieves comparable performance to teacher-b...

🔹 Publication Date: Published on Dec 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.15702
• PDF: https://arxiv.org/pdf/2512.15702

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
LikeBench: Evaluating Subjective Likability in LLMs for Personalization

📝 Summary:
LikeBench introduces a multi-session evaluation framework to measure the likability of LLMs by their ability to adapt to user preferences across multiple dimensions, demonstrating that strong memory p...

🔹 Publication Date: Published on Dec 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.13077
• PDF: https://arxiv.org/pdf/2512.13077

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This channels is for Programmers, Coders, Software Engineers.

0️⃣ Python
1️⃣ Data Science
2️⃣ Machine Learning
3️⃣ Data Visualization
4️⃣ Artificial Intelligence
5️⃣ Data Analysis
6️⃣ Statistics
7️⃣ Deep Learning
8️⃣ programming Languages

https://news.1rj.ru/str/addlist/8_rRW2scgfRhOTc0

https://news.1rj.ru/str/Codeprogrammer
Please open Telegram to view this post
VIEW IN TELEGRAM
1
IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning

📝 Summary:
IC-Effect is an instruction-guided DiT framework for precise video VFX editing. It synthesizes complex effects with spatial-temporal consistency by leveraging contextual learning, a two-stage training strategy, and sparse tokenization, outperforming existing models.

🔹 Publication Date: Published on Dec 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.15635
• PDF: https://arxiv.org/pdf/2512.15635

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
WAY: Estimation of Vessel Destination in Worldwide AIS Trajectory

📝 Summary:
A novel deep learning architecture, WAY, uses nested sequence structures and spatial grids for accurate long-term vessel destination estimation from AIS data, incorporating CASP blocks and Gradient Dr...

🔹 Publication Date: Published on Dec 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.13190
• PDF: https://arxiv.org/pdf/2512.13190
• Github: https://github.com/sadPororo/WAY

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1