ML Research Hub – Telegram
ML Research Hub
32.8K subscribers
4.41K photos
272 videos
23 files
4.77K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

📝 Summary:
SmolDocling is a compact vision-language model that performs end-to-end document conversion with robust performance across various document types using 256M parameters and a new markup format. AI-gene...

🔹 Publication Date: Published on Mar 14, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2503.11576
• PDF: https://huggingface.co/papers/2502.18443
• Project Page: https://huggingface.co/spaces/docling-project/SmolDocling-256M-Demo
• Github: https://github.com/docling-project/docling

🔹 Models citing this paper:
https://huggingface.co/docling-project/SmolDocling-256M-preview
https://huggingface.co/ibm-granite/granite-docling-258M
https://huggingface.co/prithivMLmods/granite-docling-258M-f32-GGUF

Datasets citing this paper:
https://huggingface.co/datasets/HuggingFaceM4/DoclingMatix
https://huggingface.co/datasets/docling-project/SynthCodeNet
https://huggingface.co/datasets/docling-project/SynthFormulaNet

Spaces citing this paper:
https://huggingface.co/spaces/ibm-granite/granite-docling-258m-demo
https://huggingface.co/spaces/ibm-granite/granite-docling-258M-WebGPU
https://huggingface.co/spaces/docling-project/SmolDocling-256M-Demo

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos

📝 Summary:
VideoRAG enhances large language models for multi-modal video processing with a dual-channel architecture that integrates textual knowledge grounding and multi-modal context encoding. AI-generated sum...

🔹 Publication Date: Published on Feb 3, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2502.01549
• PDF: https://arxiv.org/pdf/2502.01549
• Github: https://github.com/hkuds/videorag

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Agent READMEs: An Empirical Study of Context Files for Agentic Coding

📝 Summary:
Agentic coding tools receive goals written in natural language as input, break them down into specific tasks, and write or execute the actual code with minimal human intervention. Central to this proc...

🔹 Publication Date: Published on Nov 17, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.12884
• PDF: https://arxiv.org/pdf/2511.12884
• Project Page: https://huggingface.co/papers/2511.03404
• Github: https://github.com/openai/agents.md

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Bitnet.cpp: Efficient Edge Inference for Ternary LLMs

📝 Summary:
Bitnet.cpp enhances edge inference for ternary LLMs using a novel mixed-precision matrix multiplication library, achieving significant speed improvements over baselines. AI-generated summary The adven...

🔹 Publication Date: Published on Feb 17, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2502.11880
• PDF: https://arxiv.org/pdf/2502.11880
• Github: https://github.com/microsoft/BitNet/tree/paper

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
BitNet Distillation

📝 Summary:
BitNet Distillation fine-tunes large language models to 1.58-bit precision using SubLN, multi-head attention distillation, and continual pre-training, achieving comparable performance with significant...

🔹 Publication Date: Published on Oct 15, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13998
• PDF: https://arxiv.org/pdf/2510.13998
• Github: https://github.com/microsoft/BitNet

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

📝 Summary:
PaddleOCR-VL, a vision-language model combining NaViT-style dynamic resolution and ERNIE, achieves state-of-the-art performance in document parsing and element recognition with high efficiency. AI-gen...

🔹 Publication Date: Published on Oct 16, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14528
• PDF: https://arxiv.org/pdf/2510.14528
• Github: https://github.com/PaddlePaddle/PaddleOCR

🔹 Models citing this paper:
https://huggingface.co/PaddlePaddle/PaddleOCR-VL
https://huggingface.co/PaddlePaddle/PP-DocLayoutV2
https://huggingface.co/unsloth/PaddleOCR-VL

Spaces citing this paper:
https://huggingface.co/spaces/PaddlePaddle/PaddleOCR-VL_Online_Demo
https://huggingface.co/spaces/seanpedrickcase/document_redaction
https://huggingface.co/spaces/markobinario/PaddleOCR-VL_Online_Demo

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Efficient Memory Management for Large Language Model Serving with PagedAttention

📝 Summary:
PagedAttention algorithm and vLLM system enhance the throughput of large language models by efficiently managing memory and reducing waste in the key-value cache. AI-generated summary High throughput ...

🔹 Publication Date: Published on Sep 12, 2023

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2309.06180
• PDF: https://arxiv.org/pdf/2309.06180
• Github: https://github.com/vllm-project/vllm

🔹 Models citing this paper:
https://huggingface.co/theonlyengine/Flash-attention1

Datasets citing this paper:
https://huggingface.co/datasets/TheBlueScrubs/TheBlueScrubs-v1

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

📝 Summary:
MinerU2.5, a 1.2B-parameter document parsing vision-language model, achieves state-of-the-art recognition accuracy with computational efficiency through a coarse-to-fine parsing strategy. AI-generated...

🔹 Publication Date: Published on Sep 26, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.22186
• PDF: https://arxiv.org/pdf/2509.22186
• Project Page: https://opendatalab.github.io/MinerU/
• Github: https://github.com/opendatalab/MinerU

🔹 Models citing this paper:
https://huggingface.co/opendatalab/MinerU2.5-2509-1.2B
https://huggingface.co/freakynit/MinerU2.5-2509-1.2B
https://huggingface.co/Mungert/MinerU2.5-2509-1.2B-GGUF

Spaces citing this paper:
https://huggingface.co/spaces/opendatalab/MinerU
https://huggingface.co/spaces/xiaoye-winters/MinerU-API
https://huggingface.co/spaces/pzp5700/Paper2Any

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
HunyuanVideo 1.5 Technical Report

📝 Summary:
HunyuanVideo 1.5 is a lightweight video generation model with state-of-the-art visual quality and motion coherence, using a DiT architecture with SSTA and an efficient video super-resolution network. ...

🔹 Publication Date: Published on Nov 24, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.18870
• PDF: https://arxiv.org/pdf/2511.18870
• Github: https://github.com/Tencent-Hunyuan/HunyuanVideo-1.5

🔹 Models citing this paper:
https://huggingface.co/tencent/HunyuanVideo-1.5
https://huggingface.co/EvanEternal/Astra

Spaces citing this paper:
https://huggingface.co/spaces/gagndeep/HF-Worldplay
https://huggingface.co/spaces/akhaliq/anycoder-355bd392
https://huggingface.co/spaces/Xenurox/tencent-HunyuanVideo-1.5

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
UniVideo: Unified Understanding, Generation, and Editing for Videos

📝 Summary:
UniVideo, a dual-stream framework combining a Multimodal Large Language Model and a Multimodal DiT, extends unified modeling to video generation and editing, achieving state-of-the-art performance and...

🔹 Publication Date: Published on Oct 9, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08377
• PDF: https://arxiv.org/pdf/2510.08377
• Project Page: https://congwei1230.github.io/UniVideo/
• Github: https://github.com/KwaiVGI/UniVideo

🔹 Models citing this paper:
https://huggingface.co/KlingTeam/UniVideo

Spaces citing this paper:
https://huggingface.co/spaces/Harryji168/univideo-studio

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
MinerU: An Open-Source Solution for Precise Document Content Extraction

📝 Summary:
MinerU is an open-source tool that enhances document content extraction using fine-tuned models and pre/postprocessing rules across diverse document types. AI-generated summary Document content analys...

🔹 Publication Date: Published on Sep 27, 2024

🔹 Paper Links:
• arXiv Page: https://arxiv.org/pdf/2409.18839
• PDF: https://huggingface.co/spaces/Echo9k/PDF_reader
• Github: https://github.com/opendatalab/MinerU

🔹 Models citing this paper:
https://huggingface.co/jiaxianustc/BioMiner-MinerU-Model

Spaces citing this paper:
https://huggingface.co/spaces/opendatalab/MinerU
https://huggingface.co/spaces/xiaoye-winters/MinerU-API
https://huggingface.co/spaces/Hunter0000/MinerU

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
TradingAgents: Multi-Agents LLM Financial Trading Framework

📝 Summary:
A multi-agent framework using large language models for stock trading simulates real-world trading firms, improving performance metrics like cumulative returns and Sharpe ratio. AI-generated summary S...

🔹 Publication Date: Published on Dec 28, 2024

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2412.20138
• PDF: https://arxiv.org/pdf/2412.20138
• Github: https://github.com/tauricresearch/tradingagents

Spaces citing this paper:
https://huggingface.co/spaces/shanghengdu/LLM-Agent-Optimization-PaperList
https://huggingface.co/spaces/Ervin2077/qiu

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1