ML Research Hub – Telegram
ML Research Hub
32.7K subscribers
3.99K photos
226 videos
23 files
4.29K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
ML Research Hub pinned «🔥 NEW YEAR 2026 – PREMIUM nature papers: 400$ Q1 and  Q2 papers    300$ Q3 and Q4 papers   200$ Doctoral thesis (complete)    500$ M.S thesis         300$ paper simulation   150$ Contact me: @Omidyzd62»
LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding

📝 Summary:
LoPA is a training-free algorithm enhancing dLLM inference parallelism by optimizing Token Filling Order. It achieves 10.1 tokens per forward pass for D2F-Dream, significantly boosting efficiency while maintaining performance. A multi-GPU system further accelerates throughput to 1073.9 tokens per...

🔹 Publication Date: Published on Dec 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.16229
• PDF: https://arxiv.org/pdf/2512.16229
• Github: https://zhijie-group.github.io/blogs/lopa

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LLM #AI #Inference #ParallelDecoding #Performance
Does It Tie Out? Towards Autonomous Legal Agents in Venture Capital

📝 Summary:
Automating legal capitalization tie-out in venture capital is difficult for current AI. It requires multi-document reasoning and strict evidence traceability. This paper proposes a world model architecture for automation, advancing applied legal intelligence.

🔹 Publication Date: Published on Dec 21

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.18658
• PDF: https://arxiv.org/pdf/2512.18658

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LegalAI #VentureCapital #AIAutomation #LegalTech #ArtificialIntelligence
This media is not supported in your browser
VIEW IN TELEGRAM
MatSpray: Fusing 2D Material World Knowledge on 3D Geometry

📝 Summary:
MatSpray integrates 2D PBR materials from diffusion models onto 3D Gaussian Splatting geometry. Using projection and neural refinement, it enables accurate relighting and photorealistic rendering from reconstructed scenes. This boosts asset creation efficiency.

🔹 Publication Date: Published on Dec 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.18314
• PDF: https://arxiv.org/pdf/2512.18314
• Project Page: https://matspray.jdihlmann.com/
• Github: https://github.com/cgtuebingen/MatSpray

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#MatSpray #GaussianSplatting #DiffusionModels #3DRendering #ComputerGraphics
1
🔥 NEW YEAR 2026 – PREMIUM

nature papers: 400$

Q1 and  Q2 papers    300$

Q3 and Q4 papers   200$

Doctoral thesis (complete)    500$

M.S thesis         300$

paper simulation   150$

Contact me: @Omidyzd62
Please open Telegram to view this post
VIEW IN TELEGRAM
🚀 Master Data Science & Programming!

Unlock your potential with this curated list of Telegram channels. Whether you need books, datasets, interview prep, or project ideas, we have the perfect resource for you. Join the community today!


🔰 Machine Learning with Python
Learn Machine Learning with hands-on Python tutorials, real-world code examples, and clear explanations for researchers and developers.
https://news.1rj.ru/str/CodeProgrammer

🔖 Machine Learning
Machine learning insights, practical tutorials, and clear explanations for beginners and aspiring data scientists. Follow the channel for models, algorithms, coding guides, and real-world ML applications.
https://news.1rj.ru/str/DataScienceM

🧠 Code With Python
This channel delivers clear, practical content for developers, covering Python, Django, Data Structures, Algorithms, and DSA – perfect for learning, coding, and mastering key programming skills.
https://news.1rj.ru/str/DataScience4

🎯 PyData Careers | Quiz
Python Data Science jobs, interview tips, and career insights for aspiring professionals.
https://news.1rj.ru/str/DataScienceQ

💾 Kaggle Data Hub
Your go-to hub for Kaggle datasets – explore, analyze, and leverage data for Machine Learning and Data Science projects.
https://news.1rj.ru/str/datasets1

🧑‍🎓 Udemy Coupons | Courses
The first channel in Telegram that offers free Udemy coupons
https://news.1rj.ru/str/DataScienceC

😀 ML Research Hub
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.
https://news.1rj.ru/str/DataScienceT

💬 Data Science Chat
An active community group for discussing data challenges and networking with peers.
https://news.1rj.ru/str/DataScience9

🐍 Python Arab| بايثون عربي
The largest Arabic-speaking group for Python developers to share knowledge and help.
https://news.1rj.ru/str/PythonArab

🖊 Data Science Jupyter Notebooks
Explore the world of Data Science through Jupyter Notebooks—insights, tutorials, and tools to boost your data journey. Code, analyze, and visualize smarter with every post.
https://news.1rj.ru/str/DataScienceN

📺 Free Online Courses | Videos
Free online courses covering data science, machine learning, analytics, programming, and essential skills for learners.
https://news.1rj.ru/str/DataScienceV

📈 Data Analytics
Dive into the world of Data Analytics – uncover insights, explore trends, and master data-driven decision making.
https://news.1rj.ru/str/DataAnalyticsX

🎧 Learn Python Hub
Master Python with step-by-step courses – from basics to advanced projects and practical applications.
https://news.1rj.ru/str/Python53

⭐️ Research Papers
Professional Academic Writing & Simulation Services
https://news.1rj.ru/str/DataScienceY

━━━━━━━━━━━━━━━━━━
Admin: @HusseinSheikho
Please open Telegram to view this post
VIEW IN TELEGRAM
1
CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion

📝 Summary:
CASA enhances cross-attention for vision-language models by adding local text-to-text interaction. This approach substantially reduces the performance gap with costly token insertion methods on detailed visual tasks. CASA maintains efficiency and scalability for long-context multimodal applicatio...

🔹 Publication Date: Published on Dec 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.19535
• PDF: https://arxiv.org/pdf/2512.19535
• Project Page: https://kyutai.org/casa
• Github: https://github.com/kyutai-labs/casa

🔹 Models citing this paper:
https://huggingface.co/kyutai/CASA-Helium1-VL-2B

Spaces citing this paper:
https://huggingface.co/spaces/kyutai/casa-samples

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#VisionLanguage #MultimodalAI #AttentionMechanisms #EfficientAI #DeepLearning
3
Over++: Generative Video Compositing for Layer Interaction Effects

📝 Summary:
Over++ introduces augmented compositing, a framework that generates realistic, text-prompted environmental effects for videos. It synthesizes effects like shadows onto video layers while preserving the original scene, outperforming prior methods without dense annotations.

🔹 Publication Date: Published on Dec 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.19661
• PDF: https://arxiv.org/pdf/2512.19661
• Project Page: https://overplusplus.github.io/

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#GenerativeAI #VideoCompositing #VFX #ComputerGraphics #AIResearch
👍1
SecureCode v2.0: A Production-Grade Dataset for Training Security-Aware Code Generation Models

📝 Summary:
SecureCode v2.0 is a production-grade dataset of 1215 security-focused coding examples. It trains AI models to generate secure code by providing real-incident examples with vulnerable and secure implementations, attacks, defense, and operational security context across 11 languages, using a conve...

🔹 Publication Date: Published on Dec 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.18542
• PDF: https://arxiv.org/pdf/2512.18542
• Project Page: https://perfecxion.ai/
• Github: https://github.com/scthornton/securecode-v2

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#Cybersecurity #CodeSecurity #AI #CodeGeneration #Dataset
Step-DeepResearch Technical Report

📝 Summary:
Step-DeepResearch is an end-to-end agent for deep research, using a data synthesis strategy and progressive training. It achieves expert-level capabilities, outperforming existing models and rivaling SOTA closed-source models with cost-efficiency. It also introduces ADR-Bench for realistic Chines...

🔹 Publication Date: Published on Dec 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.20491
• PDF: https://arxiv.org/pdf/2512.20491

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #MachineLearning #DeepResearch #AIagent #SOTA
Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

📝 Summary:
This paper decomposes LLM policies into internal layer and modular policies, revealing distinct reasoning patterns across layers. It finds early layers explore and top layers refine. Motivated by this, Bottom-up Policy Optimization BuPO is proposed to optimize internal layer policies for superior...

🔹 Publication Date: Published on Dec 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.19673
• PDF: https://arxiv.org/pdf/2512.19673

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LLM #PolicyOptimization #DeepLearning #AIResearch #NLP
SAM Audio: Segment Anything in Audio

📝 Summary:
SAM Audio is a foundation model for general audio separation. It unifies text visual and temporal span prompts achieving state-of-the-art performance across diverse audio types. It also introduces a new real-world separation benchmark.

🔹 Publication Date: Published on Dec 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.18099
• PDF: https://arxiv.org/pdf/2512.18099
• Project Page: https://ai.meta.com/samaudio/
• Github: https://github.com/facebookresearch/sam-audio

🔹 Models citing this paper:
https://huggingface.co/facebook/sam-audio-large
https://huggingface.co/facebook/sam-audio-small
https://huggingface.co/facebook/sam-audio-base

Spaces citing this paper:
https://huggingface.co/spaces/lpeterl/sam-audio-webui
https://huggingface.co/spaces/Arrcttacsrks/SAM-Audio-Demo
https://huggingface.co/spaces/chippie1/SAM-Audio-Demo

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AudioSeparation #FoundationModels #AI #DeepLearning #SAMAudio
QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models

📝 Summary:
QuantiPhy is a benchmark that quantitatively assesses state-of-the-art vision perception models' ability to reason about physical properties such as size, velocity, and acceleration from video observa...

🔹 Publication Date: Published on Dec 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.19526
• PDF: https://arxiv.org/pdf/2512.19526

Datasets citing this paper:
https://huggingface.co/datasets/PaulineLi/QuantiPhy-validation

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
SpatialTree: How Spatial Abilities Branch Out in MLLMs

📝 Summary:
SpatialTree introduces a 4-level cognitive hierarchy and benchmark for evaluating MLLM spatial abilities. It reveals distinct skill dependencies and strong cross-level transfer from low to high-level abilities. A novel auto-think strategy consistently enhances performance across all spatial levels.

🔹 Publication Date: Published on Dec 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.20617
• PDF: https://arxiv.org/pdf/2512.20617
• Project Page: https://spatialtree.github.io/

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
SemanticGen: Video Generation in Semantic Space

📝 Summary:
SemanticGen addresses slow convergence and computational costs in video generation by using a two-stage diffusion model approach that first generates semantic features and then VAE latents, leading to...

🔹 Publication Date: Published on Dec 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.20619
• PDF: https://arxiv.org/pdf/2512.20619
• Project Page: https://jianhongbai.github.io/SemanticGen/
• Github: https://jianhongbai.github.io/SemanticGen/

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Reinforcement Learning for Self-Improving Agent with Skill Library

📝 Summary:
A novel RL framework, SAGE, enhances LLM-based agents' self-improvement capabilities by systematically incorporating skills from a skill library, leading to better performance and efficiency in new en...

🔹 Publication Date: Published on Dec 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.17102
• PDF: https://arxiv.org/pdf/2512.17102

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Active Intelligence in Video Avatars via Closed-loop World Modeling

📝 Summary:
Video avatars currently lack agency for autonomous goal pursuit. ORCA introduces a framework for active intelligence, using a closed-loop Observe-Think-Act-Reflect cycle and a dual-system architecture for strategic reasoning and action. It enables robust, goal-directed task completion, transformi...

🔹 Publication Date: Published on Dec 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.20615
• PDF: https://arxiv.org/pdf/2512.20615
• Project Page: https://xuanhuahe.github.io/ORCA/
• Github: https://xuanhuahe.github.io/ORCA/

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
FaithLens: Detecting and Explaining Faithfulness Hallucination

📝 Summary:
FaithLens is a cost-efficient model for detecting and explaining faithfulness hallucinations in LLM outputs. It uses synthesized training data and rule-based reinforcement learning. FaithLens outperforms advanced models like GPT-4.1 on 12 tasks while providing high-quality explanations.

🔹 Publication Date: Published on Dec 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.20182
• PDF: https://arxiv.org/pdf/2512.20182
• Github: https://github.com/S1s-Z/FaithLens

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research