ML Research Hub – Telegram
ML Research Hub
32.7K subscribers
3.93K photos
217 videos
23 files
4.23K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
I'm pleased to invite you to join my private Signal group.

All my resources will be free and unrestricted there. My goal is to build a clean community exclusively for smart programmers, and I believe Signal is the most suitable platform for this (Signal is the second most popular app after WhatsApp in the US), making it particularly suitable for us as programmers.

https://signal.group/#CjQKIPcpEqLQow53AG7RHjeVk-4sc1TFxyym3r0gQQzV-OPpEhCPw_-kRmJ8LlC13l0WiEfp
This media is not supported in your browser
VIEW IN TELEGRAM
SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

📝 Summary:
SCAIL is a framework that improves character animation to studio-grade quality. It uses a novel 3D pose representation and a diffusion-transformer with full-context pose injection, achieving state-of-the-art realism and reliability.

🔹 Publication Date: Published on Dec 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05905
• PDF: https://arxiv.org/pdf/2512.05905

🔹 Models citing this paper:
https://huggingface.co/zai-org/SCAIL-Preview

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#CharacterAnimation #AI #3DAnimation #DeepLearning #ComputerGraphics
This media is not supported in your browser
VIEW IN TELEGRAM
TimesNet-Gen: Deep Learning-based Site Specific Strong Motion Generation

📝 Summary:
TimesNet-Gen is a time-domain deep learning model that effectively synthesizes site-specific strong ground motion records. It uses a station-specific latent bottleneck and outperforms a spectrogram-based baseline, improving earthquake risk assessment.

🔹 Publication Date: Published on Dec 4

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04694
• PDF: https://arxiv.org/pdf/2512.04694
• Project Page: https://huggingface.co/spaces/Barisylmz/TimesNet-Gen
• Github: https://github.com/brsylmz23/TimesNet-Gen/tree/main

Spaces citing this paper:
https://huggingface.co/spaces/Barisylmz/TimesNet-Gen

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#DeepLearning #EarthquakeEngineering #Seismology #GroundMotion #AI
InkSight: Offline-to-Online Handwriting Conversion by Learning to Read and Write

📝 Summary:
InkSight converts offline handwriting to digital ink using novel reading and writing priors. This approach effectively derenders text from diverse photos, generalizing beyond its training and requiring less paired data than prior methods.

🔹 Publication Date: Published on Feb 8, 2024

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2402.05804
• PDF: https://arxiv.org/pdf/2402.05804
• Project Page: https://charlieleee.github.io/publication/inksight
• Github: https://github.com/google-research/inksight

🔹 Models citing this paper:
https://huggingface.co/Derendering/InkSight-Small-p

Datasets citing this paper:
https://huggingface.co/datasets/Derendering/InkSight-Derenderings

Spaces citing this paper:
https://huggingface.co/spaces/Derendering/Model-Output-Playground

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#HandwritingConversion #ComputerVision #DeepLearning #AIResearch #DocumentDigitization
SQ-format: A Unified Sparse-Quantized Hardware-friendly Data Format for LLMs

📝 Summary:
The SQ-format is a unified sparse-quantized data format for LLM post-training quantization. It improves accuracy and efficiency balance by combining sparse and low-precision matrix multiplications. This enables better performance and throughput, especially for outlier activations, supporting next...

🔹 Publication Date: Published on Dec 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05409
• PDF: https://arxiv.org/pdf/2512.05409

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LLMs #Quantization #SparseML #HardwareAcceleration #AIResearch
1
M3DR: Towards Universal Multilingual Multimodal Document Retrieval

📝 Summary:
M3DR is a framework for multilingual multimodal document retrieval that uses contrastive training to achieve robust cross-lingual and cross-modal alignment. It overcomes English-centric limitations, showing state-of-the-art performance across 22 diverse languages.

🔹 Publication Date: Published on Dec 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.03514
• PDF: https://arxiv.org/pdf/2512.03514
• Project Page: https://www.cognitivelab.in/blog/introducing-netraembed
• Github: https://github.com/adithya-s-k/colpali

🔹 Models citing this paper:
https://huggingface.co/Cognitive-Lab/NetraEmbed
https://huggingface.co/Cognitive-Lab/ColNetraEmbed

Datasets citing this paper:
https://huggingface.co/datasets/Cognitive-Lab/NayanaIR-CrossBench

Spaces citing this paper:
https://huggingface.co/spaces/AdithyaSK/NetraEmbed

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#MultimodalAI #InformationRetrieval #NLP #CrossLingualAI #MachineLearning
1
EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture

📝 Summary:
EMMA is an efficient unified architecture for multimodal tasks like understanding, generation, and editing. It uses novel components including an autoencoder, channel-wise concatenation, and mixture-of-experts. EMMA achieves superior performance and efficiency over state-of-the-art unified models.

🔹 Publication Date: Published on Dec 4

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04810
• PDF: https://arxiv.org/pdf/2512.04810
• Project Page: https://emma-umm.github.io/emma/
• Github: https://emma-umm.github.io/emma/

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#MultimodalAI #GenerativeAI #DeepLearning #AIArchitecture #EfficientAI
3
🚀 Master Data Science & Programming!

Unlock your potential with this curated list of Telegram channels. Whether you need books, datasets, interview prep, or project ideas, we have the perfect resource for you. Join the community today!


🔰 Machine Learning with Python
Learn Machine Learning with hands-on Python tutorials, real-world code examples, and clear explanations for researchers and developers.
https://news.1rj.ru/str/CodeProgrammer

🔖 Machine Learning
Machine learning insights, practical tutorials, and clear explanations for beginners and aspiring data scientists. Follow the channel for models, algorithms, coding guides, and real-world ML applications.
https://news.1rj.ru/str/DataScienceM

🧠 Code With Python
This channel delivers clear, practical content for developers, covering Python, Django, Data Structures, Algorithms, and DSA – perfect for learning, coding, and mastering key programming skills.
https://news.1rj.ru/str/DataScience4

🎯 PyData Careers | Quiz
Python Data Science jobs, interview tips, and career insights for aspiring professionals.
https://news.1rj.ru/str/DataScienceQ

💾 Kaggle Data Hub
Your go-to hub for Kaggle datasets – explore, analyze, and leverage data for Machine Learning and Data Science projects.
https://news.1rj.ru/str/datasets1

🧑‍🎓 Udemy Coupons | Courses
The first channel in Telegram that offers free Udemy coupons
https://news.1rj.ru/str/DataScienceC

😀 ML Research Hub
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.
https://news.1rj.ru/str/DataScienceT

💬 Data Science Chat
An active community group for discussing data challenges and networking with peers.
https://news.1rj.ru/str/DataScience9

🐍 Python Arab| بايثون عربي
The largest Arabic-speaking group for Python developers to share knowledge and help.
https://news.1rj.ru/str/PythonArab

🖊 Data Science Jupyter Notebooks
Explore the world of Data Science through Jupyter Notebooks—insights, tutorials, and tools to boost your data journey. Code, analyze, and visualize smarter with every post.
https://news.1rj.ru/str/DataScienceN

📺 Free Online Courses | Videos
Free online courses covering data science, machine learning, analytics, programming, and essential skills for learners.
https://news.1rj.ru/str/DataScienceV

📈 Data Analytics
Dive into the world of Data Analytics – uncover insights, explore trends, and master data-driven decision making.
https://news.1rj.ru/str/DataAnalyticsX

🎧 Learn Python Hub
Master Python with step-by-step courses – from basics to advanced projects and practical applications.
https://news.1rj.ru/str/Python53

⭐️ Research Papers
Professional Academic Writing & Simulation Services
https://news.1rj.ru/str/DataScienceY

━━━━━━━━━━━━━━━━━━
Admin: @HusseinSheikho
Please open Telegram to view this post
VIEW IN TELEGRAM
3
From FLOPs to Footprints: The Resource Cost of Artificial Intelligence

📝 Summary:
This study quantifies the material footprint of AI training, analyzing Nvidia A100 GPUs heavy metal composition. Training GPT-4 demands thousands of GPUs, leading to tons of toxic waste. Optimizing hardware use and lifespan can significantly cut these material costs.

🔹 Publication Date: Published on Dec 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04142
• PDF: https://arxiv.org/pdf/2512.04142

Spaces citing this paper:
https://huggingface.co/spaces/sophia-falk/flops-2-footprints

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AIFootprint #AISustainability #GreenAI #ElectronicWaste #TechEthics
1
Active Video Perception: Iterative Evidence Seeking for Agentic Long Video Understanding

📝 Summary:
Active Video Perception AVP improves long video understanding by actively seeking query-relevant evidence. It uses an iterative plan-observe-reflect process, acquiring compact evidence directly from pixels. This achieves higher accuracy with reduced computational cost.

🔹 Publication Date: Published on Dec 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05774
• PDF: https://arxiv.org/pdf/2512.05774

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#VideoUnderstanding #ActiveLearning #ComputerVision #AIResearch #DeepLearning
Taxonomy-Adaptive Moderation Model with Robust Guardrails for Large Language Models

📝 Summary:
Roblox Guard 1.0 is an instruction fine-tuned LLM that enhances safety through comprehensive input-output moderation. It uses a pipeline of LLMs, generalizes to new safety taxonomies, and performs strongly on out-of-domain benchmarks. A new evaluation benchmark, RobloxGuard-Eval, is also released.

🔹 Publication Date: Published on Dec 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05339
• PDF: https://arxiv.org/pdf/2512.05339

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LLM #AISafety #AI #MachineLearning #NLP
1
From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model

📝 Summary:
The TAD benchmark is introduced to evaluate temporal understanding in autonomous driving, addressing a gap where current VLMs perform poorly. It reveals that state-of-the-art models show substandard accuracy in this domain. Two training-free solutions, Scene-CoT and TCogMap, are proposed, improvi...

🔹 Publication Date: Published on Dec 4

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05277
• PDF: https://arxiv.org/pdf/2512.05277
• Github: https://github.com/vbdi/tad_bench

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AutonomousDriving #VisionLanguageModels #ComputerVision #AIResearch #DeepLearning
3
🤖🧠 Distil-Whisper: Faster, Smaller, and Smarter Speech Recognition by Hugging Face

🗓️ 08 Dec 2025
📚 AI News & Trends

The evolution of Automatic Speech Recognition (ASR) has reshaped how humans interact with technology. From dictation tools and live trannoscription to smart assistants and media captioning, ASR technology continues to bridge the gap between speech and digital communication. However, achieving real-time, high-accuracy trannoscription often comes at the cost of heavy computational requirements until now. Enter ...

#DistilWhisper #FasterSpeechRecognition #SmallerModels #HuggingFace #ASRTechnology #RealTimeTrannoscription
Colon-X: Advancing Intelligent Colonoscopy from Multimodal Understanding to Clinical Reasoning

📝 Summary:
Colon-X introduces ColonR1, a novel reasoning-centric model for intelligent colonoscopy. It achieves 56.61% accuracy, outperforming traditional methods by 25.22% under data scarcity, by leveraging new comprehensive multimodal datasets.

🔹 Publication Date: Published on Dec 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.03667
• PDF: https://arxiv.org/pdf/2512.03667
• Github: https://github.com/ai4colonoscopy/Colon-X

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
DoVer: Intervention-Driven Auto Debugging for LLM Multi-Agent Systems

📝 Summary:
DoVer is an intervention-driven debugging approach for LLM multi-agent systems. It validates failure hypotheses and measures progress via targeted interventions, improving reliability. DoVer converts 18-49% of failed tasks into successes, offering an outcome-oriented debugging method.

🔹 Publication Date: Published on Dec 7

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.06749
• PDF: https://arxiv.org/pdf/2512.06749
• Project Page: https://aka.ms/DoVer

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LLM #MultiAgentSystems #Debugging #AI #Research
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

📝 Summary:
The paper proposes a method to enhance Rotary Position Embeddings by utilizing both the real and imaginary components of the complex-valued dot product, improving long-context modeling in Large Langua...

🔹 Publication Date: Published on Dec 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.07525
• PDF: https://arxiv.org/pdf/2512.07525
• Github: https://github.com/OpenMOSS/rope_pp

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
LongCat-Image Technical Report

📝 Summary:
LongCat-Image is a bilingual open-source foundation model for image generation that addresses multilingual text rendering, photorealism, and deployment efficiency through rigorous data curation, compa...

🔹 Publication Date: Published on Dec 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.07584
• PDF: https://arxiv.org/pdf/2512.07584
• Project Page: https://longcat.chat/
• Github: https://github.com/meituan-longcat/LongCat-Image

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing

📝 Summary:
EgoEdit is a real-time, instruction-following egocentric video editor that addresses challenges in handling egomotion and hand-object interactions, outperforming existing methods on egocentric editing...

🔹 Publication Date: Published on Dec 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.06065
• PDF: https://arxiv.org/pdf/2512.06065
• Project Page: https://snap-research.github.io/EgoEdit/
• Github: https://github.com/snap-research/EgoEdit

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Scaling Zero-Shot Reference-to-Video Generation

📝 Summary:
Saber is a scalable zero-shot framework for reference-to-video generation that uses video-text pairs to learn identity-consistent representations and outperforms models trained with explicit reference...

🔹 Publication Date: Published on Dec 7

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.06905
• PDF: https://arxiv.org/pdf/2512.06905
• Project Page: https://franciszzj.github.io/Saber/

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research