ML Research Hub – Telegram
ML Research Hub
32.9K subscribers
4.45K photos
273 videos
23 files
4.81K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Alterbute: Editing Intrinsic Attributes of Objects in Images

📝 Summary:
Alterbute is a diffusion method for editing intrinsic object attributes like color or shape, while preserving identity and scene context. It uses a relaxed training objective and Visual Named Entities for scalable, identity-preserving supervision, outperforming existing methods.

🔹 Publication Date: Published on Jan 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/pdf/2601.10714
• PDF: https://arxiv.org/pdf/2601.10714
• Project Page: https://talreiss.github.io/alterbute/
• Github: https://talreiss.github.io/alterbute/

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#Alterbute #DiffusionModels #ImageEditing #ComputerVision #AIResearch
VQ-Seg: Vector-Quantized Token Perturbation for Semi-Supervised Medical Image Segmentation

📝 Summary:
VQ-Seg introduces vector quantization to replace dropout with a controllable perturbation module for semi-supervised medical image segmentation. It uses a dual-branch architecture and foundation model guidance to maintain performance. VQ-Seg outperforms state-of-the-art methods on various medical...

🔹 Publication Date: Published on Jan 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.10124
• PDF: https://arxiv.org/pdf/2601.10124
• Project Page: https://github.com/noscript-Yang/VQ-Seg
• Github: https://github.com/noscript-Yang/VQ-Seg

Datasets citing this paper:
https://huggingface.co/datasets/ynoscript/ACDC-PNG

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#MedicalImageSegmentation #SemiSupervisedLearning #VectorQuantization #DeepLearning #ComputerVision
Enhancing Sentiment Classification and Irony Detection in Large Language Models through Advanced Prompt Engineering Techniques

📝 Summary:
This study enhanced LLM sentiment analysis and irony detection through advanced prompt engineering. Different techniques improved performance, but optimal strategies varied by model and task, emphasizing the need for tailored prompt design.

🔹 Publication Date: Published on Jan 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.08302
• PDF: https://arxiv.org/pdf/2601.08302
• Github: https://github.com/Marvin2108/ESCID-LLM-APET

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#PromptEngineering #LLMs #SentimentAnalysis #IronyDetection #NLP
Memory Bank Compression for Continual Adaptation of Large Language Models

📝 Summary:
Memory-augmented continual learning for LLMs faces growing memory bank issues. MBC compresses these banks via codebook optimization and an online resetting mechanism, using Key-Value Low-Rank Adaptation. It reduces bank size to 0.3 percent while maintaining high accuracy.

🔹 Publication Date: Published on Jan 2

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.00756
• PDF: https://arxiv.org/pdf/2601.00756
• Github: https://github.com/Thomkat/MBC

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#LLMs #ContinualLearning #MemoryCompression #MachineLearning #DeepLearning
Agent Skills in the Wild: An Empirical Study of Security Vulnerabilities at Scale

📝 Summary:
A large-scale study of AI agent skills found 26.1% contain widespread vulnerabilities like data exfiltration and privilege escalation. Skills with executable noscripts are twice as likely to be vulnerable, showing an urgent need for security vetting and permission systems.

🔹 Publication Date: Published on Jan 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.10338
• PDF: https://arxiv.org/pdf/2601.10338

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AIAgents #AISecurity #Cybersecurity #VulnerabilityResearch #DataSecurity
Patient-Similarity Cohort Reasoning in Clinical Text-to-SQL

📝 Summary:
CLINSQL is a new benchmark for evaluating text-to-SQL models on complex clinical tasks, including patient similarity, using real EHR data. Current models achieve moderate execution scores but remain far from clinical reliability for real-world EHR analytics.

🔹 Publication Date: Published on Jan 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.09876
• PDF: https://arxiv.org/pdf/2601.09876
• Github: https://github.com/Barryshen1/ClinSQL

Datasets citing this paper:
https://huggingface.co/datasets/yifeis02/ClinSQL

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
V-DPM: 4D Video Reconstruction with Dynamic Point Maps

📝 Summary:
Dynamic Point Maps extended to video input through V-DPM framework achieve state-of-the-art 3D and 4D reconstruction by recovering both dynamic depth and full 3D motion of scene points. AI-generated s...

🔹 Publication Date: Published on Jan 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.09499
• PDF: https://arxiv.org/pdf/2601.09499
• Project Page: https://www.robots.ox.ac.uk/~vgg/research/vdpm/
• Github: https://github.com/eldar/vdpm

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution

📝 Summary:
PACEvolve framework addresses key failure modes in LLM evolutionary search through hierarchical context management, momentum-based backtracking, and adaptive sampling policies for improved self-improv...

🔹 Publication Date: Published on Jan 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.10657
• PDF: https://arxiv.org/pdf/2601.10657

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
RigMo: Unifying Rig and Motion Learning for Generative Animation

📝 Summary:
RigMo unifies rig and motion learning directly from raw mesh sequences, encoding deformations into compact latent spaces. This framework generates interpretable, plausible 3D animation, offering superior reconstruction and generalization over baselines.

🔹 Publication Date: Published on Jan 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.06378
• PDF: https://arxiv.org/pdf/2601.06378
• Project Page: https://rigmo-page.github.io/
• Github: https://rigmo-page.github.io

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Demystifying the Slash Pattern in Attention: The Role of RoPE

📝 Summary:
Slash-Dominant Heads in LLMs emerge when queries and keys are almost rank-one and Rotary Position Embedding has dominant medium-high frequencies. Theoretical proof shows these conditions, combined with gradient descent, explain their emergence.

🔹 Publication Date: Published on Jan 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.08297
• PDF: https://arxiv.org/pdf/2601.08297

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
2
M^4olGen: Multi-Agent, Multi-Stage Molecular Generation under Precise Multi-Property Constraints

📝 Summary:
M4olGen is a multi-agent, multi-stage framework for precise molecular generation under multiple physicochemical constraints. It uses fragment-level, retrieval-augmented reasoning and RL-based optimization, outperforming LLMs and graph-based methods.

🔹 Publication Date: Published on Jan 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.10131
• PDF: https://arxiv.org/pdf/2601.10131

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
1
dots.ocr: Multilingual Document Layout Parsing in a Single Vision-Language Model

📝 Summary:
dots.ocr is a unified Vision-Language Model that jointly learns document layout parsing tasks, overcoming limitations of multi-stage pipelines. It achieves state-of-the-art performance on OmniDocBench and sets a new baseline on the challenging multilingual XDocParse benchmark.

🔹 Publication Date: Published on Dec 2, 2025

🔹 Paper Links:
• arXiv Page: https://arxivexplained.com/papers/dotsocr-multilingual-document-layout-parsing-in-a-single-vision-language-model
• PDF: https://arxiv.org/pdf/2512.02498
• Github: https://github.com/rednote-hilab/dots.ocr

==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT

#VisionLanguageModel #DocumentParsing #MultilingualAI #AIResearch #DeepLearning