ML Research Hub – Telegram
ML Research Hub
32.7K subscribers
4.03K photos
230 videos
23 files
4.34K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
🔹 Title: Advances in Speech Separation: Techniques, Challenges, and Future Trends

🔹 Publication Date: Published on Aug 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/pdf/2508.10830
• PDF: https://arxiv.org/pdf/2508.10830
• Project Page: https://cslikai.cn/Speech-Separation-Paper-Tutorial
• Github: https://github.com/JusperLee/Speech-Separation-Paper-Tutorial

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Copyright Protection for Large Language Models: A Survey of Methods, Challenges, and Trends

🔹 Publication Date: Published on Aug 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.11548
• PDF: https://arxiv.org/pdf/2508.11548
• Github: https://xuzhenhua55.github.io/awesome-llm-copyright-protection/index.html

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: CorrSteer: Steering Improves Task Performance and Safety in LLMs through Correlation-based Sparse Autoencoder Feature Selection

🔹 Publication Date: Published on Aug 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.12535
• PDF: https://arxiv.org/pdf/2508.12535

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Radiance Fields in XR: A Survey on How Radiance Fields are Envisioned and Addressed for XR Research

🔹 Publication Date: Published on Aug 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.04326
• PDF: https://arxiv.org/pdf/2508.04326
• Project Page: https://mediated-reality.github.io/rf4xr/papers/li_tvcg25/
• Github: https://github.com/mediated-reality/awesome-rf4xr

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents

🔹 Publication Date: Published on Aug 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.04038
• PDF: https://arxiv.org/pdf/2508.04038
• Github: https://github.com/zechenli03/ZARA

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Evaluating Podcast Recommendations with Profile-Aware LLM-as-a-Judge

🔹 Publication Date: Published on Aug 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.08777
• PDF: https://arxiv.org/pdf/2508.08777

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: MedSAMix: A Training-Free Model Merging Approach for Medical Image Segmentation

🔹 Publication Date: Published on Aug 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.11032
• PDF: https://arxiv.org/pdf/2508.11032
• Github: https://github.com/podismine/MedSAMix

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Semantic IDs for Joint Generative Search and Recommendation

🔹 Publication Date: Published on Aug 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.10478
• PDF: https://arxiv.org/pdf/2508.10478

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations

🔹 Publication Date: Published on Aug 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.09789
• PDF: https://arxiv.org/pdf/2508.09789

🔹 Datasets citing this paper:
https://huggingface.co/datasets/marcodena/video-recs-describe-what-you-see

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation

🔹 Publication Date: Published on Aug 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.13998
• PDF: https://arxiv.org/pdf/2508.13998

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Beyond Human Judgment: A Bayesian Evaluation of LLMs' Moral Values Understanding

🔹 Publication Date: Published on Aug 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.13804
• PDF: https://arxiv.org/pdf/2508.13804
• Project Page: https://maciejskorski.github.io/moral-foundations-llm-eval
• Github: https://github.com/maciejskorski/moral-foundations-llm-eval

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Mind the Generation Process: Fine-Grained Confidence Estimation During LLM Generation

🔹 Publication Date: Published on Aug 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.12040
• PDF: https://arxiv.org/pdf/2508.12040

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: A Stitch in Time Saves Nine: Proactive Self-Refinement for Language Models

🔹 Publication Date: Published on Aug 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.12903
• PDF: https://arxiv.org/pdf/2508.12903

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents

🔹 Publication Date: Published on Aug 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.13186
• PDF: https://arxiv.org/pdf/2508.13186
• Github: https://github.com/MMBrowseComp/MM-BrowseComp

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
1
🔹 Title: CAMAR: Continuous Actions Multi-Agent Routing

🔹 Publication Date: Published on Aug 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.12845
• PDF: https://arxiv.org/pdf/2508.12845

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Atom-Searcher: Enhancing Agentic Deep Research via Fine-Grained Atomic Thought Reward

🔹 Publication Date: Published on Aug 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.12800
• PDF: https://arxiv.org/pdf/2508.12800

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
1
🔥 $10.000 WITH LISA!

Lisa earned $200,000 in a month, and now it’s YOUR TURN!

She’s made trading SO SIMPLE that anyone can do it.

❗️Just copy her signals every day
❗️Follow her trades step by step
❗️Earn $1,000+ in your first week – GUARANTEED!

🚨 BONUS: Lisa is giving away $10,000 to her subscribers!

Don’t miss this once-in-a-lifetime opportunity. Free access for the first 500 people only!

👉 CLICK HERE TO JOIN NOW 👈
🔹 Title: Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report

🔹 Publication Date: Published on Aug 1

🔹 Abstract: Foundation-Sec-8B-Instruct is a cybersecurity-focused LLM designed for chat-style interactions and instruction-following, outperforming other models in cybersecurity tasks while matching their instruction-following capabilities. AI-generated summary Large language models ( LLMs ) have shown remarkable success across many domains, yet their integration into cybersecurity applications remains limited due to a lack of general-purpose cybersecurity data, representational complexity, and safety and regulatory concerns. To address this gap, we previously introduced Foundation-Sec-8B , a cybersecurity -focused LLM suitable for fine-tuning on downstream tasks. That model, however, was not designed for chat-style interactions or instruction-following . In this report, we release Foundation-Sec-8B -Instruct: a model specifically trained for general-purpose cybersecurity dialogue . Built on Foundation-Sec-8B , it combines domain-specific knowledge with instruction-following , conversational capabilities , and alignment with human preferences to produce high-quality, relevant responses. Comprehensive evaluations show that Foundation-Sec-8B -Instruct outperforms Llama 3.1-8B-Instruct on a range of cybersecurity tasks while matching its instruction-following performance. It is also competitive with GPT-4o-mini on cyber threat intelligence and instruction-following tasks. We envision Foundation-Sec-8B -Instruct becoming an indispensable assistant in the daily workflows of cybersecurity professionals. We release the model publicly at https://huggingface.co/fdtn-ai/ Foundation-Sec-8B -Instruct.

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.01059

• PDF: https://arxiv.org/pdf/2508.01059

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Rapidly Adapting to New Voice Spoofing: Few-Shot Detection of Synthesized Speech Under Distribution Shifts

🔹 Publication Date: Published on Aug 18

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.13320
• PDF: https://arxiv.org/pdf/2508.13320

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: Retrieval-augmented reasoning with lean language models

🔹 Publication Date: Published on Aug 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.11386
• PDF: https://arxiv.org/pdf/2508.11386

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT
🔹 Title: StrandDesigner: Towards Practical Strand Generation with Sketch Guidance

🔹 Publication Date: Published on Aug 3

🔹 Abstract: A sketch-based strand generation model using a learnable upsampling strategy and multi-scale adaptive conditioning mechanism outperforms existing methods in realism and precision for hair strand generation. AI-generated summary Realistic hair strand generation is crucial for applications like computer graphics and virtual reality. While diffusion models can generate hairstyles from text or images, these inputs lack precision and user-friendliness. Instead, we propose the first sketch-based strand generation model, which offers finer control while remaining user-friendly. Our framework tackles key challenges, such as modeling complex strand interactions and diverse sketch patterns, through two main innovations: a learnable strand upsampling strategy that encodes 3D strands into multi-scale latent spaces , and a multi-scale adaptive conditioning mechanism using a transformer with diffusion heads to ensure consistency across granularity levels. Experiments on several benchmark datasets show our method outperforms existing approaches in realism and precision. Qualitative results further confirm its effectiveness. Code will be released at [GitHub](https://github.com/fighting-Zhang/StrandDesigner).

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.01650

• PDF: https://arxiv.org/pdf/2508.01650

• Github: https://github.com/fighting-Zhang/StrandDesigner

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://news.1rj.ru/str/DataScienceT