
Forwarded from Machine Learning with Python

Data Science Interview questions

#DeepLearning #AI #MachineLearning #NeuralNetworks #DataScience #DataAnalysis #LLM #InterviewQuestions

https://news.1rj.ru/str/CodeProgrammer


334 views · 08:46

ML Research Hub

✨TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers

📝 Summary:
TwinBrainVLA addresses the tension VLMs face in robot control, between general semantic understanding and specialized manipulation skill, by coordinating a frozen generalist VLM (the "Left Brain") with a trainable specialist VLM (the "Right Brain") via an Asymmetric Mixture-of-Transformers. This approach achieves superior manipulation performance while preserving semantic understanding for genera...

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14133
• PDF: https://arxiv.org/pdf/2601.14133
• Github: https://github.com/ZGC-EmbodyAI/TwinBrainVLA

==================================

For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT

#VLM #EmbodiedAI #Robotics #Transformers #AIResearch

286 views · 03:00


✨VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

📝 Summary:
VisGym introduces 17 environments to evaluate VLM performance in multi-step visual interactions. Current models struggle, especially with long contexts and visual symbolic tasks. Explicit goals and demonstrations offer pathways for improvement.

🔹 Publication Date: Published on Jan 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.16973
• PDF: https://arxiv.org/pdf/2601.16973
• Project Page: https://visgym.github.io/

#MultimodalAI #VisualLanguageModels #AIenvironments #ComputerVision #AIResearch

188 views · 03:01


ML Research Hub

✨LongCat-Flash-Thinking-2601 Technical Report

📝 Summary:
LongCat-Flash-Thinking-2601 is a 560B MoE reasoning model that achieves state-of-the-art performance on agentic benchmarks. Its capabilities stem from a unified training framework, robust tool interaction, and a Heavy Thinking mode for complex reasoning.

🔹 Publication Date: Published on Jan 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.16725
• PDF: https://arxiv.org/pdf/2601.16725

#MoE #ReasoningModels #AgentAI #LLM #AI

180 views · 03:01


ML Research Hub

✨Endless Terminals: Scaling RL Environments for Terminal Agents

📝 Summary:
Endless Terminals introduces an autonomous pipeline for generating procedural terminal tasks that significantly improves agent performance on both synthetic and human-curated benchmarks through scalab...

🔹 Publication Date: Published on Jan 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.16443
• PDF: https://arxiv.org/pdf/2601.16443
• Github: https://github.com/kanishkg/endless-terminals

#AI #DataScience #MachineLearning #HuggingFace #Research

171 views · 03:01


ML Research Hub

✨DSGym: A Holistic Framework for Evaluating and Training Data Science Agents

📝 Summary:
DSGym is a standardized framework for evaluating and training data science agents, addressing shortcomings of existing benchmarks. It offers a holistic, data-grounded task suite and enables execution-verified agent training. This allows rigorous measurement of agents' analytical capabilities, dem...

🔹 Publication Date: Published on Jan 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.16344
• PDF: https://arxiv.org/pdf/2601.16344

#DataScience #AI #MachineLearning #AIagents #Research

169 views · 03:01


ML Research Hub

✨Memory-V2V: Augmenting Video-to-Video Diffusion Models with Memory

📝 Summary:
Memory-V2V enhances multi-turn video editing by adding explicit memory to diffusion models. It ensures cross-consistency using efficient token compression and retrieval. This significantly improves video consistency and performance with low computational cost.

🔹 Publication Date: Published on Jan 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.16296
• PDF: https://arxiv.org/pdf/2601.16296
• Project Page: https://dohunlee1.github.io/MemoryV2V

#VideoEditing #DiffusionModels #GenerativeAI #ComputerVision #MachineLearning

184 views · 03:02


ML Research Hub

✨SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents

📝 Summary:
SWE-Pruner is a self-adaptive context pruning framework for coding agents. It performs task-aware adaptive pruning, guided by explicit agent goals and a neural skimmer, to reduce long context token usage by 23-54 percent with minimal performance loss.
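The idea of goal-guided pruning can be illustrated with a toy sketch. This is not SWE-Pruner's implementation: the word-overlap scorer below is a hypothetical stand-in for the neural skimmer, and `prune_context` and `max_lines` are names invented for this illustration.

```python
import re

def tokenize(text):
    # Split on anything that is not a letter, so retry_request -> {"retry", "request"}.
    return set(re.findall(r"[a-z]+", text.lower()))

def prune_context(lines, goal, max_lines):
    """Keep the `max_lines` lines most relevant to the goal, in original order."""
    goal_terms = tokenize(goal)
    # Hypothetical relevance score: number of words shared with the goal.
    scores = [len(tokenize(line) & goal_terms) for line in lines]
    # Rank by score (highest first); ties go to the earlier line.
    ranked = sorted(range(len(lines)), key=lambda i: (-scores[i], i))
    kept = sorted(ranked[:max_lines])
    return [lines[i] for i in kept]

context = [
    "def parse_config(path):",
    "    # load yaml settings",
    "def retry_request(url, attempts):",
    "    # retry the http request with backoff",
    "def render_banner():",
    "    # print ascii art",
]
pruned = prune_context(context, goal="fix the retry logic for http request", max_lines=2)
print("\n".join(pruned))
```

With a budget of two lines, only the two retry-related lines survive; a learned scorer would replace the overlap count in a real system.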

🔹 Publication Date: Published on Jan 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.16746
• PDF: https://arxiv.org/pdf/2601.16746
• Github: https://github.com/Ayanami1314/swe-pruner

#AIAgents #ContextPruning #LLM #AI #SoftwareEngineering

186 views · 03:02


ML Research Hub

✨Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification

📝 Summary:
A self-evolving framework improves Deep Research Agents via inference-time, rubric-guided verification. The method iteratively refines outputs without retraining; its DeepVerifier system achieves 8-11% accuracy gains, and a verification dataset is released alongside it.
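A verify-then-refine loop of this general kind might look like the sketch below. It is a toy under stated assumptions, not the DeepVerifier system: the rubric checks, the `PATCHES` table, and `toy_revise` are all hypothetical, and a real agent would re-prompt a model with the failed rubric items instead of appending canned text.

```python
def verify(answer, rubric):
    """Return the names of the rubric items that the answer fails."""
    return [name for name, check in rubric.items() if not check(answer)]

def self_refine(draft, rubric, revise, max_rounds=3):
    """Verify the answer against the rubric and revise until it passes or rounds run out."""
    answer = draft
    for _ in range(max_rounds):
        failures = verify(answer, rubric)
        if not failures:
            break
        answer = revise(answer, failures)
    return answer, verify(answer, rubric)

# Hypothetical rubric: each item is a named predicate over the answer text.
rubric = {
    "cites_source": lambda a: "[source:" in a,
    "gives_figure": lambda a: any(ch.isdigit() for ch in a),
}

# Hypothetical reviser: patches only the first failed item per round.
PATCHES = {
    "cites_source": " [source: report]",
    "gives_figure": " Growth was 12%.",
}

def toy_revise(answer, failures):
    return answer + PATCHES[failures[0]]

final, remaining = self_refine("The market grew last year.", rubric, toy_revise)
print(final)
print("unmet rubric items:", remaining)
```

No model weights change anywhere in the loop, which is the sense in which the improvement happens at inference time.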

🔹 Publication Date: Published on Jan 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15808
• PDF: https://arxiv.org/pdf/2601.15808

#AI #MachineLearning #DeepLearning #Verification #SelfEvolvingAI

198 views · 04:02


ML Research Hub

✨MeepleLM: A Virtual Playtester Simulating Diverse Subjective Experiences

📝 Summary:
MeepleLM is an AI virtual playtester providing constructive critique for board game design by simulating diverse player experiences. It models subjective feedback via persona-specific reasoning, outperforming commercial AI in critique quality and community alignment.

🔹 Publication Date: Published on Jan 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.07251
• PDF: https://arxiv.org/pdf/2601.07251
• Github: https://github.com/leroy9472/MeepleLM

#AI #GameDesign #BoardGames #Simulation #LLM

181 views · 05:03


ML Research Hub

✨SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer

📝 Summary:
SALAD improves video Diffusion Transformers by combining linear and sparse attention with an input-dependent gating mechanism. It achieves 90% sparsity and a 1.72x speedup while maintaining quality and requiring minimal finetuning data.
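To make the combination concrete, here is a minimal single-query sketch of mixing a sparse attention branch with a linear attention branch through an input-dependent gate. Everything specific is an assumption made for illustration: top-k selection as the sparsity pattern, the `elu(x) + 1` feature map, the sigmoid gate, and the convex mix are common choices, not necessarily the ones SALAD uses.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def softmax(xs):
    top = max(xs)
    exps = [math.exp(x - top) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def sparse_attention(q, keys, values, top_k):
    """Softmax attention restricted to the top_k highest-scoring keys."""
    scores = [dot(q, k) for k in keys]
    keep = sorted(range(len(keys)), key=lambda i: -scores[i])[:top_k]
    weights = softmax([scores[i] for i in keep])
    dim = len(values[0])
    return [sum(w * values[i][d] for w, i in zip(weights, keep)) for d in range(dim)]

def feature_map(x):
    # elu(x) + 1: a positive feature map often used for linear attention.
    return [v + 1.0 if v > 0 else math.exp(v) for v in x]

def linear_attention(q, keys, values):
    """Kernelized attention over all keys, normalized by the sum of kernel weights."""
    fq = feature_map(q)
    weights = [dot(fq, feature_map(k)) for k in keys]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[d] for w, v in zip(weights, values)) / total for d in range(dim)]

def gated_attention(q, keys, values, gate_w, top_k):
    """Mix both branches with a sigmoid gate computed from the query (assumed form)."""
    gate = 1.0 / (1.0 + math.exp(-dot(gate_w, q)))
    sparse_out = sparse_attention(q, keys, values, top_k)
    linear_out = linear_attention(q, keys, values)
    return [gate * s + (1.0 - gate) * l for s, l in zip(sparse_out, linear_out)]

keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [-1.0, 0.5]]
values = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5], [2.0, 2.0]]
out = gated_attention([0.5, -0.5], keys, values, gate_w=[0.3, 0.1], top_k=2)
print(out)
```

The sparse branch touches only `top_k` keys per query, while the linear branch supplies a cheap summary of the rest; the gate decides per input how much of each to use.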

🔹 Publication Date: Published on Jan 23

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.16515
• PDF: https://arxiv.org/pdf/2601.16515

#VideoDiffusion #Transformers #Sparsity #EfficientAI #DeepLearning

245 views · 05:03

ML Research Hub

✨Mecellem Models: Turkish Models Trained from Scratch and Continually Pre-trained for the Legal Domain

📝 Summary:
Mecellem models are a framework for specialized Turkish legal language models. They feature a scratch-trained encoder achieving top retrieval rankings with efficiency, and a continually pre-trained decoder for legal domain adaptation, reducing legal text perplexity.
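For readers unfamiliar with the metric, perplexity is the exponential of the average negative log-probability a model assigns to each token, so lower values mean the text is less surprising to the model. The sketch below uses made-up token probabilities purely for illustration; they are not results from the paper.

```python
import math

def perplexity(token_probs):
    """exp of the mean negative log-probability over the tokens."""
    nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(nll)

# Hypothetical per-token probabilities for the same sentence under two models.
general_model = [0.25, 0.25, 0.25, 0.25]
adapted_model = [0.5, 0.5, 0.5, 0.5]

print(perplexity(general_model))
print(perplexity(adapted_model))
```

A model that assigns every token probability 1/4 has perplexity 4; doubling each probability halves the perplexity.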

🔹 Publication Date: Published on Jan 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.16018
• PDF: https://arxiv.org/pdf/2601.16018
• Project Page: https://huggingface.co/collections/newmindai/mecellem-models
• Github: https://github.com/newmindai/mecellem-models

🔹 Models citing this paper:
• https://huggingface.co/newmindai/Mursit-Base-TR-Retrieval
• https://huggingface.co/newmindai/Mursit-Base
• https://huggingface.co/newmindai/Mursit-Large-TR-Retrieval

🔹 Datasets citing this paper:
• https://huggingface.co/datasets/newmindai/caselaw-retrieval
• https://huggingface.co/datasets/newmindai/contract-retrieval
• https://huggingface.co/datasets/newmindai/regulation-retrieval

🔹 Spaces citing this paper:
• https://huggingface.co/spaces/newmindai/Mizan

#LegalAI #TurkishNLP #LLM #InformationRetrieval #DomainAdaptation

223 views · 06:03


ML Research Hub

✨Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow

📝 Summary:
Quantized RL faces instability using FP8 rollout with BF16 training. Jet-RL proposes a unified FP8 precision for both training and rollout. This minimizes numerical mismatch, achieving stable convergence and significant speedups.
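The numerical mismatch can be seen in a small simulation. This is only a sketch under loud assumptions: `fake_quantize` is a hypothetical helper that rounds to 3 explicit mantissa bits (as in the E4M3 flavor of FP8) while ignoring exponent range and saturation, and ordinary Python floats stand in for the BF16 training pass. It does not reproduce Jet-RL itself.

```python
import math

def fake_quantize(x, mantissa_bits=3):
    """Round x to `mantissa_bits` explicit mantissa bits (exponent limits ignored)."""
    if x == 0.0:
        return 0.0
    m, e = math.frexp(x)              # x = m * 2**e with 0.5 <= |m| < 1
    scale = 2 ** (mantissa_bits + 1)  # one extra bit for the implicit leading 1
    return math.ldexp(round(m * scale) / scale, e)

def softmax(logits):
    top = max(logits)
    exps = [math.exp(v - top) for v in logits]
    total = sum(exps)
    return [v / total for v in exps]

logits = [1.1, 0.3, -0.7]  # hypothetical policy logits for three actions

train_probs = softmax(logits)                                # higher-precision pass
rollout_probs = softmax([fake_quantize(v) for v in logits])  # low-precision rollout

# For truly on-policy data these ratios would all be exactly 1.
ratios = [t / r for t, r in zip(train_probs, rollout_probs)]
print(ratios)
```

When rollout and training run the same weights at different precisions, the ratios drift away from 1, so nominally on-policy data is slightly off-policy; using one precision for both passes removes that source of drift.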

🔹 Publication Date: Published on Jan 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.14243
• PDF: https://arxiv.org/pdf/2601.14243

#AI #DataScience #MachineLearning #HuggingFace #Research

271 views · 07:03


ML Research Hub

✨Guidelines to Prompt Large Language Models for Code Generation: An Empirical Characterization

📝 Summary:
Research derives and evaluates prompt optimization guidelines for code generation tasks in software engineering, identifying 10 specific improvement patterns related to input/output specification, con...

🔹 Publication Date: Published on Jan 19

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13118
• PDF: https://arxiv.org/pdf/2601.13118

#AI #DataScience #MachineLearning #HuggingFace #Research


300 views · 07:04


ML Research Hub

✨Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation

📝 Summary:
LLMs struggle to apply new knowledge effectively via SFT alone. PaST combines SFT with injecting a domain-agnostic Skill Vector, derived from RL, to efficiently transfer reasoning skills. This novel framework significantly improves performance in question answering and tool-use tasks.
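One plausible reading of "injecting a Skill Vector" is weight-space arithmetic, sketched below. The summary does not spell out the recipe, so this is an assumption: the vector is taken as the difference between an RL-tuned checkpoint and its SFT starting point, then added to a different SFT model. Parameter names and values are hypothetical, and plain dicts of floats stand in for real weight tensors.

```python
def extract_skill_vector(theta_rl, theta_sft):
    """Per-parameter difference between the RL-tuned and SFT-only weights."""
    return {name: theta_rl[name] - theta_sft[name] for name in theta_sft}

def inject(theta_target, skill, scale=1.0):
    """Add the (optionally scaled) skill vector to another model's weights."""
    return {name: w + scale * skill.get(name, 0.0) for name, w in theta_target.items()}

# Hypothetical weights from a source domain, before and after RL.
theta_sft = {"w1": 0.5, "w2": -1.0}
theta_rl = {"w1": 0.75, "w2": -0.5}

# Hypothetical weights of a model fine-tuned with SFT on new knowledge.
theta_new_domain = {"w1": 1.0, "w2": 2.0}

skill = extract_skill_vector(theta_rl, theta_sft)
adapted = inject(theta_new_domain, skill)
print(skill)
print(adapted)
```

Under this reading, the new-domain model needs only SFT; the reasoning behavior learned through RL elsewhere is carried over by the added vector.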

🔹 Publication Date: Published on Jan 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.11258
• PDF: https://arxiv.org/pdf/2601.11258

#LLM #ReinforcementLearning #ContinualLearning #AI #MachineLearning


322 views · 09:04


ML Research Hub

✨Dancing in Chains: Strategic Persuasion in Academic Rebuttal via Theory of Mind

📝 Summary:
RebuttalAgent is a novel AI framework that applies Theory of Mind to academic rebuttal. It models reviewer mental states to formulate strategic, persuasive responses, significantly outperforming existing models.

🔹 Publication Date: Published on Jan 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15715
• PDF: https://arxiv.org/pdf/2601.15715

#AI #TheoryOfMind #AcademicRebuttal #NLP #MachineLearning


347 views · 10:04


ML Research Hub

✨GameTalk: Training LLMs for Strategic Conversation

📝 Summary:
The GameTalk framework trains large language models for strategic multi-turn dialogue, optimizing global objectives using whole-conversation reward signals. This approach significantly outperforms untrained models, showing conversational fine-tuning is a promising path for LLM reasoning and negot...

🔹 Publication Date: Published on Jan 22

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.16276
• PDF: https://arxiv.org/pdf/2601.16276

#LLMs #ConversationalAI #StrategicDialogue #AITraining #AIReasoning


337 views · 13:05

ML Research Hub