✨TV2TV: A Unified Framework for Interleaved Language and Video Generation
📝 Summary:
TV2TV is a unified framework for interleaved language and video generation, using a Mixture-of-Transformers. It learns to 'think in words' before 'acting in pixels,' enhancing visual quality, controllability, and prompt alignment. The model shows strong performance on video game and natural video...
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05103
• PDF: https://arxiv.org/pdf/2512.05103
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#VideoGeneration #GenerativeAI #MultimodalAI #Transformers #AI
📝 Summary:
TV2TV is a unified framework for interleaved language and video generation, using a Mixture-of-Transformers. It learns to 'think in words' before 'acting in pixels,' enhancing visual quality, controllability, and prompt alignment. The model shows strong performance on video game and natural video...
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05103
• PDF: https://arxiv.org/pdf/2512.05103
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#VideoGeneration #GenerativeAI #MultimodalAI #Transformers #AI
✨DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle
📝 Summary:
DAComp is a benchmark with 210 tasks for data engineering and analysis workflows. It reveals significant deficiencies in state-of-the-art agents, with success rates under 20% for engineering and below 40% for analysis, highlighting critical gaps.
🔹 Publication Date: Published on Dec 3
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04324
• PDF: https://arxiv.org/pdf/2512.04324
• Project Page: https://da-comp.github.io/
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#DataAgents #Benchmarking #DataEngineering #DataAnalysis #AIResearch
📝 Summary:
DAComp is a benchmark with 210 tasks for data engineering and analysis workflows. It reveals significant deficiencies in state-of-the-art agents, with success rates under 20% for engineering and below 40% for analysis, highlighting critical gaps.
🔹 Publication Date: Published on Dec 3
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04324
• PDF: https://arxiv.org/pdf/2512.04324
• Project Page: https://da-comp.github.io/
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#DataAgents #Benchmarking #DataEngineering #DataAnalysis #AIResearch
✨On GRPO Collapse in Search-R1: The Lazy Likelihood-Displacement Death Spiral
📝 Summary:
GRPO in tool-integrated RL collapses due to Lazy Likelihood Displacement LLD, a systematic drop in response likelihoods. LLDS regularization addresses this by preserving likelihoods, stabilizing training, preventing gradient explosion, and substantially improving performance.
🔹 Publication Date: Published on Dec 3
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04220
• PDF: https://arxiv.org/pdf/2512.04220
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#ReinforcementLearning #MachineLearning #AI #DeepLearning #AIResearch
📝 Summary:
GRPO in tool-integrated RL collapses due to Lazy Likelihood Displacement LLD, a systematic drop in response likelihoods. LLDS regularization addresses this by preserving likelihoods, stabilizing training, preventing gradient explosion, and substantially improving performance.
🔹 Publication Date: Published on Dec 3
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04220
• PDF: https://arxiv.org/pdf/2512.04220
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#ReinforcementLearning #MachineLearning #AI #DeepLearning #AIResearch
❤1
Media is too big
VIEW IN TELEGRAM
✨Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
📝 Summary:
Stable Video Infinity SVI generates infinite-length videos with high consistency and controllable stories. It introduces Error-Recycling Fine-Tuning, teaching the Diffusion Transformer to correct its self-generated errors and address the training-test discrepancy.
🔹 Publication Date: Published on Oct 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.09212
• PDF: https://arxiv.org/pdf/2510.09212
• Project Page: https://stable-video-infinity.github.io/homepage/
• Github: https://github.com/vita-epfl/Stable-Video-Infinity
🔹 Models citing this paper:
• https://huggingface.co/vita-video-gen/svi-model
✨ Datasets citing this paper:
• https://huggingface.co/datasets/vita-video-gen/svi-benchmark
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#VideoGeneration #AI #DiffusionModels #DeepLearning #ComputerVision
📝 Summary:
Stable Video Infinity SVI generates infinite-length videos with high consistency and controllable stories. It introduces Error-Recycling Fine-Tuning, teaching the Diffusion Transformer to correct its self-generated errors and address the training-test discrepancy.
🔹 Publication Date: Published on Oct 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.09212
• PDF: https://arxiv.org/pdf/2510.09212
• Project Page: https://stable-video-infinity.github.io/homepage/
• Github: https://github.com/vita-epfl/Stable-Video-Infinity
🔹 Models citing this paper:
• https://huggingface.co/vita-video-gen/svi-model
✨ Datasets citing this paper:
• https://huggingface.co/datasets/vita-video-gen/svi-benchmark
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#VideoGeneration #AI #DiffusionModels #DeepLearning #ComputerVision
This media is not supported in your browser
VIEW IN TELEGRAM
✨PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing
📝 Summary:
PaperDebugger is an in-editor, multi-agent academic writing assistant that integrates large language models directly into LaTeX environments. It allows deep interaction with document state and revision history for enhanced writing, review, and editing workflows.
🔹 Publication Date: Published on Dec 2
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.02589
• PDF: https://arxiv.org/pdf/2512.02589
• Project Page: https://www.paperdebugger.com/
• Github: https://github.com/PaperDebugger/PaperDebugger
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AcademicWriting #LLM #MultiAgentSystems #ResearchTools #AI
📝 Summary:
PaperDebugger is an in-editor, multi-agent academic writing assistant that integrates large language models directly into LaTeX environments. It allows deep interaction with document state and revision history for enhanced writing, review, and editing workflows.
🔹 Publication Date: Published on Dec 2
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.02589
• PDF: https://arxiv.org/pdf/2512.02589
• Project Page: https://www.paperdebugger.com/
• Github: https://github.com/PaperDebugger/PaperDebugger
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AcademicWriting #LLM #MultiAgentSystems #ResearchTools #AI
Media is too big
VIEW IN TELEGRAM
✨DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling
📝 Summary:
DynamicVerse introduces a 4D world modeling framework for dynamic real-world videos, overcoming existing dataset limitations. It integrates large vision, geometric, and multimodal models to create a vast dataset with metric-scale annotations. This approach achieves superior performance in depth, ...
🔹 Publication Date: Published on Dec 2
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.03000
• PDF: https://arxiv.org/pdf/2512.03000
• Project Page: https://dynamic-verse.github.io/
• Github: https://dynamic-verse.github.io/
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#4DModeling #MultimodalAI #ComputerVision #DeepLearning #AIResearch
📝 Summary:
DynamicVerse introduces a 4D world modeling framework for dynamic real-world videos, overcoming existing dataset limitations. It integrates large vision, geometric, and multimodal models to create a vast dataset with metric-scale annotations. This approach achieves superior performance in depth, ...
🔹 Publication Date: Published on Dec 2
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.03000
• PDF: https://arxiv.org/pdf/2512.03000
• Project Page: https://dynamic-verse.github.io/
• Github: https://dynamic-verse.github.io/
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#4DModeling #MultimodalAI #ComputerVision #DeepLearning #AIResearch
✨DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation
📝 Summary:
DraCo is a novel text-to-image generation method that uses interleaved reasoning with both textual and visual content. It generates low-resolution drafts, verifies semantic alignment, and refines images to address coarse textual planning and rare attribute generation. DraCo significantly outperfo...
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05112
• PDF: https://arxiv.org/pdf/2512.05112
• Github: https://github.com/CaraJ7/DraCo
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#TextToImage #GenerativeAI #DeepLearning #ComputerVision #AI
📝 Summary:
DraCo is a novel text-to-image generation method that uses interleaved reasoning with both textual and visual content. It generates low-resolution drafts, verifies semantic alignment, and refines images to address coarse textual planning and rare attribute generation. DraCo significantly outperfo...
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05112
• PDF: https://arxiv.org/pdf/2512.05112
• Github: https://github.com/CaraJ7/DraCo
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#TextToImage #GenerativeAI #DeepLearning #ComputerVision #AI
✨BulletTime: Decoupled Control of Time and Camera Pose for Video Generation
📝 Summary:
This paper presents a video diffusion framework that decouples scene dynamics from camera pose. This enables precise 4D control over time and viewpoint for high-quality video generation, outperforming prior models in controllability.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05076
• PDF: https://arxiv.org/pdf/2512.05076
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#VideoGeneration #DiffusionModels #GenerativeAI #ComputerVision #AICameraControl
📝 Summary:
This paper presents a video diffusion framework that decouples scene dynamics from camera pose. This enables precise 4D control over time and viewpoint for high-quality video generation, outperforming prior models in controllability.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05076
• PDF: https://arxiv.org/pdf/2512.05076
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#VideoGeneration #DiffusionModels #GenerativeAI #ComputerVision #AICameraControl
✨Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length
📝 Summary:
Live Avatar uses a 14-billion-parameter diffusion model to achieve real-time, high-fidelity, infinite-length audio-driven avatar generation. It employs Timestep-forcing Pipeline Parallelism and Rolling Sink Frame Mechanism for efficiency and consistency, reaching 20 FPS on 5 H800 GPUs.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04677
• PDF: https://arxiv.org/pdf/2512.04677
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#LiveAvatar #GenerativeAI #RealtimeAI #DiffusionModels #AvatarGeneration
📝 Summary:
Live Avatar uses a 14-billion-parameter diffusion model to achieve real-time, high-fidelity, infinite-length audio-driven avatar generation. It employs Timestep-forcing Pipeline Parallelism and Rolling Sink Frame Mechanism for efficiency and consistency, reaching 20 FPS on 5 H800 GPUs.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04677
• PDF: https://arxiv.org/pdf/2512.04677
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#LiveAvatar #GenerativeAI #RealtimeAI #DiffusionModels #AvatarGeneration
✨NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation
📝 Summary:
Standard diffusion corrupts image phase, destroying spatial structure. This paper introduces Phase-Preserving Diffusion phi-PD to preserve phase, enabling structure-aligned generation for tasks like re-rendering. It adds no cost and improves sim-to-real enhancement significantly.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05106
• PDF: https://arxiv.org/pdf/2512.05106
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#DiffusionModels #GenerativeAI #ComputerVision #DeepLearning #AIResearch
📝 Summary:
Standard diffusion corrupts image phase, destroying spatial structure. This paper introduces Phase-Preserving Diffusion phi-PD to preserve phase, enabling structure-aligned generation for tasks like re-rendering. It adds no cost and improves sim-to-real enhancement significantly.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05106
• PDF: https://arxiv.org/pdf/2512.05106
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#DiffusionModels #GenerativeAI #ComputerVision #DeepLearning #AIResearch
✨REFLEX: Self-Refining Explainable Fact-Checking via Disentangling Truth into Style and Substance
📝 Summary:
REFLEX is a new fact-checking method that uses internal model knowledge to improve verdict accuracy and explanation quality. It disentangles truth into style and substance via adaptive activation signals, achieving state-of-the-art performance with minimal training data. This approach also shows ...
🔹 Publication Date: Published on Nov 25
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.20233
• PDF: https://arxiv.org/pdf/2511.20233
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#FactChecking #ExplainableAI #MachineLearning #AI #NLP
📝 Summary:
REFLEX is a new fact-checking method that uses internal model knowledge to improve verdict accuracy and explanation quality. It disentangles truth into style and substance via adaptive activation signals, achieving state-of-the-art performance with minimal training data. This approach also shows ...
🔹 Publication Date: Published on Nov 25
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.20233
• PDF: https://arxiv.org/pdf/2511.20233
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#FactChecking #ExplainableAI #MachineLearning #AI #NLP
This media is not supported in your browser
VIEW IN TELEGRAM
✨EgoLCD: Egocentric Video Generation with Long Context Diffusion
📝 Summary:
EgoLCD addresses content drift in long egocentric video generation by integrating long-term sparse and attention-based short-term memory with narrative prompting. It achieves state-of-the-art perceptual quality and temporal consistency, mitigating generative forgetting.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04515
• PDF: https://arxiv.org/pdf/2512.04515
• Project Page: https://aigeeksgroup.github.io/EgoLCD/
• Github: https://github.com/AIGeeksGroup/EgoLCD
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #VideoGeneration #DiffusionModels #ComputerVision #EgocentricVision
📝 Summary:
EgoLCD addresses content drift in long egocentric video generation by integrating long-term sparse and attention-based short-term memory with narrative prompting. It achieves state-of-the-art perceptual quality and temporal consistency, mitigating generative forgetting.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04515
• PDF: https://arxiv.org/pdf/2512.04515
• Project Page: https://aigeeksgroup.github.io/EgoLCD/
• Github: https://github.com/AIGeeksGroup/EgoLCD
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #VideoGeneration #DiffusionModels #ComputerVision #EgocentricVision
👍1
✨ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning
📝 Summary:
ARM-Thinker is an agentic reward model that uses external tools like image cropping and document retrieval to verify judgments in multimodal reasoning tasks. This significantly improves accuracy, interpretability, and visual grounding compared to existing reward models, achieving substantial perf...
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05111
• PDF: https://arxiv.org/pdf/2512.05111
• Project Page: https://github.com/InternLM/ARM-Thinker
• Github: https://github.com/open-compass/VLMEvalKit/pull/1334
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#MultimodalAI #AgenticAI #RewardModels #VisualReasoning #AIResearch
📝 Summary:
ARM-Thinker is an agentic reward model that uses external tools like image cropping and document retrieval to verify judgments in multimodal reasoning tasks. This significantly improves accuracy, interpretability, and visual grounding compared to existing reward models, achieving substantial perf...
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05111
• PDF: https://arxiv.org/pdf/2512.05111
• Project Page: https://github.com/InternLM/ARM-Thinker
• Github: https://github.com/open-compass/VLMEvalKit/pull/1334
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#MultimodalAI #AgenticAI #RewardModels #VisualReasoning #AIResearch
✨QKAN-LSTM: Quantum-inspired Kolmogorov-Arnold Long Short-term Memory
📝 Summary:
QKAN-LSTM is a quantum-inspired LSTM that integrates Data Re-Uploading Activation modules. This model achieves superior predictive accuracy and generalization with 79% fewer parameters than classical LSTMs. It offers a scalable, interpretable approach for sequential modeling on classical hardware.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05049
• PDF: https://arxiv.org/pdf/2512.05049
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#QKANLSTM #QuantumInspiredAI #DeepLearning #MachineLearning #DataScience
📝 Summary:
QKAN-LSTM is a quantum-inspired LSTM that integrates Data Re-Uploading Activation modules. This model achieves superior predictive accuracy and generalization with 79% fewer parameters than classical LSTMs. It offers a scalable, interpretable approach for sequential modeling on classical hardware.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05049
• PDF: https://arxiv.org/pdf/2512.05049
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#QKANLSTM #QuantumInspiredAI #DeepLearning #MachineLearning #DataScience
❤1
✨Mitigating Object and Action Hallucinations in Multimodal LLMs via Self-Augmented Contrastive Alignment
📝 Summary:
The SANTA framework addresses object and action hallucinations in multimodal LLM video captions. It uses self-augmented contrastive alignment to identify potential hallucinations and then aligns regional objects and actions with visual phrases, improving factual accuracy. Experiments show SANTA o...
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04356
• PDF: https://arxiv.org/pdf/2512.04356
• Project Page: https://kpc0810.github.io/santa/
• Github: https://kpc0810.github.io/santa/
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#MultimodalLLMs #AI #Hallucinations #VideoUnderstanding #ContrastiveLearning
📝 Summary:
The SANTA framework addresses object and action hallucinations in multimodal LLM video captions. It uses self-augmented contrastive alignment to identify potential hallucinations and then aligns regional objects and actions with visual phrases, improving factual accuracy. Experiments show SANTA o...
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04356
• PDF: https://arxiv.org/pdf/2512.04356
• Project Page: https://kpc0810.github.io/santa/
• Github: https://kpc0810.github.io/santa/
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#MultimodalLLMs #AI #Hallucinations #VideoUnderstanding #ContrastiveLearning
✨LATTICE: Democratize High-Fidelity 3D Generation at Scale
📝 Summary:
LATTICE is a framework for high-fidelity 3D generation using VoxSet, a compact semi-structured representation. It employs a two-stage pipeline with a rectified flow transformer, achieving efficient, scalable, and high-quality 3D creation.
🔹 Publication Date: Published on Nov 24
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.03052
• PDF: https://arxiv.org/pdf/2512.03052
• Project Page: https://lattice3d.github.io/
• Github: https://github.com/Zeqiang-Lai/LATTICE
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#3DGeneration #AI #DeepLearning #ComputerGraphics #GenerativeAI
📝 Summary:
LATTICE is a framework for high-fidelity 3D generation using VoxSet, a compact semi-structured representation. It employs a two-stage pipeline with a rectified flow transformer, achieving efficient, scalable, and high-quality 3D creation.
🔹 Publication Date: Published on Nov 24
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.03052
• PDF: https://arxiv.org/pdf/2512.03052
• Project Page: https://lattice3d.github.io/
• Github: https://github.com/Zeqiang-Lai/LATTICE
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#3DGeneration #AI #DeepLearning #ComputerGraphics #GenerativeAI
❤1
✨UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers
📝 Summary:
UltraImage tackles content repetition and quality degradation in high-resolution image generation by correcting dominant frequency periodicity and applying entropy-guided attention. It achieves extreme extrapolation, producing high-fidelity images up to 6Kx6K without low-resolution guidance.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04504
• PDF: https://arxiv.org/pdf/2512.04504
• Project Page: https://thu-ml.github.io/ultraimage.github.io/
• Github: https://thu-ml.github.io/ultraimage.github.io/
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#ImageGeneration #DiffusionModels #Transformers #HighResolution #DeepLearning
📝 Summary:
UltraImage tackles content repetition and quality degradation in high-resolution image generation by correcting dominant frequency periodicity and applying entropy-guided attention. It achieves extreme extrapolation, producing high-fidelity images up to 6Kx6K without low-resolution guidance.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04504
• PDF: https://arxiv.org/pdf/2512.04504
• Project Page: https://thu-ml.github.io/ultraimage.github.io/
• Github: https://thu-ml.github.io/ultraimage.github.io/
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#ImageGeneration #DiffusionModels #Transformers #HighResolution #DeepLearning
✨Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models
📝 Summary:
LVLM-based text-to-image models exhibit greater social bias than non-LVLM models, with system prompts identified as the key driver. The paper introduces FairPro, a training-free meta-prompting framework that significantly reduces demographic bias while maintaining text-image alignment.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04981
• PDF: https://arxiv.org/pdf/2512.04981
• Github: https://github.com/nahyeonkaty/fairpro
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AIBias #TextToImage #LVLMs #PromptEngineering #AIFairness
📝 Summary:
LVLM-based text-to-image models exhibit greater social bias than non-LVLM models, with system prompts identified as the key driver. The paper introduces FairPro, a training-free meta-prompting framework that significantly reduces demographic bias while maintaining text-image alignment.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04981
• PDF: https://arxiv.org/pdf/2512.04981
• Github: https://github.com/nahyeonkaty/fairpro
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AIBias #TextToImage #LVLMs #PromptEngineering #AIFairness
✨Generative Neural Video Compression via Video Diffusion Prior
📝 Summary:
GNVC-VD is a new DiT-based generative video compression framework. It combines spatio-temporal latent compression and sequence-level generative refinement with a video diffusion transformer to enhance perceptual quality and eliminate flickering artifacts, outperforming prior methods.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05016
• PDF: https://arxiv.org/pdf/2512.05016
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
GNVC-VD is a new DiT-based generative video compression framework. It combines spatio-temporal latent compression and sequence-level generative refinement with a video diffusion transformer to enhance perceptual quality and eliminate flickering artifacts, outperforming prior methods.
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.05016
• PDF: https://arxiv.org/pdf/2512.05016
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
👍1
✨Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates
📝 Summary:
Source-Shielded Updates (SSU) enables the adaptation of instruct LLMs to new languages using only unlabeled data, preserving source knowledge and achieving competitive target-language performance. AI-...
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04844
• PDF: https://arxiv.org/pdf/2512.04844
• Github: https://github.com/gucci-j/ssu
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#LLM #NLP #MachineLearning #CatastrophicForgetting #MultilingualAI
📝 Summary:
Source-Shielded Updates (SSU) enables the adaptation of instruct LLMs to new languages using only unlabeled data, preserving source knowledge and achieving competitive target-language performance. AI-...
🔹 Publication Date: Published on Dec 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.04844
• PDF: https://arxiv.org/pdf/2512.04844
• Github: https://github.com/gucci-j/ssu
==================================
For more data science resources:
✓ https://news.1rj.ru/str/DataScienceT
#LLM #NLP #MachineLearning #CatastrophicForgetting #MultilingualAI
This channels is for Programmers, Coders, Software Engineers.
0️⃣ Python
1️⃣ Data Science
2️⃣ Machine Learning
3️⃣ Data Visualization
4️⃣ Artificial Intelligence
5️⃣ Data Analysis
6️⃣ Statistics
7️⃣ Deep Learning
8️⃣ programming Languages
✅ https://news.1rj.ru/str/addlist/8_rRW2scgfRhOTc0
✅ https://news.1rj.ru/str/Codeprogrammer
Please open Telegram to view this post
VIEW IN TELEGRAM
❤2