API Update ⚙️
🔹 deepseek-chat → non-thinking mode
🔹 deepseek-reasoner → thinking mode
🧵 128K context for both
🔌 Anthropic API format supported: api-docs.deepseek.com/guides/anthrop…
✅ Strict Function Calling supported in Beta API: api-docs.deepseek.com/guides/anthropic_api
🚀 More API resources, smoother API experience
🔹 deepseek-chat → non-thinking mode
🔹 deepseek-reasoner → thinking mode
🧵 128K context for both
🔌 Anthropic API format supported: api-docs.deepseek.com/guides/anthrop…
✅ Strict Function Calling supported in Beta API: api-docs.deepseek.com/guides/anthropic_api
🚀 More API resources, smoother API experience
🤔2🐳2✍1❤1👎1👨💻1
Tools & Agents Upgrades 🧰
📈 Better results on SWE / Terminal-Bench
🔍 Stronger multi-step reasoning for complex search tasks
⚡️ Big gains in thinking efficiency
📈 Better results on SWE / Terminal-Bench
🔍 Stronger multi-step reasoning for complex search tasks
⚡️ Big gains in thinking efficiency
🤯5❤3👎1🔥1🐳1
Model Update 🤖
🔹 V3.1 Base: 840B tokens continued pretraining for long context extension on top of V3
🔹 Tokenizer & chat template updated — new tokenizer config: https://huggingface.co/deepseek-ai/DeepSeek-V3.1/blob/main/tokenizer_config.json
🔗 V3.1 Base Open-source weights: huggingface.co/deepseek-ai/DeepSeek-V3.1-Base
🔗 V3.1 Open-source weights: huggingface.co/deepseek-ai/DeepSeek-V3.1
🔹 V3.1 Base: 840B tokens continued pretraining for long context extension on top of V3
🔹 Tokenizer & chat template updated — new tokenizer config: https://huggingface.co/deepseek-ai/DeepSeek-V3.1/blob/main/tokenizer_config.json
🔗 V3.1 Base Open-source weights: huggingface.co/deepseek-ai/DeepSeek-V3.1-Base
🔗 V3.1 Open-source weights: huggingface.co/deepseek-ai/DeepSeek-V3.1
🐳9❤1🖕1
Pricing Changes 💳
🔹 New pricing starts & off-peak discounts end at Sep 5th, 2025, 16:00 (UTC Time)
🔹 Until then, APIs follow current pricing
📝 Pricing page: https://api-docs.deepseek.com/quick_start/pricing/
🔹 New pricing starts & off-peak discounts end at Sep 5th, 2025, 16:00 (UTC Time)
🔹 Until then, APIs follow current pricing
📝 Pricing page: https://api-docs.deepseek.com/quick_start/pricing/
😱16❤2🐳2👎1
🚀 DeepSeek-V3.1 → DeepSeek-V3.1-Terminus
The latest update builds on V3.1’s strengths while addressing key user feedback.
✨ What’s improved?
🌐 Language consistency: fewer CN/EN mix-ups & no more random chars.
🤖 Agent upgrades: stronger Code Agent & Search Agent performance.
The latest update builds on V3.1’s strengths while addressing key user feedback.
✨ What’s improved?
🌐 Language consistency: fewer CN/EN mix-ups & no more random chars.
🤖 Agent upgrades: stronger Code Agent & Search Agent performance.
👍5🐳5🤩2
📊 DeepSeek-V3.1-Terminus delivers more stable & reliable outputs across benchmarks compared to the previous version.
👉 Available now on: App / Web / API
🔗 Open-source weights here: https://huggingface.co/deepseek-ai/DeepSeek-V3.1-Terminus
Thanks to everyone for your feedback. It drives us to keep improving and refining the experience! 🚀
👉 Available now on: App / Web / API
🔗 Open-source weights here: https://huggingface.co/deepseek-ai/DeepSeek-V3.1-Terminus
Thanks to everyone for your feedback. It drives us to keep improving and refining the experience! 🚀
🐳9🔥7
🚀 Introducing DeepSeek-V3.2-Exp — our latest experimental model!
✨ Built on V3.1-Terminus, it debuts DeepSeek Sparse Attention(DSA) for faster, more efficient training & inference on long context.
👉 Now live on App, Web, and API.
💰 API prices cut by 50%+!
✨ Built on V3.1-Terminus, it debuts DeepSeek Sparse Attention(DSA) for faster, more efficient training & inference on long context.
👉 Now live on App, Web, and API.
💰 API prices cut by 50%+!
🐳7👍2
⚡️ Efficiency Gains
🤖 DSA achieves fine-grained sparse attention with minimal impact on output quality — boosting long-context performance & reducing compute cost.
📊 Benchmarks show V3.2-Exp performs on par with V3.1-Terminus.
🤖 DSA achieves fine-grained sparse attention with minimal impact on output quality — boosting long-context performance & reducing compute cost.
📊 Benchmarks show V3.2-Exp performs on par with V3.1-Terminus.
🐳4🔥3
💻 API Update
🎉 Lower costs, same access!
💰 DeepSeek API prices drop 50%+, effective immediately.
🔹For comparison testing, V3.1-Terminus remains available via a temporary API until Oct 15th, 2025, 15:59 (UTC Time). Details: https://api-docs.deepseek.com/guides/comparison_testing
🔹Feedback welcome: https://feedback.deepseek.com/dsa
🎉 Lower costs, same access!
💰 DeepSeek API prices drop 50%+, effective immediately.
🔹For comparison testing, V3.1-Terminus remains available via a temporary API until Oct 15th, 2025, 15:59 (UTC Time). Details: https://api-docs.deepseek.com/guides/comparison_testing
🔹Feedback welcome: https://feedback.deepseek.com/dsa
🐳6🔥2
🛠 Open Source Release
⛓ Model: https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Exp
⛓ Tech report: https://github.com/deepseek-ai/DeepSeek-V3.2-Exp/blob/main/DeepSeek_V3_2.pdf
⛓ Key GPU kernels in TileLang & CUDA (use TileLang for rapid research prototyping!)
⛓ Model: https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Exp
⛓ Tech report: https://github.com/deepseek-ai/DeepSeek-V3.2-Exp/blob/main/DeepSeek_V3_2.pdf
⛓ Key GPU kernels in TileLang & CUDA (use TileLang for rapid research prototyping!)
🐳12❤1🔥1
⚠️ Heads-up to anyone using the DeepSeek-V3.2-Exp inference demo: earlier versions had a RoPE implementation mismatch in the indexer module that could degrade performance. Indexer RoPE expects non-interleaved input, MLA RoPE expects interleaved. Fixed in https://github.com/deepseek-ai/DeepSeek-V3.2-Exp/tree/main/inference
🐳12❤1👍1🔥1