DeepSeek – Telegram
DeepSeek
1.1K subscribers
38 photos
32 links
Unravel the mystery of AGI with curiousity. Answer the essential questions with long-termism. https://www.deepseek.com
Download Telegram
🚀 DeepSeek-R1-0528 is here!

🔹 Improved benchmark performance
🔹 Enhanced front-end capabilities
🔹 Reduced hallucinations
🔹 Supports JSON output & function calling

Try it now: https://chat.deepseek.com
🔌 No change to API usage — docs here: https://api-docs.deepseek.com/guides/reasoning_model
🔗 Open-source weights: https://huggingface.co/deepseek-ai/DeepSeek-R1-0528
🐳307👏4🔥3❤‍🔥21👀1🆒1
Introducing DeepSeek-V3.1: our first step toward the agent era! 🚀

🧠 Hybrid inference: Think & Non-Think — one model, two modes
⚡️ Faster thinking: DeepSeek-V3.1-Think reaches answers in less time vs. DeepSeek-R1-0528
🛠️ Stronger agent skills: Post-training boosts tool use and multi-step agent tasks

Try it now — toggle Think/Non-Think via the "DeepThink" button: chat.deepseek.com
🐳42👏21👍1👎1
API Update ⚙️

🔹 deepseek-chat → non-thinking mode
🔹 deepseek-reasoner → thinking mode
🧵 128K context for both
🔌 Anthropic API format supported: api-docs.deepseek.com/guides/anthrop
Strict Function Calling supported in Beta API: api-docs.deepseek.com/guides/anthropic_api
🚀 More API resources, smoother API experience
🤔2🐳211👎1👨‍💻1
Tools & Agents Upgrades 🧰

📈 Better results on SWE / Terminal-Bench
🔍 Stronger multi-step reasoning for complex search tasks
⚡️ Big gains in thinking efficiency
🤯53👎1🔥1🐳1
Model Update 🤖

🔹 V3.1 Base: 840B tokens continued pretraining for long context extension on top of V3
🔹 Tokenizer & chat template updated — new tokenizer config: https://huggingface.co/deepseek-ai/DeepSeek-V3.1/blob/main/tokenizer_config.json
🔗 V3.1 Base Open-source weights: huggingface.co/deepseek-ai/DeepSeek-V3.1-Base
🔗 V3.1 Open-source weights: huggingface.co/deepseek-ai/DeepSeek-V3.1
🐳91🖕1
Pricing Changes 💳

🔹 New pricing starts & off-peak discounts end at Sep 5th, 2025, 16:00 (UTC Time)
🔹 Until then, APIs follow current pricing
📝 Pricing page: https://api-docs.deepseek.com/quick_start/pricing/
😱162🐳2👎1
🚀 DeepSeek-V3.1 → DeepSeek-V3.1-Terminus
The latest update builds on V3.1’s strengths while addressing key user feedback.

What’s improved?
🌐 Language consistency: fewer CN/EN mix-ups & no more random chars.
🤖 Agent upgrades: stronger Code Agent & Search Agent performance.
👍5🐳5🤩2
📊 DeepSeek-V3.1-Terminus delivers more stable & reliable outputs across benchmarks compared to the previous version.

👉 Available now on: App / Web / API
🔗 Open-source weights here: https://huggingface.co/deepseek-ai/DeepSeek-V3.1-Terminus

Thanks to everyone for your feedback. It drives us to keep improving and refining the experience! 🚀
🐳9🔥7
🚀 Introducing DeepSeek-V3.2-Exp — our latest experimental model!

Built on V3.1-Terminus, it debuts DeepSeek Sparse Attention(DSA) for faster, more efficient training & inference on long context.
👉 Now live on App, Web, and API.
💰 API prices cut by 50%+!
🐳7👍2
⚡️ Efficiency Gains

🤖 DSA achieves fine-grained sparse attention with minimal impact on output quality — boosting long-context performance & reducing compute cost.
📊 Benchmarks show V3.2-Exp performs on par with V3.1-Terminus.
🐳4🔥3
💻 API Update

🎉 Lower costs, same access!
💰 DeepSeek API prices drop 50%+, effective immediately.

🔹For comparison testing, V3.1-Terminus remains available via a temporary API until Oct 15th, 2025, 15:59 (UTC Time). Details: https://api-docs.deepseek.com/guides/comparison_testing
🔹Feedback welcome: https://feedback.deepseek.com/dsa
🐳6🔥2
🛠 Open Source Release

Model: https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Exp
Tech report: https://github.com/deepseek-ai/DeepSeek-V3.2-Exp/blob/main/DeepSeek_V3_2.pdf
Key GPU kernels in TileLang & CUDA (use TileLang for rapid research prototyping!)
🐳121🔥1
⚠️ Heads-up to anyone using the DeepSeek-V3.2-Exp inference demo: earlier versions had a RoPE implementation mismatch in the indexer module that could degrade performance. Indexer RoPE expects non-interleaved input, MLA RoPE expects interleaved. Fixed in https://github.com/deepseek-ai/DeepSeek-V3.2-Exp/tree/main/inference
🐳121👍1🔥1