NEW BOT Телеграм, страница

DeepSeek

API Update ⚙️

🔹 deepseek-chat → non-thinking mode
🔹 deepseek-reasoner → thinking mode
🧵 128K context for both
🔌 Anthropic API format supported: api-docs.deepseek.com/guides/anthrop…
✅ Strict Function Calling supported in Beta API: api-docs.deepseek.com/guides/anthropic_api
🚀 More API resources, smoother API experience

🤔2🐳2✍1❤1👎1👨‍💻1

1.62K views06:34

DeepSeek

Tools & Agents Upgrades 🧰

📈 Better results on SWE / Terminal-Bench
🔍 Stronger multi-step reasoning for complex search tasks
⚡️ Big gains in thinking efficiency

🤯5❤3👎1🔥1🐳1

2.3K views06:35

DeepSeek

Model Update 🤖

🔹 V3.1 Base: 840B tokens continued pretraining for long context extension on top of V3
🔹 Tokenizer & chat template updated — new tokenizer config: https://huggingface.co/deepseek-ai/DeepSeek-V3.1/blob/main/tokenizer_config.json
🔗 V3.1 Base Open-source weights: huggingface.co/deepseek-ai/DeepSeek-V3.1-Base
🔗 V3.1 Open-source weights: huggingface.co/deepseek-ai/DeepSeek-V3.1

🐳9❤1🖕1

1.35K viewsedited 06:37

DeepSeek

Pricing Changes 💳

🔹 New pricing starts & off-peak discounts end at Sep 5th, 2025, 16:00 (UTC Time)
🔹 Until then, APIs follow current pricing
📝 Pricing page: https://api-docs.deepseek.com/quick_start/pricing/

😱16❤2🐳2👎1

1.57K views06:37

DeepSeek

🚀 DeepSeek-V3.1 → DeepSeek-V3.1-Terminus
The latest update builds on V3.1’s strengths while addressing key user feedback.

✨ What’s improved?
🌐 Language consistency: fewer CN/EN mix-ups & no more random chars.
🤖 Agent upgrades: stronger Code Agent & Search Agent performance.

👍5🐳5🤩2

898 views14:06

DeepSeek

📊 DeepSeek-V3.1-Terminus delivers more stable & reliable outputs across benchmarks compared to the previous version.

👉 Available now on: App / Web / API
🔗 Open-source weights here: https://huggingface.co/deepseek-ai/DeepSeek-V3.1-Terminus

Thanks to everyone for your feedback. It drives us to keep improving and refining the experience! 🚀

🐳9🔥7

978 views14:08

DeepSeek

🚀 Introducing DeepSeek-V3.2-Exp — our latest experimental model!

✨ Built on V3.1-Terminus, it debuts DeepSeek Sparse Attention(DSA) for faster, more efficient training & inference on long context.
👉 Now live on App, Web, and API.
💰 API prices cut by 50%+!

🐳7👍2

701 views02:04

DeepSeek

⚡️ Efficiency Gains

🤖 DSA achieves fine-grained sparse attention with minimal impact on output quality — boosting long-context performance & reducing compute cost.
📊 Benchmarks show V3.2-Exp performs on par with V3.1-Terminus.

🐳4🔥3

1.06K views02:05

DeepSeek

💻 API Update

🎉 Lower costs, same access!
💰 DeepSeek API prices drop 50%+, effective immediately.

🔹For comparison testing, V3.1-Terminus remains available via a temporary API until Oct 15th, 2025, 15:59 (UTC Time). Details: https://api-docs.deepseek.com/guides/comparison_testing
🔹Feedback welcome: https://feedback.deepseek.com/dsa

🐳6🔥2

1.42K views02:07

DeepSeek

🛠 Open Source Release

⛓ Model: https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Exp
⛓ Tech report: https://github.com/deepseek-ai/DeepSeek-V3.2-Exp/blob/main/DeepSeek_V3_2.pdf
⛓ Key GPU kernels in TileLang & CUDA (use TileLang for rapid research prototyping!)

🐳12❤1🔥1

1.48K views02:09

DeepSeek

⚠️ Heads-up to anyone using the DeepSeek-V3.2-Exp inference demo: earlier versions had a RoPE implementation mismatch in the indexer module that could degrade performance. Indexer RoPE expects non-interleaved input, MLA RoPE expects interleaved. Fixed in https://github.com/deepseek-ai/DeepSeek-V3.2-Exp/tree/main/inference

🐳12❤1👍1🔥1

778 views03:04

About

Blog

Apps

Platform