DeepSeek V3.1 Terminus
deepseek-ai · released 2025-09-22 · mit license
DeepSeek-V3.1-Terminus (Sept 2025) is a maintenance update of V3.1 improving agent and language consistency, with vendor reasoning-mode scores of 85.0% MMLU-Pro, 80.7% GPQA Diamond and 68.4% SWE-bench Verified.
Key specs
| Type | Local open-weight |
|---|---|
| Parameters | 684.53B total · MoE, — active |
| Architecture | deepseek_v3 |
| Context window | 164K tokens |
| Knowledge cutoff | 2025-03-31 |
| Modalities | text |
| Recommended backends | — |
| Minimum viable rig | — |
Benchmark scores
| GPQA Diamond | 80.7% |
|---|---|
| SWE-bench Verified | 68.4% |
| AIME | — |
| MMLU-Pro | 85% |
| BFCL v3 (tool use) | — |
| Composite score | 6.42 |
| Community rating | No reviews yet |
VRAM & disk per quantization
| Quant | VRAM | Disk | RAM | Context |
|---|---|---|---|---|
| Q4_K_M | 398.5 GB | 397 GB | — | 164K |
Strengths & weaknesses
Strengths: Hybrid reasoning model with strong agentic tool use (SWE 68.4); Improved language consistency over V3.1; Open weights (MIT), cheap API
Weaknesses: Lower HLE (21.7) than frontier reasoners; AA index of 21 is an estimate