DeepSeek V3.1 Terminus

deepseek-ai · released 2025-09-22 · mit license

DeepSeek-V3.1-Terminus (Sept 2025) is a maintenance update of V3.1 improving agent and language consistency, with vendor reasoning-mode scores of 85.0% MMLU-Pro, 80.7% GPQA Diamond and 68.4% SWE-bench Verified.

Key specs

Type	Local open-weight
Parameters	684.53B total · MoE, — active
Architecture	deepseek_v3
Context window	164K tokens
Knowledge cutoff	2025-03-31
Modalities	text
Recommended backends	—
Minimum viable rig	—

Benchmark scores

GPQA Diamond	80.7%
SWE-bench Verified	68.4%
AIME	—
MMLU-Pro	85%
BFCL v3 (tool use)	—
Composite score	6.42
Community rating	No reviews yet

VRAM & disk per quantization

Quant	VRAM	Disk	RAM	Context
Q4_K_M	398.5 GB	397 GB	—	164K

Strengths & weaknesses

Strengths: Hybrid reasoning model with strong agentic tool use (SWE 68.4); Improved language consistency over V3.1; Open weights (MIT), cheap API

Weaknesses: Lower HLE (21.7) than frontier reasoners; AA index of 21 is an estimate