NVIDIA Nemotron 3 Super 120B A12B

NVIDIA · released 2026-03-01 · NVIDIA Open Model License

NVIDIA Nemotron 3 Super is a hybrid Mamba-Transformer MoE reasoning model with 120B total and 12B active parameters, released in March 2026 and tuned for throughput and a 1M-token context. It scores 36 on the Artificial Analysis index, ahead of GPT-OSS-120B and behind Qwen3.5-122B.

Key specs

Type: Local open-weight
Parameters: 120B total / 12B active
Architecture: Hybrid Mamba-Transformer MoE
Context window: 1M tokens
Knowledge cutoff:
Modalities: text
Recommended backends:
Minimum viable rig:
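
As a rough aid for sizing hardware, the sketch below estimates how much memory the weights alone would occupy at a few precisions. The 120B and 12B parameter counts come from this card; the bit widths, the function names, and the assumption that every expert stays resident in memory (so the total count, not the active count, sets the weight footprint) are illustrative assumptions rather than published figures. KV cache, Mamba state, activations, and runtime overhead are not included.

```python
# Back-of-the-envelope weight-memory estimate (a sketch, not a vendor figure).
# Parameter counts are taken from the model card; everything else is assumed.

TOTAL_PARAMS = 120e9   # total parameters (from the card)
ACTIVE_PARAMS = 12e9   # parameters active per token (from the card)


def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Memory needed for the weights alone, in decimal GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


def active_fraction(total_params: float, active_params: float) -> float:
    """Share of the parameters used for each token in the MoE."""
    return active_params / total_params


if __name__ == "__main__":
    # Hypothetical precisions; the card does not list which formats are available.
    for label, bits in (("16-bit", 16), ("8-bit", 8), ("4-bit", 4)):
        gb = weight_memory_gb(TOTAL_PARAMS, bits)
        print(f"{label}: about {gb:.0f} GB for weights")
    frac = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
    print(f"Active share per token: {frac:.0%}")
```

The active share mainly affects per-token compute and therefore throughput. Under the all-experts-resident assumption, the memory requirement follows the total parameter count.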

Benchmark scores

GPQA Diamond: 79.23%
SWE-bench Verified: 60.47%
AIME: 90.21%
MMLU-Pro: 83.73%
BFCL v3 (tool use): 61.15%
Composite score: 6.54
Community rating: No reviews yet

Strengths & weaknesses

Strengths: Very high throughput (hybrid Mamba-Transformer MoE, 12B active); strong long-context performance (RULER 91.75 at 1M tokens); native 1M-token context

Weaknesses: Lags Qwen3.5-122B on most knowledge and agentic benchmarks; low HLE score (18.26); restrictive NVIDIA license