NVIDIA Nemotron Nano 9B V2

nvidia · released 2025-09-05 · other license

NVIDIA-Nemotron-Nano-9B-v2 (Aug 2025) is a 9B hybrid Mamba-Transformer reasoning model with strong math/coding (MATH-500 97.8, LiveCodeBench 71.1, AIME25 72.1) and toggleable reasoning. Sept-2024 cutoff.

Key specs

TypeLocal open-weight
Parameters8.89B total
Architecturenemotron_h
Context window131K tokens
Knowledge cutoff2025-03-31
Modalitiestext
Recommended backends
Minimum viable rig

Benchmark scores

GPQA Diamond64%
SWE-bench Verified
AIME72.1%
MMLU-Pro
BFCL v3 (tool use)
Composite score5.43
Community ratingNo reviews yet

VRAM & disk per quantization

QuantVRAMDiskRAMContext
Q4_K_M6.7 GB5.2 GB131K

Strengths & weaknesses

Strengths: Excellent math/coding for 9B (MATH-500 97.8, LiveCodeBench 71.1); Hybrid Mamba-2/Transformer, toggleable reasoning + thinking budget; Strong IFEval (90.3) and long context (RULER 78.9)

Weaknesses: Very low on hardest frontier QA (HLE 6.5); Restrictive NVIDIA Open Model License