GLM 4.5 Air

zai-org · released 2025-07-25 · mit license

GLM-4.5-Air is the compact 106B MoE sibling of GLM-4.5 (July 2025, MIT); per its report it scores 57.6% SWE-bench Verified, 89.4% AIME 24, 75.0% GPQA and 81.4% MMLU-Pro, trading capability for efficiency.

Key specs

TypeLocal open-weight
Parameters110.47B total · MoE, — active
Architectureglm4_moe
Context window131K tokens
Knowledge cutoff2024-12-31
Modalitiestext
Recommended backends
Minimum viable rig

Benchmark scores

GPQA Diamond75%
SWE-bench Verified57.6%
AIME89.4%
MMLU-Pro81.4%
BFCL v3 (tool use)
Composite score6.14
Community ratingNo reviews yet

VRAM & disk per quantization

QuantVRAMDiskRAMContext
Q4_K_M65.6 GB64.1 GB131K

Strengths & weaknesses

Strengths: Compact 106B MoE with near-flagship reasoning (AIME 89.4, MATH-500 98.1); Efficient agentic/tool-use; MIT license

Weaknesses: Lower coding than full GLM-4.5 (SWE-bench 57.6); Superseded by newer GLM generations