Phi 3.5 Mini Instruct

microsoft · released 2024-08-16 · mit license

Phi-3.5-mini-instruct is Microsoft's 3.8B open model (Aug 2024, MIT, 128K) for reasoning/math/multilingual, with strong long-context retrieval (RULER 84.1) but only moderate instruction-following (IFEval 50.6).

Key specs

TypeLocal open-weight
Parameters3.82B total
Architecturephi3
Context window131K tokens
Knowledge cutoff2023-10-01
Modalitiestext
Recommended backends
Minimum viable rig

Benchmark scores

GPQA Diamond
SWE-bench Verified
AIME
MMLU-Pro47.4%
BFCL v3 (tool use)
Composite score5.02
Community ratingNo reviews yet

VRAM & disk per quantization

QuantVRAMDiskRAMContext
Q4_K_M3.7 GB2.2 GB131K

Strengths & weaknesses

Strengths: Strong reasoning/math for 3.8B (GSM8K 86.2); Solid long-context retrieval (RULER 84.1), 128K; Multilingual; MIT

Weaknesses: Moderate instruction-following (IFEval 50.6); Lower MMLU-Pro than later small models