Granite-3.1-8B-Instruct
IBM · released 2024-12-01 · Apache-2.0 license
Granite-3.1-8B-Instruct is IBM's Dec-2024 8B Apache-2.0 long-context enterprise model with solid instruction-following (IFEval 72.1) and 12-language support.
Key specs
| Type | Local open-weight |
|---|---|
| Parameters | 8.17B total |
| Architecture | granite |
| Context window | 131K tokens |
| Knowledge cutoff | — |
| Modalities | text |
| Recommended backends | — |
| Minimum viable rig | — |
Benchmark scores
| GPQA Diamond | — |
|---|---|
| SWE-bench Verified | — |
| AIME | — |
| MMLU-Pro | — |
| BFCL v3 (tool use) | — |
| Composite score | 7.21 |
| Community rating | No reviews yet |
VRAM & disk per quantization
| Quant | VRAM | Disk | RAM | Context |
|---|---|---|---|---|
| Q4_K_M | 6.2 GB | 4.7 GB | — | 131K |
Strengths & weaknesses
Strengths: Solid instruction following (IFEval 72.1); 128K long-context, multilingual (12 languages); Apache-2.0, enterprise focus
Weaknesses: Modest reasoning/math at hard levels; Outclassed by 2025 small models