Mixtral 8x22B Instruct V0.1
mistralai · released 2024-04-17 · apache-2.0 license
Mixtral-8x22B-Instruct is Mistral's April 2024 open-weight MoE (141B total / 39B active, Apache-2.0). Mistral's announcement reports strong GSM8K/Math but its standard benchmarks are published only as chart images, so no comparable per-metric numbers could be sourced.
Key specs
| Type | Local open-weight |
|---|---|
| Parameters | 140.63B total · MoE, — active |
| Architecture | mixtral |
| Context window | 66K tokens |
| Knowledge cutoff | 2024-01-31 |
| Modalities | text |
| Recommended backends | — |
| Minimum viable rig | — |
Benchmark scores
| GPQA Diamond | — |
|---|---|
| SWE-bench Verified | — |
| AIME | — |
| MMLU-Pro | — |
| BFCL v3 (tool use) | — |
| Composite score | — |
| Community rating | No reviews yet |
VRAM & disk per quantization
| Quant | VRAM | Disk | RAM | Context |
|---|---|---|---|---|
| Q4_K_M | 83.1 GB | 81.6 GB | — | 66K |
Strengths & weaknesses
Strengths: Sparse MoE (141B total / 39B active) with strong math/coding for its 2024 generation; Native function calling, multilingual; Apache-2.0, 64K context
Weaknesses: 2024-era model; not evaluated on most modern benchmarks; Outclassed by current open models