A compact, nimble model that punches above its weight for a 1.5B parameter system. It handles everyday instruction-following, simple Q&A, and light reasoning tasks with reasonable coherence, though it struggles with complex multi-step problems or nuanced understanding that larger models handle more gracefully. Think of it as a fast, lightweight workhorse suited for constrained environments.
| Benchmark | Score | Type | Recorded |
|---|---|---|---|
| MATH | 22.1 | accuracy | 26d ago |
| MMLU-Pro | 20.0 | accuracy | 26d ago |
| MuSR | 3.2 | accuracy | 26d ago |
| BBH | 19.8 | accuracy | 26d ago |
| IFEval | 44.8 | accuracy | 26d ago |
| GPQA Diamond | 0.8 | accuracy | 26d ago |