Conifer · Models
Models
Size · speed · intelligence. Every model, local.
Supported · 28 models
| model | size | tok/s | intelligence |
|---|---|---|---|
| Llama | |||
| Llama 3.2 1B | 1.2B | 299 | 49.3 |
| Llama 3.2 3B | 3.2B | 118 | 63.4 |
| Llama 3.1 8B | 8.0B | 49 | 69.4 |
| Hermes 3 · Llama 3.1 8B | 8.0B | ~49 | 64.8 |
| Qwen 3 | |||
| Qwen 3 0.6B | 0.6B | ~290 | 52.8 |
| Qwen 3 1.7B | 1.7B | ~165 | 62.6 |
| Qwen 3 4B | 4.0B | 90 | 73.0 |
| Qwen 3 8B | 8.2B | 46 | 76.9 |
| Qwen 3 14B | 14.8B | ~25 | 81.1 |
| Qwen 3 32B | 32.8B | ~12 | 83.6 |
| Qwen 2.5 | |||
| Qwen 2.5 0.5B | 0.5B | 288 | 47.5 |
| Qwen 2.5 1.5B | 1.5B | 166 | 60.9 |
| Qwen 2.5 3B | 3.1B | 89 | 65.6 |
| Qwen 2.5 7B | 7.6B | 53 | 74.2 |
| Qwen 2.5 14B | 14.7B | ~27 | 79.7 |
| Qwen 2.5 Coder 1.5B | 1.5B | ~166 | 53.6 |
| Qwen 2.5 Coder 7B | 7.6B | ~53 | 68.0 |
| Gemma | |||
| Gemma 2 2B | 2.6B | 138 | 51.3 |
| Gemma 2 9B | 9.2B | ~41 | 71.3 |
| Gemma 3 4B | 4.3B | 66 | 59.6 |
| Gemma 3 12B | 11.9B | 15 | 74.5 |
| DeepSeek | |||
| R1 Distill Qwen 1.5B | 1.8B | ~166 | — |
| R1 Distill Qwen 7B | 7.6B | ~53 | — |
| R1 Distill Llama 8B | 8.0B | ~49 | — |
| R1 Distill Qwen 14B | 14.8B | ~27 | — |
| R1 Distill Qwen 32B | 32.8B | ~12 | — |
| DeepSeek V2 Lite | 15.7B | — | 55.7 |
| Phi | |||
| Phi 3.5 Mini | 3.8B | ~78 | 69.0 |
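tok/s: decode throughput on an Apple M3 Max, Q4_K_M quantization, 512-token prompt; a leading ~ marks a value projected from a measured sibling model. intelligence: MMLU 5-shot score, as published. A cell containing only — means not measured (tok/s) or not published (intelligence).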
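
If you want to consume the table programmatically, the legend's conventions (a leading `~` for projected speeds, a lone `—` for missing values) need explicit handling. The following is a minimal sketch in Python, assuming the rows are available as markdown text exactly as shown above. The names `Speed`, `parse_speed`, and `parse_row` are hypothetical helpers for illustration, not part of any Conifer API.

```python
from typing import NamedTuple, Optional

MISSING = "—"  # legend: not measured / not published


class Speed(NamedTuple):
    tok_per_s: Optional[float]  # None when the cell holds the missing marker
    projected: bool             # True when the cell carried a leading "~"


def parse_speed(cell: str) -> Speed:
    """Parse one tok/s cell according to the legend."""
    text = cell.strip()
    if text in ("", MISSING):
        return Speed(None, False)
    return Speed(float(text.lstrip("~")), text.startswith("~"))


def parse_row(line: str) -> Optional[dict]:
    """Parse a data row; return None for header, separator, and family rows."""
    cells = [c.strip() for c in line.strip().strip("|").split("|")]
    if len(cells) != 4:
        return None  # family rows such as "| Llama | |||" collapse to fewer cells
    name, size, speed, score = cells
    if name == "model" or set(name) <= {"-"}:
        return None  # column header or "|---|---|" separator
    return {
        "name": name,
        "size_b": float(size.rstrip("B")),  # parameters, in billions
        "speed": parse_speed(speed),
        "mmlu": None if score == MISSING else float(score),
    }


if __name__ == "__main__":
    for line in (
        "| Qwen 3 | |||",
        "| Qwen 3 14B | 14.8B | ~25 | 81.1 |",
        "| DeepSeek V2 Lite | 15.7B | — | 55.7 |",
    ):
        print(parse_row(line))
```

Keeping `projected` as a separate flag, rather than discarding the `~`, lets downstream code filter to measured speeds only when comparing models.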