skip to content
Conifer · Models

Models

Size · speed · intelligence. Every model, local.

Supported28 models
modelsizetok/sintelligence
Llama
Llama 3.2 1B1.2B29949.3
Llama 3.2 3B3.2B11863.4
Llama 3.1 8B8.0B4969.4
Hermes 3 · Llama 3.1 8B8.0B~4964.8
Qwen 3
Qwen 3 0.6B0.6B~29052.8
Qwen 3 1.7B1.7B~16562.6
Qwen 3 4B4.0B9073.0
Qwen 3 8B8.2B4676.9
Qwen 3 14B14.8B~2581.1
Qwen 3 32B32.8B~1283.6
Qwen 2.5
Qwen 2.5 0.5B0.5B28847.5
Qwen 2.5 1.5B1.5B16660.9
Qwen 2.5 3B3.1B8965.6
Qwen 2.5 7B7.6B5374.2
Qwen 2.5 14B14.7B~2779.7
Qwen 2.5 Coder 1.5B1.5B~16653.6
Qwen 2.5 Coder 7B7.6B~5368.0
Gemma
Gemma 2 2B2.6B13851.3
Gemma 2 9B9.2B~4171.3
Gemma 3 4B4.3B6659.6
Gemma 4 12B11.9B1574.5
DeepSeek
R1 Distill Qwen 1.5B1.8B~166
R1 Distill Qwen 7B7.6B~53
R1 Distill Llama 8B8.0B~49
R1 Distill Qwen 14B14.8B~27
R1 Distill Qwen 32B32.8B~12
DeepSeek V2 Lite15.7B55.7
Phi
Phi 3.5 Mini3.8B~7869.0

tok/s — decode, Apple M3 Max, Q4_K_M, 512-token prompt; ~ projected from a measured sibling. intelligence — MMLU 5-shot, as published. — not measured / not published.