Available Memory
64GB
Unified Memory
Memory Bandwidth
273GB/s
Basic1000 GB/s max
Why Bandwidth Matters

LLM inference is memory-bound. Higher bandwidth directly translates to faster token generation, making it more important than raw compute power.

Top Picks for Your Hardware

All Compatible Models

Gemma 3 1B

Gemma1B
FP16very high
2.5 GB64 GB
Fast81.9 t/s
General
ollama run gemma3:1b

Llama 3.2 1B

Llama1B
FP16very high
2.3 GB64 GB
Fast81.9 t/s
General
ollama run llama3.2:1b

Granite 3 MoE 1B

Granite1B
FP16very high
2.3 GB64 GB
Fast81.9 t/s
GeneralCoding
ollama run granite3-moe:1b

Qwen 3 0.6B

Qwen0.6B
FP16very high
1.4 GB64 GB
Fast137 t/s
General
ollama run qwen3:0.6b

Qwen 2.5 0.5B

Qwen0.5B
FP16very high
1.2 GB64 GB
Fast164 t/s
General
ollama run qwen2.5:0.5b

SmolLM2 360M

SmolLM0.36B
FP16very high
0.8 GB64 GB
Fast228 t/s
General
ollama run smollm2:360m

Gemma 2 2B

Gemma2B
FP16very high
5.0 GB64 GB
Fast40.9 t/s
General
ollama run gemma2:2b

EXAONE 3.5 2.4B

EXAONE2.4B
FP16very high
5.0 GB64 GB
Fast34.1 t/s
GeneralCoding
ollama run exaone3.5:2.4b

SmolLM2 135M

SmolLM0.135B
FP16very high
0.3 GB64 GB
Fast607 t/s
General
ollama run smollm2:135m

Granite 3 Dense 2B

Granite2B
FP16very high
4.2 GB64 GB
Fast40.9 t/s
GeneralCoding
ollama run granite3-dense:2b

Qwen 3 1.7B

Qwen1.7B
FP16very high
3.6 GB64 GB
Fast48.2 t/s
GeneralCoding
ollama run qwen3:1.7b

SmolLM2 1.7B

SmolLM1.7B
FP16very high
3.5 GB64 GB
Fast48.2 t/s
GeneralCoding
ollama run smollm2:1.7b

Qwen 2.5 3B

Qwen3B
FP16very high
7.0 GB64 GB
Good27.3 t/s
GeneralCoding
ollama run qwen2.5:3b

Llama 3.2 3B

Llama3B
FP16very high
6.5 GB64 GB
Good27.3 t/s
GeneralCoding
ollama run llama3.2:3b

StarCoder2 3B

StarCoder3B
FP16very high
6.5 GB64 GB
Good27.3 t/s
Coding
ollama run starcoder2:3b

Kimi K1.5 A3B

Kimi3B
FP16very high
6.5 GB64 GB
Good27.3 t/s
GeneralReasoningMath
ollama run kimi-k1.5:a3b

Granite 3 MoE 3B

Granite3B
FP16very high
6.5 GB64 GB
Good27.3 t/s
GeneralCoding
ollama run granite3-moe:3b

Phi-3 Mini (3.8B)

Phi3.8B
FP16very high
7.8 GB64 GB
Good21.6 t/s
GeneralCodingReasoning
ollama run phi3:mini

Qwen 3 4B

Qwen4B
FP16very high
8.5 GB64 GB
Good20.5 t/s
GeneralCodingReasoning
ollama run qwen3:4b

GLM Edge 4B

GLM4B
FP16very high
8.2 GB64 GB
Good20.5 t/s
GeneralCoding
ollama run glm-edge:4b

Gemma 3 4B

Gemma4B
FP16very high
8.2 GB64 GB
Good20.5 t/s
GeneralCoding
ollama run gemma3:4b

Codestral 22B

Mistral22B
FP16very high
44.0 GB64 GB
Very Slow3.7 t/s
Coding
ollama run codestral:22b