
LLM VRAM Calculator

Estimate how much GPU memory your local LLM will use. Plan your hardware before you buy.

Model size: 8B parameters

1B · 7B · 30B · 70B · 120B

Quantization

~5-8% quality loss

Context window: 8K tokens

2K · 8K · 32K · 64K · 128K

Estimated VRAM needed

5.8 GB

Base model: 4.7 GB · KV cache: 0.0 GB · Activations: ~1 GB
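The breakdown above (weights + KV cache + activations) can be approximated by hand. The following is a minimal sketch, not this calculator's actual formula: the function name, the architecture parameters (layer count, KV heads, head dimension), the fp16 KV cache, and the flat 1 GB activation allowance are all assumptions chosen for illustration.

```python
def estimate_vram_gb(
    params_billion: float,
    bits_per_weight: float,
    context_tokens: int,
    n_layers: int,
    n_kv_heads: int,
    head_dim: int,
    kv_bytes_per_value: int = 2,   # assumption: KV cache stored in fp16
    activations_gb: float = 1.0,   # assumption: flat allowance, like the "~1 GB" above
) -> dict:
    """Rough VRAM estimate in decimal GB (1 GB = 1e9 bytes)."""
    # Weights: parameter count times bits per weight, converted to bytes.
    weights_gb = params_billion * 1e9 * bits_per_weight / 8 / 1e9

    # KV cache: one key and one value vector per layer, per KV head, per token.
    kv_bytes = 2 * n_layers * n_kv_heads * head_dim * context_tokens * kv_bytes_per_value
    kv_cache_gb = kv_bytes / 1e9

    return {
        "weights_gb": weights_gb,
        "kv_cache_gb": kv_cache_gb,
        "activations_gb": activations_gb,
        "total_gb": weights_gb + kv_cache_gb + activations_gb,
    }


# Hypothetical 8B model: 32 layers, 8 KV heads, head dimension 128, 8K context.
# 4.7 bits per weight is picked only because it matches the 4.7 GB base figure.
estimate = estimate_vram_gb(8, 4.7, 8192, n_layers=32, n_kv_heads=8, head_dim=128)
for name, value in estimate.items():
    print(f"{name}: {value:.2f}")
```

With these hypothetical inputs the KV cache term comes to roughly 1 GB, so the sketch's total will not match the figure shown on this page, which evidently accounts for the KV cache differently.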

GPU Compatibility

RTX 3060 12GB · 12 GB · ✅ Comfortable

RTX 3080 / 4070 / 4070 Super · 12 GB · ✅ Comfortable

RTX 4080 · 16 GB · ✅ Comfortable

RTX 4090 / 3090 · 24 GB · ✅ Comfortable

RTX 5090 · 32 GB · ✅ Comfortable

A6000 · 48 GB · ✅ Comfortable

2x RTX 3090 · 48 GB · ✅ Comfortable

A100 80GB / H100 80GB · 80 GB · ✅ Comfortable

Mac Studio M2 Ultra 192GB · 192 GB · ✅ Comfortable
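A compatibility check like the list above boils down to comparing the estimate against each card's memory. The sketch below is illustrative only: the 80% headroom threshold and the "Tight" and "Does not fit" labels are assumptions, since this page only shows the "Comfortable" case and does not state its cutoffs.

```python
def fit_label(gpu_vram_gb: float, needed_gb: float, headroom: float = 0.8) -> str:
    """Classify whether an estimated footprint fits on a GPU.

    headroom is a hypothetical safety margin: the estimate must stay
    under this fraction of total VRAM to count as "Comfortable".
    """
    if needed_gb <= gpu_vram_gb * headroom:
        return "Comfortable"
    if needed_gb <= gpu_vram_gb:
        return "Tight"          # hypothetical label, not shown on this page
    return "Does not fit"       # hypothetical label, not shown on this page


# The 5.8 GB estimate against a few of the capacities listed above.
for name, vram in [("RTX 3060 12GB", 12), ("RTX 4080", 16), ("RTX 4090 / 3090", 24)]:
    print(name, fit_label(vram, 5.8))
```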

📌 Notes on accuracy