Resources · AI

AI workstation spec guide.

The decisions that actually matter when you're sizing a local AI machine.

GPU

VRAM is usually the constraint

Model size + quantization sets the floor
Batch size and context length push the ceiling
Multi-GPU spreads VRAM but adds latency and complexity

Platform

CPU, RAM, and PCIe

PCIe lane count matters for multi-GPU and NVMe
RAM at least 2x total VRAM is a good baseline
Single-thread performance still matters for data prep

Cooling and power

Sustained load, not peak

PSU sized for sustained multi-GPU + headroom
Case airflow over GPU shrouds
Noise budget for the room

Software

Stack notes

Linux + CUDA for most production stacks
Ollama / llama.cpp / vLLM trade-offs
Container vs bare-metal driver paths