Blog

Notes on local LLMs, hardware tradeoffs, and what actually moves token-per-second numbers.