Welcome to the blog

This is the first post. The plan is short, opinionated notes on running large language models locally — what hardware actually matters, what to ignore, and where the gap between "fits in memory" and "useful at the desk" lives.

Posts are written as plain markdown in content/blog/. Each one has a title, description, and date in the frontmatter. The index sorts by date.

What to expect

Practical hardware notes: bandwidth, quantization, headroom.
Short reads. No 3,000-word listicles.
Occasional reproducible benchmarks, when there's something worth measuring.

If you have a topic you'd like to see covered, open an issue.