Long-form notes on LLM architecture, systems thinking, and engineering decisions in real-world AI products.
Apr 3, 2026
Why you are paying too much for tokens, how caching actually works, and the math that makes it obvious.
Mar 6, 2026
Coding agents, harness design, and a daily practice for getting less wrong.