Writing
Notes on building AI systems, the math underneath, and the human side of engineering work.

Building Reward Signals for LLM Agents
Lessons from designing evaluation frameworks that actually measure what matters, not just what's easy to measure.
How a Ph.D. in pure math turned into a career building AI systems at scale, and why the transition was less linear than it sounds.

Why human evaluation doesn't scale for agentic systems, and how to build automated evaluation that you can actually trust.

✦ More posts coming soon. In the meantime, you can find my thinking scattered across commit messages and design docs.