About

I build the evaluation infrastructure that keeps advanced AI aligned — from chain-of-thought faithfulness to multi-agent governance, from research prototypes to production systems.

Current Projects (2026)

Research Focus

I focus on practical evaluation tools that bridge the gap between AI evaluation research and real-world deployment. My work addresses:

Connect