Building production-ready infrastructure for AI systems.
CTO at Arklex AI. I focus on making AI systems production-ready — reliable under load, predictable under failure, and efficient under constraint.
Led Kubernetes migration at Airbnb, saving $63M in infrastructure costs.
Co-built Gunrock, the 2018 Amazon Alexa Prize-winning conversational AI. Previously at Airbnb and HTC. NTU alum.
AI systems fail in predictable ways.
Most teams ignore reliability and cost constraints until production.
I design systems where routing, failure, and budget are first-class concerns.
Agent-first organization framework — the official Python library for building structured, production-ready AI agent systems at Arklex.