Software engineer working at the intersection of Kubernetes, bare-metal, and AI inference. I’m passionate about solving complex problems in distributed systems with simple principles. Here I share techniques to improve system performance and scalability across compute and network. Join me in exploring the frontiers of accelerator scheduling, inference serving, and everything in between.
Opinions here are my own and do not reflect those of my employer.


