I build production AI for
regulated industries.
28 years delivering AI/ML platforms in healthcare, energy, and finance. $4.1B in measurable outcomes. Four open-source frameworks in production — most recently Bulwark, the agent-security framework defending AI systems against prompt injection. I write Field Notes: Production AI — bi-weekly dispatches from the inside of regulated AI.
I started my AI career at ISRO, training neural networks before transformers existed. What has stayed constant across three decades is a belief that AI systems are only as valuable as their reliability in production — not their performance on a benchmark.
Across Tech, Techstartups, Healthcare, Lifesciences, Energy & Utilities, Insurance, Banking, and Fintech I have led platform transformations that generated over $4B in measurable business outcomes. The work I am most proud of is not the technology — it is the trust that business stakeholders placed in AI systems I built, because those systems told the truth about uncertainty when it mattered.
I co-founded the CAIO Circle Tri-State Chapter to build the executive AI leadership community that I wished had existed when I was navigating these decisions alone. I publish and speak to share what the journey actually looks like from inside enterprise AI — not the demo, the production.
Production-grade AI platforms built from real requirements at Ambharii Labs. No demos. No prototypes.
Production-grade Python framework defending AI agents against prompt-injection attacks. Five-layer defense — Input Sanitizer, ML + pattern Detector, Compartmentalized RBAC, Encrypted Audit Trail, and Human Confirmation Gates. MCP-native and compliance-ready for HIPAA, NERC CIP, and SOC 2.
11 Autonomous AI Agents powering denial prevention and revenue recovery for healthcare revenue cycle management. Built with G-ARVIS Observability and ARGUS Self-Correction to predict claim denials before submission and optimize A/R workflows at scale.
Enterprise LLM observability and scoring platform. Monitors six dimensions of production LLM health — Groundedness, Accuracy, Reliability, Variance, Inference Cost, and Safety — to ensure AI systems perform when stakes are real.
Six dimensions of production LLM health — distilled from building AI systems that govern billions in capital decisions across regulated industries.
Read the Full Article →From data infrastructure to model deployment to business translation — the complete stack of skills required to ship AI that actually works in production.
Prompt injection is no longer theoretical. Live attacks now target MCP servers, agentic browsers, and tool-using LLMs through hidden HTML, Unicode abuse, and role-marker injection. Introduces Bulwark — an open-source five-layer defense framework for production AI agents.
Why benchmark scores are a distraction — and the 8 measurements that will make or break your AI system when real money is on the line. Introduces the G-ARVIS framework for production LLM observability.
Single-turn accuracy metrics break down when your LLM is taking multi-step actions. What needs to change in your observability stack before you ship agentic workflows to production.
If you've onboarded a teammate to Claude Code in the last six months, you've probably had this conversation: "It says I'm being charged per token, but I have a Pro subscription?" Six auth methods, one priority chain — and the open-source script I built to solve it across macOS, Linux, Windows, and WSL.
Open to conversations about engineering leadership, AI platform architecture, speaking engagements, and advisory roles at companies building AI that has to work when stakes are real.