Skip to content

AI & automation

This is a peer of the other four areas, not a bolt-on. The same discipline that makes an integration platform trustworthy — contracts, observability, governance — is exactly what most AI projects are missing. I build agentic systems the way I build any production system: with cost controls, evaluation gates, and an audit trail.

What I do

  • Agentic systems with guardrails. Model routing, per-agent cost and budget enforcement, vendor abstraction, and a local-model fallback for sovereignty — with Prometheus observability so spend and behaviour are visible.
  • Retrieval that's grounded. RAG with proper embeddings and reranking so answers come from your data, not the model's imagination.
  • Evaluation as a gate. L1–L4 test layers, LLM-as-judge, and golden datasets so a regression is caught before it ships — the AI equivalent of acceptance criteria.
  • Governance you can audit. Deterministic tool contracts (MCP), answers that trace to a recorded call chain, and an "agents propose, humans dispose" model enforced in the delivery pipeline.

The proof

I don't just advise on this — I build it. The portfolio is three sibling repositories told as one govern → build → deliver story. In the interest of honesty: one of them runs end-to-end today; the other two are reference architectures. They are not three production systems, and the cards say so plainly.

Evidenced by

  • The AI trio — the govern→build→deliver case study behind this area.
  • Portfolio — the repositories themselves, with honest maturity labels.

Background: DeepLearning.AI Agentic AI (2025); 20+ years building data and ML systems.