— Practice / AI

AI engineering, not AI demos.

Anyone can wire a chatbot to a model. We engineer the parts that come after — retrieval, evals, guardrails, observability, and the systems that keep the feature honest in production.

— Practice / What we ship

What we ship.

Production RAG.

Vectorize on Cloudflare. Hybrid retrieval. Chunking strategies tuned to the actual corpus, not a tutorial.
Agent systems.

Anthropic SDK and OpenAI SDK with tool use, function calling, and the boring infrastructure that keeps an agent from looping.
MCP servers.

Model Context Protocol servers on Cloudflare Workers — for Claude, Cursor, or any other MCP client.
Evals as a habit.

If you can’t measure regressions, you don’t have an AI system — you have a prompt. We write evals into the deploy pipeline.

— Practice / What we don’t do

What we don’t do.

Model training or fine-tuning.

If your problem genuinely needs custom weights, we’ll point you at people who do that for a living.
Bolted-on chatbots.

A chat widget on a marketing site is rarely the right answer. We design AI as a feature, not a UI gimmick.
“Just generate the content.”

Marketing-volume text generation is well served by other tooling. That’s not an engineering engagement.

Stack — opinionated

Smaller toolkit, more depth.

Models

Anthropic Claude · OpenAI GPT · Cloudflare Workers AI

Retrieval

Cloudflare Vectorize · D1 · BM25 hybrid

Orchestration

Anthropic SDK · OpenAI SDK · MCP · Workers

Observability

Workers Logs · Sentry · custom evals

Prompts

Versioned in source · evaluated in CI

Surface

API · agent · MCP server · embedded UI

Field notes — AI

Recent writing on the practice.

— From the GitHub

Templates we ship and maintain.

Working production patterns we use ourselves, MIT-licensed. Both deploy to your own Cloudflare account in under five minutes. Companion blog posts explain the architecture in full.

cf-rag-template ↗ ~200 LoC · MIT

Production RAG pipeline on Cloudflare Workers

Repo → Writeup →
cf-mcp-template ↗ ~200 LoC · MIT

Production MCP server on Cloudflare Workers

Repo → Writeup →

See all open work

Working on something AI-shaped? Tell us about it.

Two paragraphs is enough. We respond in writing, with a fit / no-fit answer, within one business day.

Write a brief

AI engineering, not AI demos.

What we ship.

Production RAG.

Agent systems.

MCP servers.

Evals as a habit.

What we don’t do.

Model training or fine-tuning.

Bolted-on chatbots.

“Just generate the content.”

Smaller toolkit, more depth.

Recent writing on the practice.

Workers AI vs OpenAI: A Cost-Quality Matrix at Low Volume

Production RAG on Cloudflare Without LangChain

MCP Servers in Production: When to Build, When to Skip

AI Tools for Atlantic Canada Small Businesses: What's Actually Worth It in 2025

Templates we ship and maintain.

Working on something AI-shaped? Tell us about it.