Evaluating agentic systems beyond the demo
Demos lie cheerfully. These are the evals we actually run before we ship.
Everyone ships prompts. The product is what happens when the prompt fails.
The prompt is the part of an AI product people talk about. It is also the smallest part that matters. The product is what the system does when the prompt’s output is wrong, malformed, dangerous, or simply boring.
Every AI feature we have shipped spends more engineering effort on the failure path than on the happy path. That is not a complaint. It is the shape of the work.
Hallucinated facts. Malformed JSON. Refusals where refusal is unhelpful. Confident answers to questions the user did not ask. Each of these needs a specific response: retry with a different model, fall back to a rules-based path, surface the uncertainty, ask a clarifying question, escalate to a human.
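The mapping from failure mode to specific response can be sketched as a small dispatch table. This is a minimal illustration, not our production code; the classifier heuristics and the handler names (`classify`, `dispatch`, the refusal prefixes) are all assumptions made up for the example.

```python
import json
from enum import Enum, auto

class Failure(Enum):
    MALFORMED_JSON = auto()
    REFUSAL = auto()
    OK = auto()

# Hypothetical classifier: the heuristics here are illustrative stand-ins
# for whatever detection a real system would use.
def classify(raw: str) -> Failure:
    try:
        json.loads(raw)  # structural check: is the output parseable at all?
    except json.JSONDecodeError:
        return Failure.MALFORMED_JSON
    return Failure.OK

def classify_text(raw: str) -> Failure:
    # For free-text outputs: a crude refusal check, purely as a sketch.
    if raw.lstrip().lower().startswith(("i can't", "i cannot", "i'm sorry")):
        return Failure.REFUSAL
    return Failure.OK

# Each failure mode gets its own specific response, as described above.
def dispatch(failure: Failure) -> str:
    return {
        Failure.MALFORMED_JSON: "retry_with_different_model",
        Failure.REFUSAL: "fall_back_to_rules_based_path",
        Failure.OK: "deliver_to_user",
    }[failure]
```

The point of the table is that "the prompt failed" is never one condition; each branch has a distinct remedy, and each remedy needs its own tests.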
A production agentic system is a set of guardrails, validators, fallbacks, and observability hooks arranged around a core model call. The prompt is the heart. The rest is the cardiovascular system. You cannot ship with just the heart.
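The arrangement above can be made concrete with a wrapper: validators and an observability hook around one core model call, with a retry loop and a rules-based fallback when the model path is exhausted. A minimal sketch, assuming hypothetical `call_model` and `rules_fallback` callables; nothing here is a real library API.

```python
import json
import logging
from typing import Callable

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("agent")

def guarded_call(call_model: Callable[[str], str],
                 rules_fallback: Callable[[str], str],
                 prompt: str,
                 max_retries: int = 2) -> dict:
    """Guardrails, validators, a fallback, and an observability hook
    arranged around a core model call (the 'heart')."""
    for attempt in range(max_retries):
        raw = call_model(prompt)
        # Observability hook: record every attempt, success or not.
        log.info("attempt=%d output_chars=%d", attempt, len(raw))
        try:
            parsed = json.loads(raw)          # validator: well-formed JSON
        except json.JSONDecodeError:
            continue                          # guardrail: retry the core call
        if "answer" in parsed:                # validator: required field present
            return {"source": "model", "answer": parsed["answer"]}
    # Fallback: the rules-based path when every model attempt fails validation.
    return {"source": "rules", "answer": rules_fallback(prompt)}
```

Usage follows the same shape in a test harness: feed it a model stub that returns garbage and assert the rules path fires; feed it valid JSON and assert the model path wins.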