Engineering Ceremonies for the AI Era

April 30, 2026

Engineering ceremonies exist to solve a coordination problem: a group of people building one system has to stay aligned on what they're building, who's doing what, and whether it's working — and left to itself, that alignment decays. The rituals are the forcing functions that keep it from decaying. But which rituals isn't fixed; each dominant process model was a response to where the technology of its era put the cost of building software. Waterfall fit a world of mainframes and long, expensive release cycles, where compute was scarce and changing your mind late was ruinous — so you specified everything up front and moved in sequence. Agile arrived once iteration got cheap: faster hardware, version control, continuous integration, and frequent releases dropped the cost of change, so tight feedback loops beat big up-front plans, and the ceremonies reorganized around short cycles. Each regime, in other words, was tuned to a specific constraint — and they all shared one assumption underneath: that the expensive, error-prone step is humans writing the code.

That assumption is what's changing. When code generation gets cheap, the constraint moves off the keyboard and onto alignment and validation — and a ritual aimed at the old constraint quietly optimizes the wrong thing. The rest of this post walks ceremony by ceremony through what each becomes, which new ones earn a slot, and how the roles shift once code stops being the bottleneck.

When AI started writing real code, the obvious question for engineering leaders was which tools should we adopt? The more useful question, a couple of cycles later, is which of our ceremonies still earn their slot?

Code production has compressed by an order of magnitude. The rituals around it are still budgeted as if it hadn't. Standups still narrate yesterday's work. Grooming still hands fuzzy stories to engineers. Planning still estimates by author-time. Demos still show finished UIs. The ceremonies were built to coordinate humans typing — and most of them are now optimizing the wrong segment of the pipeline.

A fresh-look playbook: existing Agile ceremonies, the new ones that earn a place, the role shifts that come with them, and the bottlenecks that emerge once code stops being the constraint.

1. The unit of work has changed

The most consequential pattern in AI-assisted engineering, across the variants currently in play, is that the spec — not the ticket, not the PR — is becoming the unit of work. A few of the families to know:

Birgitta Böckeler, writing for Thoughtworks, organizes these into three levels of rigor: spec-first (write spec, then build), spec-anchored (keep the spec live and enforced as the system evolves), and spec-as-truth (the spec is the source; the code is generated). Most teams adopting this in earnest are operating somewhere between spec-first and spec-anchored.

The variants differ. The pattern doesn't: spec is durable, code is regenerable, validation is explicit.

2. What actually shifts underneath

The implications for ceremonies fall out of a handful of shifts:

Ceremonies that used to allocate time-to-implement now need to allocate time-to-align and time-to-validate. That's the lens for everything that follows.

3. The playbook — ceremony by ceremony

The five ceremonies any team will recognize, in their pre-AI form and the form that earns its slot today.

CeremonyPre-AI purposePre-AI failure modeAI-era shapeNow optimizes for
StandupSync work-in-flight, surface blockersStatus theaterDecision-and-blocker board, async-firstAlignment latency
Backlog refinement / groomingMake stories sprint-readyUnder-refined stories pushed into sprintSpec workshop — produces a versioned spec artifactSpec quality at the point of generation
Sprint planningCommit to a scopeEstimating in author-time onlyCommit to a validation budget; smaller batchesRealistic throughput across the whole pipe
Demo / sprint reviewShow what was builtPolished UI hides shaky decisionsDecision review — show spec, trade-offs, validation evidenceStakeholder confidence in the decision
RetrospectiveImprove processVague complaints, no follow-throughAdds AI-usage retro reading audit logs and eval trendsTrust calibration over time
Table 1 — Existing ceremonies, pre-AI vs. AI-era.

Standup — from status to decisions and blockers

Audit trails already say what was done. What they don't surface is where I'm stuck deciding and which spec needs another pair of eyes — and the synchronous slot earns its place only when there's something to decide together.

Backlog refinement — from grooming to spec workshop

"Ready" now means a structured spec — REASONS canvas, EARS, /specify, ADR — good enough to drive code generation. The session is a workshop; tech lead and senior engineer sit alongside the PM; the output is a versioned artifact, not a ticket comment.

Sprint planning — from author-budget to validator-budget

AI compressed author-hours; nobody compressed validator-hours, so capacity is validator-bound. Two-week boundaries get awkward both ways — too long for AI throughput, too short for validation depth. Plan in specs authored, generated against, validated, and shipped.

Demo / sprint review — from feature demo to decision review

AI makes everything look shippable, so demo-by-screenshot tells you less than it used to. The high-leverage demo is the decision review: spec, trade-offs taken, alternatives rejected, validation evidence. Stakeholders walk away knowing why this option, why these criteria, what we'd change if we did it again — the value a person stamps onto machine-assisted output.

Retrospective — same purpose, new evidence base

Failure modes are now AI-specific — hallucinated APIs that passed review, prompt regressions, eval drift, validators rubber-stamping output. These leave evidence: audit logs, eval trends, drift reports. Add an AI-usage retro that reads those artifacts. Where did we trust output we shouldn't have? Where did we re-verify what didn't need it? Trust calibration over time.

4. New ceremonies that earn a slot

A handful of ceremonies have no good pre-AI analogue. They earn a slot because the bottleneck moved.

New ceremonyWhy it now earns a slotCadenceBottleneck addressed
Spec reviewSenior judgment now lives in the spec; bad specs hide as fluent codePer spec, before generationAlignment latency, spec quality
Eval & regression reviewWithout a periodic read, regressions go silentWeekly or sprint-alignedEval coverage gaps, silent drift
Prompt-library / context tendingThe agent's context is a product; it rots without an ownerSprintly, often folded into spec or eval reviewContext / prompt-library decay
Validation triage (optional)Choke point is human reviewer capacity, not author capacityAs needed; small standing slotValidator capacity
Table 2 — New ceremonies for AI-assisted teams.

Spec review

Design review, reborn — lighter than a code review, heavier than story refinement, closest to an ADR review. Bad specs review as fluent code; good specs review as a system that holds together. Catch the difference before generation, not after.

Eval & regression review

Borrowed from ML ops, now standard for any AI-assisted codebase. New failure modes? Baseline regressions? Models silently degraded after an upgrade? If you don't have eval coverage, that's the project — start there.

Prompt-library / context tending

CLAUDE.md, AGENTS.md, spec library, shared prompts — a product the team consumes, with the hygiene of dependencies: owner, review cadence, retirement of stale entries. Often folded into spec or eval review — but somebody owns it.

Validation triage (optional)

Borrowed from on-call triage: which outputs warrant deep human review, which take cheap automated checks. Not every team needs this; the ones whose validation queue blows out a sprint know who they are.

5. How the roles shift

Ceremony change without role change is theatre. The shifts line up across the org:

RolePre-AI focusAI-era focusWhere this role now leans in
Engineer (IC)Implement to spec, write tests, peer-review codeAuthor specs, validate AI output, design eval cases, tend the prompt librarySpec workshops, spec review, AI-usage retros
Engineering managerCapacity by author-hours, unblock, coach craftCapacity by validator-hours, manage review queues, watch eval trendsSprint planning, validation triage, retros
Director / VP engineeringOrg design, headcount-to-throughputSmaller teams, denser accountability; fund eval / audit / prompt-library infrastructureTooling and platform investment, cross-team eval reviews
Product manager / partnerWrite stories, prioritize backlog, align stakeholdersOwn the spec, not just the story; co-write criteria precise enough to drive generationSpec workshops, spec review, decision-style demos
Stakeholder / business sponsorEngage at demo and releaseEngage at the spec stage, where input still shapes the outcomeSpec review, decision-style demos
QA / testerWrite and run tests against requirements; bug triageOwn the eval suite; design validation strategy; triageEval & regression review, validation triage, spec review
Platform / infrastructureCI/CD, observability, on-callSame plus agent infrastructure: audit, budget, eval pipelines, prompt-library toolingPrompt-library tending, eval pipelines, audit guardrails
Operations / SREUptime, incident response, capacitySame plus agent ops: cost circuit breakers, runaway-loop detection, drift detectionAI-usage retros (incidents), eval review (silent degradation)
Table 3 — Role shifts in AI-assisted engineering teams.

Each role moves up a level of abstraction. The org chart compresses; the responsibility per name does not.

6. The new bottlenecks (and where to put the touch points)

Reorganize ceremonies for yesterday's bottleneck and you'll regret it in six months. Today's pairs of bottleneck → touch point (owner):

These will move again. The discipline is asking each quarter: what's the choke point now, and which ceremony addresses it?

Closing thought

Ceremonies are forcing functions for alignment. AI didn't reduce the need for alignment — it concentrated the value of alignment into fewer, higher-stakes touch points. The teams that adopt these tools well aren't the ones with the shiniest agents. They're the ones who reorganized their rituals around the new bottleneck, lifted each role one level of abstraction to match, and put a ceremony exactly where it earned its slot.

Same lesson as before: leverage exposes the next constraint. Do the work to find it, and put a ceremony there.


Further reading

← All posts