Product Delivery · 9 min read · Updated 2026-06-18

Scrum in the Agentic Era: Which Ceremonies Survive

When agents write much of the code, the question is not whether Scrum dies. It is that its ceremonies were tuned for scarce engineering capacity — and capacity is no longer the constraint. Re-point them at the one that is: acceptance.

In brief

Scrum’s ceremonies were tuned for scarce engineering capacity; in agentic delivery capacity is abundant and acceptance is scarce, so each ceremony must be re-pointed from coordinating production to governing acceptance.

  • The ceremonies survive as slots in the week, but their job changes — from coordinating the build to governing what gets accepted.
  • Refinement becomes specification: the unit of work is an executable spec with acceptance criteria, decomposed into atomic, independently verifiable tasks with human gates.
  • Effort is decoupled from value, so velocity/story points mislead alone — size the verification load and pair every effort metric with an acceptance or outcome metric.
  • The sprint survives as an assurance cadence (inspect acceptance, rework, the harness), not a fixed batch of build capacity.
  • The Scrum Master re-points toward flow stewardship, agent-autonomy governance and context/eval curation; transactional ceremony-administration is most exposed.

The question doing the rounds is "if agents write the code, is Scrum dead?" It is the wrong question, and it leads teams to the wrong answers — either ripping out the ceremonies in a burst of enthusiasm, or defending them unchanged because they still fill the calendar. Both miss what actually happened. Scrum’s ceremonies were designed for a world in which engineering capacity was the scarce resource: planning rationed it, estimation forecast it, the standup unblocked it, the retro tuned it. Agentic delivery removes that scarcity. Generation becomes abundant — and the ceremonies are now optimising for a bottleneck that has moved.

So the honest framing is neither "dead" nor "unchanged". The ceremonies survive as slots in the week; their *job* changes. Each one has to be re-pointed from coordinating production to governing acceptance. Get that right and Scrum becomes more useful in agentic delivery, not less — because the work it now coordinates (deciding what to build, whether to trust what was built, and how far to let agents act) is exactly the work that has become scarce.

The constraint moved; the calendar did not

Start with the mechanism. When a developer wrote most of the code, cycle time was dominated by the time to build. Speed up the building and you sped up delivery, which is why a decade of tooling — and a decade of ceremonies — aimed there. With agents drafting changes across many files in minutes, that step compresses, and the dominant cost shifts downstream to review and verification. Practitioner analyses now describe this directly: faster generation produces slower review, because a human still has to understand and accept each change, and there are suddenly far more of them. LeadDev catalogued the metrics this breaks; engineering teams such as Agoda have reported the bottleneck moving squarely into code review. This is the same shift we write about everywhere on this site — once generation is abundant, acceptance is the constraint — showing up in the team’s weekly rhythm.

The bottleneck moves downstream: generation is cheap, acceptance is dear. Ceremonies tuned for the left-hand constraint are mis-tuned for the right.
Ceremonies optimised for scarce engineering capacity are mis-tuned when capacity is abundant and assurance is the scarce resource.

Refinement becomes specification

Backlog refinement used to mean slicing stories small enough to estimate and assign. In agentic delivery the unit of work changes shape: the artefact an agent acts on is a specification precise enough to be executed. The industry has converged on a name for this — spec-driven development — and the practical guidance from GitHub and O’Reilly is consistent: write the intent, the constraints and the acceptance criteria so unambiguously that they become executable inputs, then decompose the work into atomic, independently verifiable tasks, each with a human gate. "Grooming" stops being a sizing exercise and becomes the authoring of the contract the agent will be held to. The acceptance criteria are no longer a footnote to the story; they are the most important thing in it, because they are what the evaluation harness will check.

Estimation: effort stops predicting value

Story points and velocity were always a proxy. They measured effort and trusted that effort tracked value closely enough to plan with. Agentic delivery weakens that link: the same outcome can cost wildly different amounts of human effort depending on how much review and rework the generated change demands. Be precise about the claim, though — it is tempting to say AI makes effort "free" and velocity therefore meaningless, and that overshoots the evidence. Effort is not free; it has moved, from typing to verifying. The defensible statement is narrower and more useful: effort is now decoupled from value, so effort-based metrics mislead when read alone. The fix is not to abandon measurement but to size the verification load rather than the typing, and to pair every effort metric with an acceptance or outcome metric — measure what lands, not what was generated.

Standup, review and retro: same slots, new jobs

Once you see the pattern, the rest of the ceremonies re-point themselves. The daily standup stops being about unblocking the build and becomes about stewarding flow through the review-and-verification queue — what is waiting to be accepted, what is safe to accept, and where an agent’s autonomy needs a human decision. The sprint review stops being a demo of what was built and becomes a presentation of what was accepted: the evidence, the evaluation results, and — just as important — what was rejected and why. The retrospective stops improving only the team’s process and starts improving the harness and the governance: the context the agents work from, the evaluations that gate them, and the boundaries of their autonomy. Same hour in the calendar; a different question on the agenda.

Does the sprint still make sense?

If cycle time collapses, the sprint-as-batch — commit a fortnight of capacity, deliver it, review it — starts to look arbitrary. Some have responded by shortening the loop dramatically (Scrum.org has floated daily sprints); others propose a dual rhythm that separates the fast generate-and-verify loop from a slower planning-and-alignment cadence. This part is genuinely unsettled, and we hold it loosely. Our working view: the sprint survives, but as an assurance cadence rather than a production batch — a regular checkpoint where the team inspects acceptance, rework and the state of the harness, not how many points were burned down. The iteration was always meant to create a feedback loop; keep the loop, change what it inspects.

The sprint as an assurance cadence: the loop survives, but it inspects acceptance, rework and the harness — not velocity.

The Scrum Master: from facilitator to flow-and-assurance steward

The role most exposed by all of this is the Scrum Master, and the commentary is loud and contradictory — from "the role is in decline" to "it has never mattered more". Read across the sources and a believable middle emerges, which we grade as observed rather than settled: the purely transactional Scrum Master who administers ceremonies is at risk, while the role itself re-points toward stewarding flow, governing agent autonomy, and curating the context and evaluations the team relies on. In many organisations that work will merge with engineering-management, TPM or a delivery-assurance function rather than survive as a standalone title. The skill that appreciates is judgement about throughput and risk; the skill that depreciates is running the meeting. We would not bet the org chart on any single prediction here yet — but we would invest in the assurance skills regardless of what the box is called.

Re-point the ceremony: what to do on Monday

You do not need a new framework to act on this; you need to give each existing ceremony a new job description tuned to the constraint that now binds. Concretely:

  • Refinement: author acceptance criteria precise enough to be executable, decomposed into atomic, independently verifiable tasks — not story slices to estimate.
  • Planning: commit to an acceptance budget (how much can we review and verify well), not just a capacity budget (how much can we build).
  • Estimation: size the verification load, and pair every effort metric with an acceptance or outcome metric.
  • Standup: manage the review-and-verification queue and surface agent-autonomy decisions, not just blockers to building.
  • Review: show what was accepted — evidence and evaluation results — and what was rejected and why, not only what was built.
  • Retrospective: improve the harness, the context and the governance, not only the team’s process.
Re-point the ceremony: same six slots, a new job each — old job tuned for scarce capacity, new job tuned for scarce assurance.

Scrum is not dead in the agentic era, and it is not untouched. Its ceremonies were a good answer to a question — how do we coordinate scarce engineering capacity — that is no longer the binding one. Re-point them at the question that is, and the framework earns its place. For the organisational version of this argument, read why alignment beats agile; for the lifecycle it sits inside, the agentic SDLC; and for the metrics that should anchor the new cadence, how to measure AI engineering.

Frequently asked

Is Scrum dead in AI-native (agentic) software engineering?
No. The ceremonies survive as slots in the week, but their job changes. They were tuned to coordinate scarce engineering capacity; with agents generating much of the code, the scarce work becomes specifying, reviewing and accepting change. Re-point each ceremony at acceptance and Scrum becomes more useful, not less.
Should we still use story points and velocity?
Use them with care. Velocity measured effort as a proxy for value; AI decouples effort from value, so effort-based metrics mislead when read alone. Effort is not "free" — it moves from typing to verifying. Size the verification load and pair every effort metric with an acceptance or outcome metric.
Do sprints still make sense if cycle time collapses?
This is genuinely contested. Our view is that the sprint survives as an assurance cadence — a regular checkpoint to inspect acceptance, rework and the state of the harness — rather than as a fixed batch of build capacity. Keep the feedback loop; change what it inspects.
What happens to the Scrum Master role?
It re-points from facilitating process toward stewarding flow, governing agent autonomy, and curating context and evaluations. Transactional, ceremony-administering Scrum Masters are most exposed; in many teams the work merges with engineering management, TPM or delivery assurance. The evidence here is observed, not settled.
What is the new unit of work?
An executable specification. Refinement becomes the authoring of intent, constraints and acceptance criteria precise enough to be executed by an agent, decomposed into atomic, independently verifiable tasks each with a human acceptance gate.

Our perspective

The common view

AI either kills Scrum (ceremonies are waste once agents code) or changes nothing (keep the framework as-is).

The Ivaaya view

Neither. The ceremonies are mis-tuned, not obsolete: they were optimised for scarce engineering capacity, and the binding constraint has moved to acceptance — so re-point each ceremony from coordinating production to governing acceptance.

If AI makes effort nearly free, velocity is meaningless and estimation is pointless.
Effort is not free — it moves from typing to verifying. The accurate claim is narrower: effort is decoupled from value, so effort-based metrics mislead alone. Size the verification load and pair effort with acceptance/outcome metrics rather than abandoning measurement.
Collapsing cycle time makes the sprint obsolete.
The batch interpretation weakens, but the feedback loop does not. Keep the sprint as an assurance cadence that inspects acceptance, rework and the harness; change what it inspects, not whether it exists.
  • Plan an acceptance budget alongside the capacity budget.
  • Make acceptance criteria executable and the review queue the thing the standup manages.
  • Invest in flow/assurance skills regardless of whether the "Scrum Master" title survives.
The evidence & related ideas →

What we’ve observed

  • With agents generating change in minutes, review and verification — not coding — increasingly dominate cycle time, so a cadence tuned to build throughput is mis-tuned.
  • Effort-based metrics (velocity, story points, PR volume) decouple from value under AI, but effort is not free — it moves from typing to verifying.
  • The Scrum Master role is contested: transactional ceremony-administration is most exposed, while flow/assurance stewardship appreciates.

How certain are we?

  • The delivery bottleneck shifts from coding to review/verification under agentic deliveryobserved: Seen consistently in our own work.
  • Effort-based velocity metrics decouple from value and mislead when read aloneobserved: Seen consistently in our own work.
  • The unit of work shifts toward executable specificationsobserved: Seen consistently in our own work.
  • The Scrum Master role re-points toward flow/assurance stewardshipemerging: Still early, but increasingly visible.

Related ideas