The question doing the rounds is "if agents write the code, is Scrum dead?" It is the wrong question, and it leads teams to the wrong answers — either ripping out the ceremonies in a burst of enthusiasm, or defending them unchanged because they still fill the calendar. Both miss what actually happened. Scrum’s ceremonies were designed for a world in which engineering capacity was the scarce resource: planning rationed it, estimation forecast it, the standup unblocked it, the retro tuned it. Agentic delivery removes that scarcity. Generation becomes abundant — and the ceremonies are now optimising for a bottleneck that has moved.
So the honest framing is neither "dead" nor "unchanged". The ceremonies survive as slots in the week; their *job* changes. Each one has to be re-pointed from coordinating production to governing acceptance. Get that right and Scrum becomes more useful in agentic delivery, not less — because the work it now coordinates (deciding what to build, whether to trust what was built, and how far to let agents act) is exactly the work that has become scarce.
The constraint moved; the calendar did not
Start with the mechanism. When a developer wrote most of the code, cycle time was dominated by the time to build. Speed up the building and you sped up delivery, which is why a decade of tooling — and a decade of ceremonies — aimed there. With agents drafting changes across many files in minutes, that step compresses, and the dominant cost shifts downstream to review and verification. Practitioner analyses now describe this directly: faster generation produces slower review, because a human still has to understand and accept each change, and there are suddenly far more of them. LeadDev catalogued the metrics this breaks; engineering teams such as Agoda have reported the bottleneck moving squarely into code review. This is the same shift we write about everywhere on this site — once generation is abundant, acceptance is the constraint — showing up in the team’s weekly rhythm.
Ceremonies optimised for scarce engineering capacity are mis-tuned when capacity is abundant and assurance is the scarce resource.
Refinement becomes specification
Backlog refinement used to mean slicing stories small enough to estimate and assign. In agentic delivery the unit of work changes shape: the artefact an agent acts on is a specification precise enough to be executed. The industry has converged on a name for this — spec-driven development — and the practical guidance from GitHub and O’Reilly is consistent: write the intent, the constraints and the acceptance criteria so unambiguously that they become executable inputs, then decompose the work into atomic, independently verifiable tasks, each with a human gate. "Grooming" stops being a sizing exercise and becomes the authoring of the contract the agent will be held to. The acceptance criteria are no longer a footnote to the story; they are the most important thing in it, because they are what the evaluation harness will check.
Estimation: effort stops predicting value
Story points and velocity were always a proxy. They measured effort and trusted that effort tracked value closely enough to plan with. Agentic delivery weakens that link: the same outcome can cost wildly different amounts of human effort depending on how much review and rework the generated change demands. Be precise about the claim, though — it is tempting to say AI makes effort "free" and velocity therefore meaningless, and that overshoots the evidence. Effort is not free; it has moved, from typing to verifying. The defensible statement is narrower and more useful: effort is now decoupled from value, so effort-based metrics mislead when read alone. The fix is not to abandon measurement but to size the verification load rather than the typing, and to pair every effort metric with an acceptance or outcome metric — measure what lands, not what was generated.
Standup, review and retro: same slots, new jobs
Once you see the pattern, the rest of the ceremonies re-point themselves. The daily standup stops being about unblocking the build and becomes about stewarding flow through the review-and-verification queue — what is waiting to be accepted, what is safe to accept, and where an agent’s autonomy needs a human decision. The sprint review stops being a demo of what was built and becomes a presentation of what was accepted: the evidence, the evaluation results, and — just as important — what was rejected and why. The retrospective stops improving only the team’s process and starts improving the harness and the governance: the context the agents work from, the evaluations that gate them, and the boundaries of their autonomy. Same hour in the calendar; a different question on the agenda.
Does the sprint still make sense?
If cycle time collapses, the sprint-as-batch — commit a fortnight of capacity, deliver it, review it — starts to look arbitrary. Some have responded by shortening the loop dramatically (Scrum.org has floated daily sprints); others propose a dual rhythm that separates the fast generate-and-verify loop from a slower planning-and-alignment cadence. This part is genuinely unsettled, and we hold it loosely. Our working view: the sprint survives, but as an assurance cadence rather than a production batch — a regular checkpoint where the team inspects acceptance, rework and the state of the harness, not how many points were burned down. The iteration was always meant to create a feedback loop; keep the loop, change what it inspects.
The Scrum Master: from facilitator to flow-and-assurance steward
The role most exposed by all of this is the Scrum Master, and the commentary is loud and contradictory — from "the role is in decline" to "it has never mattered more". Read across the sources and a believable middle emerges, which we grade as observed rather than settled: the purely transactional Scrum Master who administers ceremonies is at risk, while the role itself re-points toward stewarding flow, governing agent autonomy, and curating the context and evaluations the team relies on. In many organisations that work will merge with engineering-management, TPM or a delivery-assurance function rather than survive as a standalone title. The skill that appreciates is judgement about throughput and risk; the skill that depreciates is running the meeting. We would not bet the org chart on any single prediction here yet — but we would invest in the assurance skills regardless of what the box is called.
Re-point the ceremony: what to do on Monday
You do not need a new framework to act on this; you need to give each existing ceremony a new job description tuned to the constraint that now binds. Concretely:
- Refinement: author acceptance criteria precise enough to be executable, decomposed into atomic, independently verifiable tasks — not story slices to estimate.
- Planning: commit to an acceptance budget (how much can we review and verify well), not just a capacity budget (how much can we build).
- Estimation: size the verification load, and pair every effort metric with an acceptance or outcome metric.
- Standup: manage the review-and-verification queue and surface agent-autonomy decisions, not just blockers to building.
- Review: show what was accepted — evidence and evaluation results — and what was rejected and why, not only what was built.
- Retrospective: improve the harness, the context and the governance, not only the team’s process.
Scrum is not dead in the agentic era, and it is not untouched. Its ceremonies were a good answer to a question — how do we coordinate scarce engineering capacity — that is no longer the binding one. Re-point them at the question that is, and the framework earns its place. For the organisational version of this argument, read why alignment beats agile; for the lifecycle it sits inside, the agentic SDLC; and for the metrics that should anchor the new cadence, how to measure AI engineering.
Frequently asked
- Is Scrum dead in AI-native (agentic) software engineering?
- No. The ceremonies survive as slots in the week, but their job changes. They were tuned to coordinate scarce engineering capacity; with agents generating much of the code, the scarce work becomes specifying, reviewing and accepting change. Re-point each ceremony at acceptance and Scrum becomes more useful, not less.
- Should we still use story points and velocity?
- Use them with care. Velocity measured effort as a proxy for value; AI decouples effort from value, so effort-based metrics mislead when read alone. Effort is not "free" — it moves from typing to verifying. Size the verification load and pair every effort metric with an acceptance or outcome metric.
- Do sprints still make sense if cycle time collapses?
- This is genuinely contested. Our view is that the sprint survives as an assurance cadence — a regular checkpoint to inspect acceptance, rework and the state of the harness — rather than as a fixed batch of build capacity. Keep the feedback loop; change what it inspects.
- What happens to the Scrum Master role?
- It re-points from facilitating process toward stewarding flow, governing agent autonomy, and curating context and evaluations. Transactional, ceremony-administering Scrum Masters are most exposed; in many teams the work merges with engineering management, TPM or delivery assurance. The evidence here is observed, not settled.
- What is the new unit of work?
- An executable specification. Refinement becomes the authoring of intent, constraints and acceptance criteria precise enough to be executed by an agent, decomposed into atomic, independently verifiable tasks each with a human acceptance gate.