Redr · Study Guide

The Phoenix Project

A Novel About IT, DevOps, and Helping Your Business Win

Gene Kim, Kevin Behr, and George Spafford

Unofficial AI-assisted study guide. Not affiliated with or endorsed by the author or publisher. For educational use — supplements, not replaces, the original work.

Contents

Part 01

Diagnosing the Plant — Bill Inherits the Mess

01Tuesday, September 2
02Tuesday, September 2
03Tuesday, September 2
04Wednesday, September 3
05Thursday, September 4
06Friday, September 5
07Friday, September 5
08Monday, September 8
09Tuesday, September 9

Part 02

Erik, the Four Types of Work, and the Three Ways

10Tuesday, September 9
11Thursday, September 11
12Friday, September 12
13Monday, September 15
14Tuesday, September 16
15Wednesday, September 17
16Friday, September 19
17Friday, September 19
18Saturday, September 20
19Monday, September 22
20Tuesday, September 23
21Friday, September 26

Part 03

DevOps, Unicorn, and the Future of IT

22Monday, September 22
23Tuesday, September 23
24Saturday, September 27
25Monday, September 29
26Wednesday, October 1
27Wednesday, October 8
28Thursday, October 16
29Friday, October 17
30Tuesday, October 21
31Friday, October 24
32Monday, November 10
33Monday, November 3
34Black Friday Week
35Friday, November 14

Part 01 · Diagnosing the Plant — Bill Inherits the Mess
01Tuesday, September 2
02Tuesday, September 2
03Tuesday, September 2
04Wednesday, September 3
05Thursday, September 4
06Friday, September 5
07Friday, September 5
08Monday, September 8
09Tuesday, September 9
Part 02 · Erik, the Four Types of Work, and the Three Ways
10Tuesday, September 9
11Thursday, September 11
12Friday, September 12
13Monday, September 15
14Tuesday, September 16
15Wednesday, September 17
16Friday, September 19
17Friday, September 19
18Saturday, September 20
19Monday, September 22
20Tuesday, September 23
21Friday, September 26
Part 03 · DevOps, Unicorn, and the Future of IT
22Monday, September 22
23Tuesday, September 23
24Saturday, September 27
25Monday, September 29
26Wednesday, October 1
27Wednesday, October 8
28Thursday, October 16
29Friday, October 17
30Tuesday, October 21
31Friday, October 24
32Monday, November 10
33Monday, November 3
34Black Friday Week
35Friday, November 14

Part 01

Diagnosing the Plant — Bill Inherits the Mess

Ch. 1–9

Ch. 01

Tuesday, September 2

Bill Palmer is ambushed with a promotion to VP of IT Operations after the CIO and his boss are fired, then plunged into a payroll outage within the hour. The opening chapter establishes the book's central tension: IT is mission-critical but treated like plumbing, and unplanned work immediately crushes everything Bill planned to do.

Ch. 01

IT as Invisible Plumbing

CEO Steve Masters frames IT as "like the toilet" — only noticed when broken. The chapter sets up the recurring fight to reframe IT as a strategic capability rather than a cost center.

Ch. 01

CIO Churn as a Systemic Symptom

Parts Unlimited burns through CIOs every couple of years. When the same role keeps failing, the system is broken — not the individuals.

Ch. 01

Unplanned Work Crushes Planned Work

Bill's first hour as VP is consumed by a Sev 1, foreshadowing the book's core insight: unplanned work is the most destructive of the four work types because it displaces everything else.

Ch. 01

Business-First Incident Triage

Bill instinctively asks "who is affected and what's the deadline?" before chasing technical detail — modeling business-impact framing instead of engineer reflex.

Ch. 01

Single Points of Failure in People

Wes and Patty reflexively invoke Brent in every escalation. The audience is being primed for the bottleneck reveal that defines the whole novel.

Ch. 01

Heroics Culture

The org rewards firefighters more than fire-preventers, normalizing crisis as the path to recognition. Bill will later identify this as the disease, not the cure.

Ch. 01 · Vocab

VP of IT Operations: Executive accountable for the day-to-day running of production systems.
Project Phoenix: The strategic, over-budget retail-revival initiative that frames every conflict in the novel.
Payroll run: The scheduled batch process that calculates and disburses employee pay; missing it has legal and union consequences.
Stakeholder: Anyone with a vested interest in an IT outcome — HR, Finance, union, employees.

Ch. 01 · Vocab

Sev 1: The highest-priority production incident; business-critical, all-hands.
Parts Unlimited: The fictional auto-parts manufacturer where the novel takes place.

Ch. 01 · Quiz1 / 4

Multiple choice

Steve Masters tells Bill that IT should be like the toilet — only noticed when broken. What pathology is this attitude meant to expose?

Ch. 01 · Quiz2 / 4

True / False

Bill's first hour as VP of IT Operations is spent on a planned project rollout.

Ch. 01 · Quiz3 / 4

Spot the issue

A new VP walks into a Sev 1 bridge and immediately starts asking the on-call engineer detailed questions about which library version was loaded into the JVM. What's the leadership mistake?

Ch. 01 · Quiz4 / 4

Multiple choice

Parts Unlimited has burned through several CIOs in a few years. What conclusion does the chapter want the reader to draw?

Ch. 02

Tuesday, September 2

Bill drives in, meets his lieutenants Wes and Patty, and works the payroll incident. A long-deferred SAN firmware upgrade and an InfoSec tokenization deployment ran concurrently on the same systems, making root cause analysis nearly impossible — exposing the absence of basic change discipline.

Ch. 02

Brent the Bottleneck (First Sighting)

Every escalation routes to one engineer. Wes and Patty instinctively reach for Brent — the bottleneck pattern appears before it is named.

Ch. 02

Concurrent Unauthorized Changes

A SAN firmware upgrade and a tokenization deployment fired in parallel without coordination. Change collisions make failures impossible to isolate.

Ch. 02

Deferred Maintenance as Risk

The SAN firmware was years behind. Postponing routine work accumulates change debt that detonates all at once.

Ch. 02

Triage Over Blame

Bill insists on restoring service and reconstructing a timeline of changes first, rather than hunting for someone to fire — modeling good incident command.

Ch. 02

Functional Silos

Server, network, DBA, and security teams operate as separate kingdoms with no shared change calendar. The silos are political, not technical.

Ch. 02 · Vocab

SAN (Storage Area Network): Shared block-storage fabric serving many servers; a foundational dependency whose failure cascades.
Firmware upgrade: Low-level software update to a device; risky because it touches the platform every workload depends on.
Tokenization: Replacing sensitive data (e.g., SSNs) with non-sensitive tokens for compliance.
NOC (Network Operations Center): The 24/7 room or bridge where outages are managed.

Ch. 02 · Vocab

Runbook: Documented procedure for performing or recovering from a specific operational task.
Blast radius: The scope of systems and customers affected by a failure.

Ch. 02 · Quiz1 / 4

Multiple choice

What made root cause analysis of the payroll outage nearly impossible?

Ch. 02 · Quiz2 / 4

Spot the issue

A team has been postponing a SAN firmware upgrade for three years because "it's risky." When they finally do it, three workloads break simultaneously. What lesson is this telling?

Ch. 02 · Quiz3 / 4

Multiple choice

Bill's first move on the incident bridge is to insist on reconstructing a timeline of all recent changes rather than identifying a culprit. Why?

Ch. 02 · Quiz4 / 4

True / False

The fact that Wes and Patty both instinctively reach for Brent during the outage is evidence of a healthy on-call rotation.

Ch. 03

Tuesday, September 2

Bill discovers John Pesche's InfoSec team deployed an untested SSN-tokenization application — bypassing the Change Advisory Board — at exactly the time of the SAN upgrade. The CAB exists on paper but is widely ignored. Bill mandates reconstruction of every change from the last three days.

Ch. 03

CAB Dysfunction

The Change Advisory Board exists but attendance is optional and changes route around it — process theater common in real IT shops.

Ch. 03

Security as a Rogue Stakeholder

John Pesche bypasses change control "because of an audit deadline" — showing how compliance pressure can perversely cause the very outages it claims to prevent.

Ch. 03

Root Cause Requires a Change Timeline

Without an authoritative log of what changed when, post-incident review degenerates into finger-pointing. Visibility precedes accountability.

Ch. 03

Process Discipline Over Heroics

Bill's first executive act is reasserting the CAB rather than chasing the bug — a leadership signal that the system, not the engineer, needs fixing.

Ch. 03

Brent's Trading Desk

The physical image of one engineer in front of four monitors visually encodes the bottleneck. Knowledge concentrated in one human is a load-bearing person, not a load-bearing system.

Ch. 03 · Vocab

CAB (Change Advisory Board): Standing meeting that reviews and approves proposed production changes for risk and conflicts.
CISO (Chief Information Security Officer): John Pesche's title; head of information security.
Production change: Any modification to a live system; the unit of work the CAB governs.
Unauthorized change: A modification deployed without going through the approval process.

Ch. 03 · Vocab

Post-incident review: Structured analysis of what failed and why; depends on accurate change history.
Maintenance window: A pre-scheduled, communicated period during which disruptive changes may occur.

Ch. 03 · Quiz1 / 4

Multiple choice

John Pesche bypasses the CAB to deploy the tokenization change because of an audit deadline. What does the book want this to illustrate?

Ch. 03 · Quiz2 / 4

Multiple choice

The CAB at Parts Unlimited exists on paper, but attendance is optional and changes flow around it. What term best describes this state?

Ch. 03 · Quiz3 / 4

Spot the issue

An engineer sits at a desk surrounded by four monitors, juggling pages from every team in the company. His manager calls him "irreplaceable." What's wrong with this picture?

Ch. 03 · Quiz4 / 4

True / False

Bill's first executive act in Chapter 3 is to identify and fire whoever caused the outage.

Ch. 04

Wednesday, September 3

Bill wakes to 526 emails after 22 hours on the job and attends a Phoenix status meeting where Sarah Moulton blames Operations for delays Development actually caused. Phoenix has ballooned to ~50 people, and Bill recognizes the death-march pattern: compress the schedule, squeeze testing and Ops out at the end. The SOX-404 audit emerges as a parallel threat.

Ch. 04

The Death-March Anti-pattern

Date-driven projects where developers consume all slack, leaving zero time for testing or deployment. Bill names the pattern explicitly.

Ch. 04

Dev-vs-Ops Blame Cycle

Business stakeholders reflexively blame Ops even when Dev is late. The silo wall is political, not technical.

Ch. 04

Brooks's Law

Throwing offshore contractors at a late project makes it later — adding bodies adds coordination cost faster than throughput.

Ch. 04

Hidden Off-the-Books Work

Stakeholders walk projects into Ops without intake, so capacity is invisible. What can't be seen can't be managed.

Ch. 04

Audit Pressure as Parallel Threat

The looming SOX-404 audit competes with Phoenix for the same scarce people, foreshadowing the capacity crisis that will dominate Part One.

Ch. 04 · Vocab

Phoenix Project: Parts Unlimited's strategic retail/e-commerce program; the recurring antagonist work item.
Brooks's Law: "Adding manpower to a late software project makes it later" (Fred Brooks).
Hand-off: Transfer of work between teams; every hand-off adds delay and defect risk.
Death march: Industry slang for a project doomed by impossible deadlines that nobody can renegotiate.

Ch. 04 · Vocab

SOX-404: Section 404 of the Sarbanes-Oxley Act requiring management to assess internal financial-reporting controls.
Production environment: The live systems serving customers; what every change ultimately threatens.

Ch. 04 · Quiz1 / 4

Multiple choice

Sarah Moulton's Phoenix team is late. Management's response is to add 20 offshore contractors. Which principle predicts this will make things worse?

Ch. 04 · Quiz2 / 4

Multiple choice

Bill notices that stakeholders are walking projects into his team without any intake step. What problem does this create?

Ch. 04 · Quiz3 / 4

Spot the issue

A Phoenix sub-team announces, "We're behind, so we're going to compress testing and roll straight to deploy on the original date." What pattern is this?

Ch. 04 · Quiz4 / 4

True / False

The SOX-404 audit is unrelated to Phoenix and can be safely deprioritized.

Ch. 05

Thursday, September 4

Nancy Mailer, the Chief Audit Executive, reveals a preliminary SOX-404 audit with ~950 findings and 16 significant deficiencies — mostly around uncontrolled production changes touching financial systems. Remediation alone would consume more capacity than Phoenix. The same lack of change discipline that broke payroll is now a material weakness in financial controls.

Ch. 05

Audit Findings as Symptoms

Hundreds of findings don't mean hundreds of bugs — they mean one broken system (change management) producing many violations.

Ch. 05

Significant Deficiency vs. Material Weakness

SOX language for "could lead to a financial misstatement" vs. "reasonable possibility of going undetected." Material weakness carries personal liability for executives.

Ch. 05

Compliance vs. Delivery as a False Choice

Audit work and Phoenix appear to compete, but both depend on the same underlying capability: controlled, visible change.

Ch. 05

Capacity Is Finite and Invisible

Bill cannot say yes or no to compliance work because he doesn't know what his team is already committed to. Hidden capacity drives bad commitments.

Ch. 05

IT General Controls (ITGCs)

The three pillars auditors test: access management, change management, and operations. All three are broken at Parts Unlimited.

Ch. 05 · Vocab

SOX-404: Sarbanes-Oxley §404; requires management and auditors to attest to internal-control effectiveness.
Significant deficiency: A control weakness that could lead to a financial misstatement.
Material weakness: A deficiency such that there's a reasonable possibility of a misstatement going undetected; disclosable to investors.
Internal audit: In-house function that tests controls before external auditors do.

Ch. 05 · Vocab

Segregation of duties (SoD): The control that no one person can both make and approve a financial-system change.
Compensating control: An alternate control used when the primary is missing or broken.

Ch. 05 · Quiz1 / 4

Multiple choice

The SOX-404 audit finds ~950 issues, but most trace to a single broken process. Which one?

Ch. 05 · Quiz2 / 4

True / False

A "material weakness" and a "significant deficiency" mean the same thing under SOX.

Ch. 05 · Quiz3 / 4

Multiple choice

Bill can't decide whether to commit to remediation work because he doesn't know what his team is already doing. What underlying problem is this?

Ch. 05 · Quiz4 / 4

Spot the issue

An IT director says, "We'll just defer the audit findings until Phoenix is done; they're a separate problem." Why is this dangerous?

Ch. 06

Friday, September 5

At another Phoenix meeting, Bill realizes Dev has punted testing into the next release. Pulling data with Patty, he discovers Operations has ~70+ active projects and that incident work consumes ~75% of staff time. The CAB experiments with index-card change requests funneled through Patty — a proto-Kanban intake.

Ch. 06

Visualize the Work

You cannot manage what you cannot see. Making changes physically visible on index cards is the first lean intervention.

Ch. 06

WIP Overload

A roughly 1:1 staff-to-project ratio means everyone multitasks and nothing finishes — classic queueing-theory pathology.

Ch. 06

Utilization vs. Throughput

Pushing utilization toward 100% explodes wait times. Throughput, not busy-ness, is the metric that matters.

Ch. 06

75% of Capacity Is Unplanned

Operations has almost no discretionary capacity left for projects — which is why Phoenix and the audit both stall.

Ch. 06

Lightweight Intake Beats Heavyweight Forms

Replacing complex forms with simple cards lowers friction and increases compliance — the opposite of the intuition that more bureaucracy creates more control.

Ch. 06 · Vocab

WIP (Work in Process): Started-but-unfinished work; the inventory of an IT system. Less is better.
Throughput: Rate at which finished work exits the system.
Lead time: Total elapsed time from request to delivery.
Cycle time: Time actually spent doing the work (excludes waiting).

Ch. 06 · Vocab

Kanban card: A physical token representing one unit of work.
Break-fix: Reactive repair work on broken systems; a major consumer of unplanned capacity.

Ch. 06 · Quiz1 / 4

Multiple choice

A team's staff-to-project ratio is roughly 1:1. What's the predictable result?

Ch. 06 · Quiz2 / 4

True / False

Operations should aim for 100% utilization to maximize value from staff.

Ch. 06 · Quiz3 / 4

Multiple choice

Patty replaces a long change-request form with a simple index card system. Why does compliance go *up*?

Ch. 06 · Quiz4 / 4

Spot the issue

Operations leadership claims they have "plenty of project capacity" but never seems to deliver any projects. Patty pulls the data and shows ~75% of hours go to unplanned break-fix work. What is leadership missing?

Ch. 07

Friday, September 5

Erik Reid — a rumpled prospective board member — drags Bill to MRP-8, one of Parts Unlimited's manufacturing plants. On the floor, Erik teaches that IT operations and a factory share the same problem: ensure fast, predictable flow of planned work. He introduces the **Four Types of Work**, the **Theory of Constraints**, and the seeds of the **Three Ways**.

Ch. 07

The Four Types of Work

Business projects, internal IT projects, changes, and unplanned work. The fourth is the killer — it's where capacity goes to die.

Ch. 07

Theory of Constraints

Any improvement not made at the bottleneck is an illusion; throughput is set by the constraint. Goldratt's five focusing steps: identify, exploit, subordinate, elevate, repeat.

Ch. 07

WIP Is the Silent Killer

Erik's headline lesson: releasing more work than the constraint can absorb is the root cause of every delivery problem.

Ch. 07

Utilization Drives Wait Time

Wait time grows roughly as %busy / %idle. A resource at 90% busy has wait time of about 9; at 99% it's about 99. This directly explains Brent.

Ch. 07

Local vs. Global Optimization

Improving a non-constraint can *reduce* total throughput. Only improvements at the constraint help the whole system.

Ch. 07

IT as a Factory

The plant tour is the book's central metaphor: IT obeys the same physics as a manufacturing line — flow, inventory, defects, and constraints all apply.

Ch. 07 · Vocab

Theory of Constraints: Goldratt's philosophy — system throughput is set by the single most-limiting resource.
Bottleneck: The resource whose capacity defines throughput; in IT Ops, Brent.
Four Types of Work: Business projects, internal IT projects, changes, unplanned work.
MRP-8: The manufacturing-resource-planning plant Erik tours with Bill.

Ch. 07 · Vocab

Lean / TPS (Toyota Production System): The manufacturing tradition behind small batches, pull systems, waste elimination.
Local vs. global optimization: Improving a station can hurt the whole system if it's not the constraint.

Ch. 07 · Quiz1 / 4

Multiple choice

Erik names four types of work. Which one does he call the most destructive?

Ch. 07 · Quiz2 / 4

Spot the issue

A team triples the throughput of its provisioning step, which sits upstream of an already-saturated database engineer. Why doesn't system throughput improve?

Ch. 07 · Quiz3 / 4

Multiple choice

Erik says wait time grows roughly as the ratio of percent-busy to percent-idle. At 90% utilization, that ratio is 9. At 99% it's ~99. Which IT character does this most directly explain?

Ch. 07 · Quiz4 / 4

True / False

The Theory of Constraints says you should optimize every station equally.

Ch. 08

Monday, September 8

Bill prepares a PowerPoint asking Steve to delay Phoenix, but Sarah Moulton is already in Steve's office and gets him to overrule Bill. Patty's CAB meeting reveals 173 changes scheduled for the same Friday Phoenix releases. The team begins triaging changes by risk — an early experiment in formal change control.

Ch. 08

Executive Override of Operational Reality

Steve makes a date-driven decision over IT's capacity warnings — the recurring pathology Erik described in the plant.

Ch. 08

Date-Driven vs. Capacity-Driven Planning

Setting launch dates independent of remaining work is how death marches start.

Ch. 08

Change Collision Risk

173 changes on one day stacked with a major release is a near-guaranteed multi-failure event. Visibility makes risk negotiable.

Ch. 08

Risk-Based Change Classification

Rather than treating all changes equally, the team sorts them into high/medium/low risk to focus scrutiny where it matters.

Ch. 08

War Room Mode Is Not a Strategy

Living in the Phoenix bridge generates activity, not flow. It's a symptom of broken upstream practices, not a solution.

Ch. 08 · Vocab

War room: A dedicated room or bridge for coordinating a major release or incident.
Risk classification: Bucketing changes by potential blast radius so review effort is proportional.
Standard change: A pre-approved, low-risk change type executable without per-instance CAB approval.
Release: A bundled deployment of changes to production.

Ch. 08 · Vocab

Cutover: The moment of switching customer traffic from the old system to the new one.
Change freeze: A period during which no non-emergency changes may be made.

Ch. 08 · Quiz1 / 4

Multiple choice

Patty's CAB exposes 173 changes scheduled for the same Phoenix-launch Friday. What benefit does that visibility provide?

Ch. 08 · Quiz2 / 4

True / False

Setting a launch date independent of remaining engineering work is sound business discipline.

Ch. 08 · Quiz3 / 4

Spot the issue

Bill and Wes essentially live in the Phoenix war room — they're on the bridge eighteen hours a day. What does Erik's framework tell us about this?

Ch. 08 · Quiz4 / 4

Multiple choice

The team begins classifying changes as high, medium, or low risk. Why is this an improvement over treating all changes equally?

Ch. 09

Tuesday, September 9

A Sev 1 fires — credit card processing is down across all stores. Dev, Networking, and Database teams point fingers; nobody owns the incident. Brent performs an unauthorized fix and the systems recover, leaving Bill suspicious that Brent both caused and resolved the outage. The next CAB has full attendance and discovers 100+ changes still scheduled for Phoenix Friday — Bill begins naming the four work types.

Ch. 09

Cross-Team Incident Dysfunction

Without a single incident commander, a Sev 1 degenerates into a finger-pointing tribunal.

Ch. 09

The Hero / Cowboy Anti-pattern

Brent fixing things off-bridge without telling anyone destroys observability and concentrates knowledge dangerously.

Ch. 09

Single-Person Dependency Risk

When recovery requires one specific human, the system has no resilience. Brent is a load-bearing person, not a load-bearing system.

Ch. 09

Fire Drills as Practice

Bill institutes scheduled fire drills so incident response is rehearsed rather than improvised — converting unplanned work into trainable scenarios.

Ch. 09

Visibility Forces Conversation

Once 100+ changes are visible on one day, stakeholders are forced to negotiate priority. Invisible work cannot be negotiated.

Ch. 09 · Vocab

MTTR (Mean Time To Recover): Average time to restore service after an incident.
Incident commander: The single person who runs an incident bridge, makes calls, and owns the timeline.
Conference bridge: The phone or voice channel where an incident is coordinated.
Root cause analysis (RCA): Structured technique to find the underlying cause rather than the proximate symptom.

Ch. 09 · Vocab

Fire drill: Practiced rehearsal of incident response.
Unplanned work: The fourth type of work; reactive work that displaces planned work.

Ch. 09 · Quiz1 / 4

Multiple choice

During the credit-card outage, Dev, Networking, and Database leads point fingers at each other on the bridge. What's the structural problem?

Ch. 09 · Quiz2 / 4

True / False

Brent quietly fixing a Sev 1 off-bridge without telling anyone is the kind of behavior the book celebrates as heroic.

Ch. 09 · Quiz3 / 4

Multiple choice

Bill institutes scheduled fire drills. What's the deeper purpose beyond just practicing?

Ch. 09 · Quiz4 / 4

Spot the issue

A second 100+ changes pile up for next Friday. Patty asks Steve to choose which ones get prioritized; Steve says "they're all priority one." What's wrong?

Part 02

Erik, the Four Types of Work, and the Three Ways

Ch. 10–21

Ch. 10

Tuesday, September 9

Steve summons Bill for an executive performance review and unwittingly inflicts more politics on him. Bill leaves frustrated. Meanwhile the team continues sorting work onto the new Kanban board and Erik's MRP-8 lesson keeps echoing — IT is not a special snowflake, it's a work-center system with flow, inventory, and constraints.

Ch. 10

Work Centers

Any IT process can be modeled as a work center with four elements: the machine, the man, the method, and the measures. IT obeys the same physics as a factory.

Ch. 10

Inventory in IT = WIP

Pending tickets, half-built features, and partially deployed changes are the IT equivalent of unfinished goods between machines.

Ch. 10

The "We're a Special Snowflake" Defense

IT teams resist manufacturing analogies, but the resistance is denial. The same Lean principles apply directly to technology work.

Ch. 10

The Mentor Archetype

Erik plays the Socratic Goldratt-style sensei — he refuses to give Bill answers, only questions and assignments, mirroring Jonah in *The Goal*.

Ch. 10

Visibility of Work

A factory floor lets you see every part moving; in IT, work is hidden inside servers, tickets, and people's heads — making it unmanageable.

Ch. 10 · Vocab

Work Center: A defined location where value-adding work happens, consuming inputs and producing outputs.
WIP (Work in Process): Started-but-unfinished work; the most important leading indicator of flow problems.
Operational Excellence: Erik's measuring stick — running IT like a high-performing plant rather than a heroic firefight.
The Goal: Goldratt's 1984 business novel; the literary template for *The Phoenix Project*.

Ch. 10 · Vocab

Throughput: Rate at which the system produces completed units of value.

Ch. 10 · Quiz1 / 4

Multiple choice

Erik claims any IT process can be modeled as a "work center" with four elements. What are they?

Ch. 10 · Quiz2 / 4

True / False

An IT shop's resistance to manufacturing analogies — "we're a special snowflake" — is generally well-founded because software is fundamentally different from hardware production.

Ch. 10 · Quiz3 / 4

Multiple choice

In an IT work-center model, what plays the role of "inventory"?

Ch. 10 · Quiz4 / 4

Spot the issue

Bill demands Erik just tell him "what to do." Erik responds with another question and another plant visit. Why does the book endorse Erik's refusal?

Ch. 11

Thursday, September 11

Bill tries to map IT operations as work centers and drowns in Phoenix demands, SOX findings, and yet more escalations. Until you can see and categorize your work, every new request feels equally urgent and the constraint cannot be protected.

Ch. 11

Unplanned Work as the Enemy

Outages, escalations, and audit fire drills consume capacity supposedly allocated to planned project work.

Ch. 11

Audit Findings as Process Symptoms

SOX-404 deficiencies are not a compliance problem — they're a process-control symptom of invisible chaos.

Ch. 11

"Hot Project" Pathology

Every executive treats Phoenix as the top priority, but "everything priority one" means nothing is prioritized.

Ch. 11

The Cost of Context Switching

Each interruption adds setup time and erodes the team's ability to finish anything. Multitasking destroys flow.

Ch. 11

Local vs. Global Optimization (Reprise)

Each manager optimizes their own queue without regard for total system throughput — guaranteeing that the system suffers even when every department "wins."

Ch. 11 · Vocab

Escalation: A ticket promoted past normal queues because someone important is yelling.
Change Request: A proposed modification to a production system.
Significant Deficiency: Audit language for a control weakness serious enough to potentially misstate financials.
Context switching: Switching attention between tasks; carries hidden setup-and-recovery costs.

Ch. 11 · Vocab

Capacity allocation: How a team's hours are divided across types of work.

Ch. 11 · Quiz1 / 4

Multiple choice

Every executive insists Phoenix is the top priority. What pathology does this create?

Ch. 11 · Quiz2 / 4

True / False

SOX-404 audit findings are best understood as a separate compliance workstream from operational chaos.

Ch. 11 · Quiz3 / 4

Spot the issue

An engineer is on five projects simultaneously, switching between them every hour. Her output is dropping even though she's "working harder." What's happening?

Ch. 11 · Quiz4 / 4

Multiple choice

Sarah optimizes her sales-feature queue, John optimizes his security findings, Chris optimizes his release schedule — and global throughput falls. Which principle explains this?

Ch. 12

Friday, September 12

Walking the floor with Wes and Patty, Bill catalogs work the teams actually do and realizes the unplanned-work volume is enormous. One engineer — Brent — is implicated in nearly every critical incident. Heroes are bottlenecks in disguise, and tribal knowledge is a single point of failure.

Ch. 12

Brent as the Constraint

Brent is the most talented engineer and therefore the most over-subscribed. Every escalation routes to him, making him the bottleneck for the entire IT system.

Ch. 12

Tribal Knowledge

Critical know-how lives only in Brent's head — undocumented runbooks, hand-tuned configs, "ask Brent" workflows.

Ch. 12

Hero Culture

Organizations reward firefighters more than fire-preventers, so the system selects for and exhausts its best people.

Ch. 12

The Five Focusing Steps

Goldratt: identify the constraint → exploit it → subordinate everything to it → elevate it → repeat. Bill is at step one.

Ch. 12

Documentation as a Throughput Tool

Writing down what Brent knows is how you remove him from the critical path of every ticket — a flow intervention, not a paperwork exercise.

Ch. 12 · Vocab

Constraint / Bottleneck: The resource whose capacity defines system throughput.
Subject Matter Expert (SME): A person whose unique knowledge is required for certain work.
Single Point of Failure (SPOF): Any node whose loss halts the system — humans included.
Five Focusing Steps: Identify, exploit, subordinate, elevate, repeat — Goldratt's constraint-management procedure.

Ch. 12 · Vocab

Patty McKee: Director of IT Service Support; Bill's process and change-management lieutenant.

Ch. 12 · Quiz1 / 4

Multiple choice

What is the central irony of Brent's role at Parts Unlimited?

Ch. 12 · Quiz2 / 4

True / False

Documenting Brent's procedures is primarily a compliance exercise.

Ch. 12 · Quiz3 / 4

Multiple choice

Goldratt's Five Focusing Steps begin with: identify the constraint, exploit it, subordinate everything to it, elevate it, repeat. Which step is Bill on at the end of Chapter 12?

Ch. 12 · Quiz4 / 4

Spot the issue

An IT shop's hiring plan is to find "more Brents." What's the structural problem with this idea?

Ch. 13

Monday, September 15

Bill, Wes, and Patty install a change-management process: index cards on a whiteboard for every proposed change, plus a CAB to approve them. You cannot manage work you cannot see, and a visual board is the cheapest, fastest way to make IT work visible.

Ch. 13

Visual Work Management

The whiteboard of index cards is essentially a Kanban board — work made visible becomes manageable.

Ch. 13

Change Advisory Board (Reinstated)

A cross-functional group that reviews, schedules, and approves proposed changes before they hit production.

Ch. 13

Change Categorization

Standard (pre-approved, low risk), normal (needs CAB), and emergency changes — different paths reduce friction without losing control.

Ch. 13

Freeze During Crisis

Bill imposes change freezes around high-risk windows to protect stability during release events.

Ch. 13

Engineering Resistance to Process

Engineers — especially Brent — push back; process feels like bureaucracy but is how you protect the constraint.

Ch. 13 · Vocab

Change: Any addition, modification, or removal that could affect a production IT service.
CAB: The governance body for changes; canonical ITIL concept.
Kanban Board: Visual signaling system from Toyota; columns = workflow states, cards = work items.
Standard Change: Pre-approved, low-risk, repeatable change.

Ch. 13 · Vocab

Emergency Change: Change made under time pressure with abbreviated approval — tracked carefully to prevent abuse.

Ch. 13 · Quiz1 / 4

Multiple choice

Why does the team use physical index cards on a whiteboard instead of digital change tickets?

Ch. 13 · Quiz2 / 4

Multiple choice

The new CAB introduces three change categories: standard, normal, and emergency. Why three rather than one?

Ch. 13 · Quiz3 / 4

True / False

Engineer pushback against the new change process is a sign the process is wrong and should be rolled back.

Ch. 13 · Quiz4 / 4

Spot the issue

An IT shop has *one* change category for everything from "change a desktop wallpaper" to "migrate the production database." What problem does this create?

Ch. 14

Tuesday, September 16

The new board immediately surfaces hundreds of pending changes — far more than the team can safely execute — and reveals most route through Brent. Scheduling without regard to the constraint is theater; every change must be evaluated against Brent's available capacity.

Ch. 14

Demand vs. Capacity Made Visible

Listing all the work makes brutally obvious that demand massively exceeds capacity, which was hidden when work was invisible.

Ch. 14

Scheduling Around the Constraint

A change that doesn't need Brent can be scheduled freely; one that does must wait its turn behind everything else competing for him.

Ch. 14

Backlog as Honest Mirror

A long backlog isn't a failure of the board — it's a true measurement of organizational over-commitment.

Ch. 14

Prioritization Discipline

Forcing executives to choose which changes get Brent's time is how IT pushes business trade-offs back to the business.

Ch. 14

Small-Batch Stability

Many smaller, well-understood changes are safer than a few mega-changes — a precursor to continuous delivery.

Ch. 14 · Vocab

Backlog: The ordered queue of work waiting to start.
Capacity Planning: Matching committed work to actual available resource hours.
Batch Size: Amount of work moved through a step at once; smaller = faster feedback, lower risk.
Lead Time: Time from request to delivery.

Ch. 14 · Vocab

Cycle Time: Time from start of work to completion (subset of lead time).

Ch. 14 · Quiz1 / 4

Multiple choice

After the change board goes up, the backlog suddenly looks enormous. What does this reveal?

Ch. 14 · Quiz2 / 4

Spot the issue

A team schedules ten changes for the same week, six of which require Brent. None of the others depend on Brent. What's wrong?

Ch. 14 · Quiz3 / 4

Multiple choice

Why does forcing executives to choose which changes get Brent's time count as a *good* outcome of the new process?

Ch. 14 · Quiz4 / 4

True / False

Many small changes are riskier than a few large changes because they create more deploy events.

Ch. 15

Wednesday, September 17

Erik reappears, takes Bill back to the plant, and formally names the **Four Types of Work**. Bill can only name three; the fourth — unplanned work — is the one that destroys all the others. Until you can categorize every hour of IT effort into one of these four buckets, you cannot manage IT.

Ch. 15

Type 1 — Business Projects

Revenue-generating or strategic initiatives the business funds (e.g., Phoenix). The most visible category.

Ch. 15

Type 2 — Internal IT Projects

Infrastructure work, automation, tooling, refactors — the work that makes future work cheaper but rarely appears on executive dashboards.

Ch. 15

Type 3 — Changes

Modifications to existing systems generated by Types 1 and 2 — the unit the CAB governs.

Ch. 15

Type 4 — Unplanned Work

Incidents, outages, and firefighting. It cannibalizes capacity from the other three types and is the most expensive form of work.

Ch. 15

Anti-Work

Unplanned work is anti-work — it doesn't add value, it prevents value-adding work from completing. The only goal is to minimize it.

Ch. 15 · Vocab

Four Types of Work: Business Projects, Internal IT Projects, Changes, Unplanned Work.
Unplanned Work: Work that arrives unscheduled, usually as an incident or escalation.
Recovery Work: Effort spent restoring service after a failure.
Technical Debt: Accumulated shortcuts that increase the rate of unplanned work over time.

Ch. 15 · Vocab

Erik Reid: The Goldratt-style mentor delivering these frameworks.

Ch. 15 · Quiz1 / 4

Multiple choice

Bill can name three types of work but Erik insists there's a fourth. Which one is it?

Ch. 15 · Quiz2 / 4

Multiple choice

Erik calls Type 2 "internal IT projects." Why does the book treat this category as critical despite its low visibility to executives?

Ch. 15 · Quiz3 / 4

True / False

Erik refers to unplanned work as "anti-work" because it doesn't add value — it actively prevents value-adding work from completing.

Ch. 15 · Quiz4 / 4

Spot the issue

A CIO presents to the board: "We delivered 18 business projects this quarter." Why is this number alone insufficient as a measure of IT health?

Ch. 16

Friday, September 19

Phoenix's deployment looms and the team discovers undocumented dependencies, missing environments, and Development tossing builds over the wall. Deployment is not a step at the end of a project — it's a capability that has to be designed in from the start.

Ch. 16

Throwing It Over the Wall

Dev finishes coding and hands the package to Ops with little context. Ops absorbs the integration pain — the canonical pre-DevOps anti-pattern.

Ch. 16

Environment Parity

Dev, test, staging, and production diverged, so "it works on my machine" is meaningless. Lack of parity is a primary cause of release failure.

Ch. 16

Deployment as a Constraint

A slow, painful deploy process becomes its own bottleneck independent of any human.

Ch. 16

Hidden Dependencies

Phoenix depends on databases, middleware, and network configs nobody fully mapped — a flow risk that only surfaces at deploy time.

Ch. 16

Date-Driven Release Pressure

Sarah and Steve insist on a date for marketing reasons, regardless of readiness — the business optimizing locally and ignoring system reality.

Ch. 16 · Vocab

Deployment Pipeline: The automated sequence that takes code from commit to production.
Release: A bundled set of changes promoted to production at a point in time.
Staging Environment: A pre-production environment intended to mirror production for final validation.
Cutover: The act of switching traffic to the new system.

Ch. 16 · Vocab

Rollback: The plan and mechanism to revert to the prior state if a release fails.

Ch. 16 · Quiz1 / 4

Multiple choice

Phoenix Dev finishes coding and hands a package to Ops with little context. What's the name and problem of this pattern?

Ch. 16 · Quiz2 / 4

True / False

"It works on my machine" is a sign that environment parity is good.

Ch. 16 · Quiz3 / 4

Spot the issue

An IT director assumes deployment is "the easy part" at the end of a project. Phoenix proves him wrong. What's the corrective principle?

Ch. 16 · Quiz4 / 4

Multiple choice

Sarah insists on the Phoenix launch date for marketing reasons even though Ops says systems aren't ready. Which Erik-ism applies?

Ch. 17

Friday, September 19

Bill, John, Wes, and Patty try to enumerate everything Phoenix actually needs to ship — servers, licenses, firewall rules, data migrations — and realize lead times alone make the deadline impossible. Long-lead procurement and provisioning are part of the value stream and must be visible far in advance.

Ch. 17

Value Stream Mapping

Drawing every step from "developer commits code" to "customer uses feature" exposes wait time, which usually dwarfs work time.

Ch. 17

Procurement Lead Time

Hardware, licenses, and vendor work-orders can take weeks or months — invisible until they block a release.

Ch. 17

Wait Time vs. Touch Time

The vast majority of total lead time in IT is queue time, not actual work — same finding as in Lean manufacturing.

Ch. 17

Pre-Production Readiness Reviews

A discipline of inspecting non-functional requirements (capacity, security, monitoring) well before launch, not on the day.

Ch. 17

The Cost of a Bad Release

A failed Phoenix deploy will cost more than the delay would have — but the org is not yet wired to weigh that trade-off honestly.

Ch. 17 · Vocab

Value Stream: The end-to-end sequence of activities required to deliver value to a customer.
Lead Time: Elapsed time from request to delivery.
Touch Time / Process Time: Time actually spent working on the item.
Provisioning: Setting up the infrastructure a service needs to run.

Ch. 17 · Vocab

Production Readiness: The state in which a service can be safely operated under real load.

Ch. 17 · Quiz1 / 4

Multiple choice

When the team maps Phoenix's value stream end-to-end, what dominant pattern do they discover?

Ch. 17 · Quiz2 / 4

True / False

Procurement lead times for servers and licenses are part of the deployment value stream and must be visible at planning time, not at deploy time.

Ch. 17 · Quiz3 / 4

Spot the issue

A team plans to test "capacity, security, and monitoring" the night of launch. What's wrong?

Ch. 17 · Quiz4 / 4

Multiple choice

Bill argues a delay would cost less than a failed Phoenix deploy. The org pushes back. What systemic gap does this reveal?

Ch. 18

Saturday, September 20

Phoenix is deployed against Bill's objections; the launch is a disaster — POS systems crash, credit-card data is mishandled, customers are turned away, and the team scrambles all weekend. Forcing a date over capacity and stability concerns converts business pressure into a much larger pile of unplanned work.

Ch. 18

Deployment Failure Cascade

One unforeseen interaction triggers another, and without observability the team is debugging blind.

Ch. 18

Customer-Facing Outage

Unlike internal failures, a retail POS outage is immediately visible to customers and press — magnifying business cost.

Ch. 18

Compliance Blast Radius

The credit-card handling problem creates potential PCI-DSS exposure on top of the outage itself — failure modes stack.

Ch. 18

All-Hands Firefight

Every engineer (especially Brent) is pulled in, halting every other piece of planned work in the company.

Ch. 18

Sunk-Cost Politics

Sarah and Steve resist rollback because retreating publicly is more painful to them than the technical pain of pressing on.

Ch. 18 · Vocab

POS (Point of Sale): In-store transaction system whose failure stops revenue at the register.
PCI-DSS: Payment Card Industry Data Security Standard governing cardholder-data handling.
Blast Radius: The scope of systems and customers affected by a failure.
Incident: An unplanned interruption or quality reduction of an IT service.

Ch. 18 · Vocab

War Room: Ad-hoc command center assembled during a major incident.

Ch. 18 · Quiz1 / 4

Multiple choice

Phoenix's launch failure cascades from POS systems to credit-card handling to multiple downstream services. What capability is most missing?

Ch. 18 · Quiz2 / 4

True / False

A retail POS outage is roughly equivalent in business cost to an equally long internal back-office outage.

Ch. 18 · Quiz3 / 4

Spot the issue

Sarah and Steve refuse to roll Phoenix back even as the failures mount. Bill argues for rollback. What is driving the executive resistance?

Ch. 18 · Quiz4 / 4

Multiple choice

During the Phoenix launch crisis, every engineer is pulled into the war room — including Brent. What is the second-order cost?

Ch. 19

Monday, September 22

In the post-launch wreckage, the team stabilizes Phoenix piece by piece while absorbing political fallout. Bill recognizes Brent is now permanently saturated and that nothing planned will move until he is protected. After a failure, the priority is to restore flow by ruthlessly protecting the constraint, not piling on blame.

Ch. 19

Post-Incident Stabilization

First restore service, then improve, then learn — sequencing matters.

Ch. 19

Protecting the Constraint

Brent is moved off ad-hoc tickets; work must be queued and prioritized through him deliberately.

Ch. 19

Blameless Response

Bill resists the urge to scapegoat individuals; the system produced the failure, not any one person.

Ch. 19

Executive Trust as Capital

Bill's earlier warnings now give him political room to impose stricter process. Credibility was the real currency.

Ch. 19

Stop-the-Line Authority

Someone in IT must be empowered to halt a release the way an Andon cord halts a Toyota line.

Ch. 19 · Vocab

Andon Cord: Toyota mechanism letting any worker stop the line on detecting a defect.
Postmortem: Structured review of an incident to extract learning.
Mean Time to Restore (MTTR): Time from incident detection to service restoration.
Mean Time Between Failures (MTBF): Average uptime between incidents.

Ch. 19 · Vocab

Toil: Repetitive operational work that scales with service growth; classic unplanned-work fuel.

Ch. 19 · Quiz1 / 4

Multiple choice

After the Phoenix disaster, what is Bill's first priority — and why does the order matter?

Ch. 19 · Quiz2 / 4

True / False

Bill responds to the Phoenix failure by identifying which individual engineers caused the cascade and disciplining them.

Ch. 19 · Quiz3 / 4

Spot the issue

A release is clearly in trouble but no one stops it. Engineers see the failure coming and stay silent. What organizational capability is missing?

Ch. 19 · Quiz4 / 4

Multiple choice

After Phoenix fails, Bill suddenly has political room to impose stricter change control. What does the book say this proves?

Ch. 20

Tuesday, September 23

Erik takes Bill back to MRP-8 and explicitly names the **Three Ways**, mapping Lean and TPS onto IT. This is the philosophical center of Part Two: every tactic Bill has been groping toward — visibility, change control, protecting Brent, smaller batches — is an instance of one of these Ways.

Ch. 20

The First Way — Flow

Optimize the left-to-right flow of work from Development to Operations to the customer. Tactics: small batches, reducing WIP, eliminating wait time, never passing defects downstream.

Ch. 20

The Second Way — Feedback

Create fast, constant feedback loops at every stage so problems are detected and fixed at the source. Tactics: telemetry, stop-the-line authority, swarming on defects.

Ch. 20

The Third Way — Continual Learning

Build a culture that rewards experimentation, repetition, and learning from failure. Failures become learning, not punishment.

Ch. 20

DevOps as TPS for IT

Erik maps Lean concepts (Jidoka, Kaizen, Heijunka) onto IT — DevOps is not new physics, it's Toyota Production System applied to software.

Ch. 20

The Constraint and the Three Ways

Identifying and protecting Brent is First Way (flow); building telemetry around him is Second Way (feedback); automating his knowledge is Third Way (continual learning).

Ch. 20 · Vocab

The Three Ways: Flow, Feedback, Continual Learning — the underlying principles of DevOps.
Kaizen: Continuous incremental improvement.
Jidoka: Automation with a human touch — stopping the line on defect detection.
Heijunka: Production leveling to smooth demand and reduce batch sizes.

Ch. 20 · Vocab

Telemetry: Instrumentation that produces continuous signals about system behavior.

Ch. 20 · Quiz1 / 4

Multiple choice

Which of the following is the First Way?

Ch. 20 · Quiz2 / 4

Multiple choice

Which Way is best illustrated by giving any engineer the authority to halt a release when they see a defect?

Ch. 20 · Quiz3 / 4

True / False

The Third Way is about adding more rigorous processes so failures are eliminated.

Ch. 20 · Quiz4 / 4

Spot the issue

A team announces, "We're doing DevOps now — we bought a new CI/CD tool." What's the conceptual gap?

Ch. 21

Friday, September 26

Armed with the Three Ways, Bill imposes **WIP limits** and a project freeze: no new work enters the system, and Brent is firewalled behind Patty's scheduling so he only works on pre-approved items. Throughput improves within days. Reducing WIP, not adding people, is how you increase throughput at a constrained system.

Ch. 21

Freezing New Work

Stop the intake of new projects so existing WIP can drain. Counterintuitive but Lean-canonical.

Ch. 21

WIP Limits

Hard caps on how many items can be in-progress at any stage. Force prioritization and expose bottlenecks.

Ch. 21

Firewalling the Constraint

Brent is removed from on-call and ad-hoc queues; all requests for him are filtered and scheduled through Patty.

Ch. 21

Little's Law

Lead time = WIP / throughput. Cutting WIP cuts lead time at constant throughput — the mathematical reason why doing less finishes more.

Ch. 21

Pull vs. Push

Upstream stops pushing work onto Brent; Brent pulls the next item when ready. Pull systems naturally subordinate to the constraint.

Ch. 21

Documenting Tribal Knowledge

Brent's protected time is partly redirected to writing down his procedures so others can do the work next time.

Ch. 21 · Vocab

WIP Limit: Maximum number of work items allowed in a given workflow state at one time.
Pull System: Workflow where downstream signals capacity and upstream releases work only on demand.
Little's Law: L = λW; in queue form, WIP = throughput × lead time.
Freeze Period: Bounded interval in which no new work or non-essential changes are allowed.

Ch. 21 · Vocab

Swarming: Multiple people converging on a single high-priority item to drive it to completion.
Brent Geller: The senior engineer who embodies the constraint throughout the novel.

Ch. 21 · Quiz1 / 4

Multiple choice

Little's Law in its queue form says lead time = WIP / throughput. What practical conclusion follows?

Ch. 21 · Quiz2 / 4

True / False

Freezing new work is a sign of failure — a healthy IT shop should always be able to accept incoming demand.

Ch. 21 · Quiz3 / 4

Spot the issue

A team complains that imposing WIP limits is "making us slower." A week later their throughput has actually increased. What's happening?

Ch. 21 · Quiz4 / 4

Multiple choice

Brent is firewalled behind Patty's scheduling. Why is "pulling" work to Brent better than letting teams push work onto him?

Part 03

DevOps, Unicorn, and the Future of IT

Ch. 22–35

Ch. 22

Monday, September 22

John has gone missing and the NOC is openly betting on what happened to him. Bill's team launches a monitoring initiative to take routine work off Brent, and Patty rolls out a Kanban system modeled on MRP-8 with Ready/Doing/Done columns. Visualizing WIP and protecting the constraint convert chaos into predictable flow.

Ch. 22

Kanban for IT Operations

Physical visualization of WIP states (Ready/Doing/Done) so the team can see queues forming and limit work-in-progress.

Ch. 22

Protecting the Constraint

Every new project must either reduce load on Brent or transfer his knowledge elsewhere; nothing else gets prioritized.

Ch. 22

Monitoring as Preventive Maintenance

Proactive telemetry catches issues before they become Sev 1 incidents and reduces firefighting that drains the constraint.

Ch. 22

Knowledge Transfer

Codifying tribal knowledge so any senior engineer can do what Brent does today — making the constraint reproducible.

Ch. 22

The First Way in Operation

Workflow management, defect prevention, and pace-setting around the bottleneck — Flow made operational.

Ch. 22 · Vocab

Kanban: A pull-based visual workflow system originating in lean manufacturing.
NOC: Network Operations Center; the team monitoring infrastructure 24/7.
WIP: Number of tasks started but not finished; high WIP destroys throughput.
Telemetry: Operational data emitted by systems to enable monitoring and feedback.

Ch. 22 · Vocab

Preventive maintenance: Work done before failure to reduce future incidents.

Ch. 22 · Quiz1 / 4

Multiple choice

The new Kanban board uses Ready/Doing/Done columns. What is its primary effect on flow?

Ch. 22 · Quiz2 / 4

True / False

A new project that doesn't reduce load on Brent or transfer his knowledge should be prioritized if business value is high.

Ch. 22 · Quiz3 / 4

Multiple choice

Bill invests in better monitoring as a way to free Brent. What's the indirect mechanism?

Ch. 22 · Quiz4 / 4

Spot the issue

A team plans to "make Brent more productive" by giving him a faster laptop. Why is this missing the point?

Ch. 23

Tuesday, September 23

Brent is falling behind on Phoenix. Erik and Bill work out, mathematically, why: wait time grows non-linearly with utilization. Patty reframes deployments as the "final assembly step" and adds Kanban swim lanes for recurring large tasks. Idle time isn't waste — it's the buffer that lets work actually flow.

Ch. 23

Wait Time = %Busy / %Idle

At 50% utilization, wait time ratio is 1. At 90% it's 9. At 99% it's 99. Over-utilized people create exponential delays.

Ch. 23

Queue Theory in Knowledge Work

The same math that explains traffic jams explains IT delivery delays. Knowledge work is not exempt from queueing.

Ch. 23

Deployment as Final Assembly

The biggest defect-injection point in the value stream. It deserves the most rigor, not the least.

Ch. 23

Swim Lanes for Recurring Work

Separating planned/recurring tasks from ad-hoc requests so each gets predictable throughput.

Ch. 23

Idle Capacity Is a Feature

Slack in the constraint is what lets the system absorb variance. Saturated systems crash under any perturbation.

Ch. 23 · Vocab

Lead time: Total elapsed time from request to delivery.
Cycle time: Time actually spent doing the work.
Utilization: Fraction of available time a resource is actively working.
Swim lane: A horizontal track on a Kanban board reserved for a specific class of work.

Ch. 23 · Vocab

Final assembly: Manufacturing term for where components integrate; analogous to production deployment.

Ch. 23 · Quiz1 / 4

Multiple choice

Erik's wait-time heuristic says wait grows as %busy / %idle. What does that imply for a resource pushed from 90% to 99% utilization?

Ch. 23 · Quiz2 / 4

True / False

Idle time on the constraint is wasted capacity that should be filled.

Ch. 23 · Quiz3 / 4

Multiple choice

Patty calls deployment "the final assembly step." Why is that framing useful?

Ch. 23 · Quiz4 / 4

Spot the issue

An engineering manager argues that queueing theory "doesn't apply to knowledge work because each ticket is unique." What's the problem with this argument?

Ch. 24

Saturday, September 27

Bill bumps into an intoxicated, deflated John at a hotel bar; John is questioning whether his security crusade ever produced business value. The chapter sets up John's reinvention. A control function (Security, Compliance, Audit) that doesn't connect to business outcomes will be ignored or worked around.

Ch. 24

Security Must Align with Business Outcomes

Controls justified only by "best practice" lose credibility. They must defend revenue, margin, or trust.

Ch. 24

Empathy Across Silos

Bill engaging John humanly is the cultural seed for the later Dev/Ops/Sec super-tribe.

Ch. 24

Personal Rock-Bottom as Catalyst

Change in the book often comes from a leader admitting the old model failed — John's bar scene is that moment.

Ch. 24

Compliance Theater Diagnosis

Activity that produces audit evidence without reducing real risk. John's epiphany is that he's been generating evidence, not security.

Ch. 24 · Vocab

CISO: Chief Information Security Officer (John's role).
Compensating control: An alternative safeguard used when the primary control isn't feasible.
Compliance theater: Activity that produces audit evidence without reducing real risk.
Best practice: An industry-standard control; weak justification on its own.

Ch. 24 · Vocab

Business outcome: A measurable business effect like revenue, retention, or trust.

Ch. 24 · Quiz1 / 4

Multiple choice

What is the book's diagnosis of John's pre-rock-bottom security program?

Ch. 24 · Quiz2 / 4

True / False

A security control justified solely by "it's a best practice" is a strong justification at the executive level.

Ch. 24 · Quiz3 / 4

Multiple choice

Bill choosing to engage John humanly at the bar is set up as which kind of inflection point?

Ch. 24 · Quiz4 / 4

Spot the issue

A CISO presents 47 controls to the board, all justified as "industry best practice." None map to a specific business risk or outcome. What's the predictable result?

Ch. 25

Monday, September 29

John resurfaces transformed and joins Bill for a meeting with Dick, the CFO. Dick walks them through what the business actually measures, and Erik tells Bill the dual mission: find where IT *under*-scopes (missing business risk) and where Security *over*-scopes (controls that don't matter). IT goals must trace to measurable business performance.

Ch. 25

Business Performance Measures

Revenue growth, market share, profitability, customer satisfaction, order-to-cash cycle time — the things the CFO actually watches.

Ch. 25

Tying IT Risk to Business Risk

Every control or capability should map to a business outcome it protects or enables.

Ch. 25

CIA Triad Refocused

Confidentiality, Integrity, Availability — framed around what the business actually needs, not blanket policy.

Ch. 25

Under-Scoping vs. Over-Scoping

Twin failure modes: missing a risk that matters, or spending on controls that don't. Both leak credibility.

Ch. 25

Security as Enabler

Security joining the flow instead of standing outside it — the start of DevSecOps.

Ch. 25 · Vocab

CFO: Chief Financial Officer.
KPI: Key Performance Indicator.
CIA triad: Confidentiality, Integrity, Availability — the classic infosec model.
Order-to-cash: The end-to-end process from customer order to recognized revenue.

Ch. 25 · Vocab

Risk: The probability and impact of an adverse business outcome.

Ch. 25 · Quiz1 / 4

Multiple choice

Erik gives John a dual mission with two failure modes. What are they?

Ch. 25 · Quiz2 / 4

Multiple choice

The CFO tells Bill which measures actually matter. Which set is closest to what Dick names?

Ch. 25 · Quiz3 / 4

True / False

Refocusing the CIA triad on business needs is more useful than enforcing blanket confidentiality, integrity, and availability policies.

Ch. 25 · Quiz4 / 4

Spot the issue

A security team treats every server as equally critical and applies the same controls everywhere. What pathology is this?

Ch. 26

Wednesday, October 1

Bill interviews business process owners (sales, merchandising, manufacturing) and learns the sales forecast is fictional, demand signals are stale, and Maggie desperately needs shorter feedback cycles. Long IT delivery cycles strand business capital; speed-to-market and fail-fast are competitive weapons.

Ch. 26

Time-to-Market

Maggie wants 6-12 month cycles, not 3-year ones. The cycle time of IT delivery sets the cycle time of business strategy.

Ch. 26

Fail Fast

Small experiments produce cheap failures, and cheap failures are how the org learns.

Ch. 26

Capital Efficiency of IT

WIP locked in a multi-year Phoenix is capital that earns no return. Long-lived WIP is a balance-sheet problem, not just a flow problem.

Ch. 26

Customer Demand Signal

Real order/usage data must reach decision-makers quickly to set inventory and pricing.

Ch. 26

Business Process Owners

The executives who own outcomes that IT must enable — Maggie for merchandising, Ron for sales, manufacturing leads on the floor.

Ch. 26 · Vocab

Time-to-market: Elapsed time from idea to a customer being able to buy it.
Demand signal: Empirical evidence of what customers want, used to align supply.
Fail fast: Biasing for small bets so wrong answers surface cheaply.
Sales pipeline: Staged view of opportunities from prospect to closed deal.

Ch. 26 · Vocab

Merchandising: Selecting, pricing, and presenting products for sale.

Ch. 26 · Quiz1 / 4

Multiple choice

Maggie wants the merchandising cycle compressed from three years to 6-12 months. What is the structural argument for why IT cycle time matters for the business?

Ch. 26 · Quiz2 / 4

True / False

Multi-year WIP locked in unfinished features is purely a flow problem, not a financial one.

Ch. 26 · Quiz3 / 4

Multiple choice

"Fail fast" is more than a slogan in this chapter. What's the principle?

Ch. 26 · Quiz4 / 4

Spot the issue

A merchandising team makes pricing decisions based on data that's two months old. Why is that worse than it sounds?

Ch. 27

Wednesday, October 8

John presents his rebuilt program: tightly scope SOX/PCI work, pay down security technical debt, and integrate security into daily work. Bill commits to surfacing IT risks as leading indicators of business risk on the executive scorecard. Shifting security and compliance left is cheaper and more effective than auditing them in afterwards.

Ch. 27

Shift-Left Security

Integrating security checks into design, build, and deploy rather than post-hoc audits — defects are cheaper to fix earlier.

Ch. 27

Technical Debt Paydown

Fixing the root systems generating audit findings, not compensating around them.

Ch. 27

Leading vs. Lagging Indicators

Leading: deploy frequency, MTTR. Lagging: revenue, audit findings. Leading indicators predict; lagging ones report past results.

Ch. 27

Scope Reduction in Compliance

Narrowing the systems in audit scope is a force multiplier — fewer systems to control means controls can actually be enforced.

Ch. 27

Security in the Flow

Security joins Dev and Ops in the value stream rather than gating it from outside.

Ch. 27 · Vocab

SOX-404: Sarbanes-Oxley section on internal controls over financial reporting.
PCI-DSS: Payment Card Industry Data Security Standard.
Technical debt: Accumulated shortcuts that increase future change cost.
Leading indicator: A measure that predicts a future outcome.

Ch. 27 · Vocab

Shift-left: Moving a concern earlier in the lifecycle, where defects are cheaper.

Ch. 27 · Quiz1 / 4

Multiple choice

John's rebuilt program narrows the systems in SOX/PCI scope rather than expanding controls. Why is scope reduction a force multiplier?

Ch. 27 · Quiz2 / 4

True / False

Deploy frequency and MTTR are lagging indicators of IT health.

Ch. 27 · Quiz3 / 4

Multiple choice

What does "shift-left security" mean in this chapter?

Ch. 27 · Quiz4 / 4

Spot the issue

A security team's plan is to add layers of compensating controls around an inherently broken legacy system. What's the deeper move the book recommends instead?

Ch. 28

Thursday, October 16

The improvements pay off — Sev 1 outages drop roughly two-thirds, recovery time halves, and proactive monitoring is catching issues. But cracks appear: Sarah has been spinning up rogue cloud projects that violate privacy policy, and a major database migration fails because Brent quietly changed something in production undocumented. Results validate the First Way, but undisciplined change still wrecks releases.

Ch. 28

MTTR Improvement

Mean Time To Recover halves — a measurable Second-Way feedback win.

Ch. 28

Production Change Discipline

Undocumented changes are the leading cause of failed deploys, even when everything else is improving.

Ch. 28

Environment Drift

Dev/QA/Prod diverging over time guarantees surprises at release. Drift is silent until it isn't.

Ch. 28

Shadow IT

Sarah-style unsanctioned procurement (often cloud) that bypasses governance — re-introduces the chaos Bill just fixed.

Ch. 28

Observability vs. Monitoring

Being able to ask new questions of running systems, not just watch pre-defined dashboards.

Ch. 28 · Vocab

Sev 1: Severity-one incident; highest-priority customer-impacting outage.
MTTR: Mean Time To Recover/Repair.
Shadow IT: Technology purchased and run outside the IT organization.
Environment parity: Keeping dev, test, and production identically configured.

Ch. 28 · Vocab

Change control: Formal process to authorize and record production changes.

Ch. 28 · Quiz1 / 4

Multiple choice

Sev 1 incidents drop two-thirds and MTTR halves — yet a database migration still fails. What is the immediate cause of the migration failure?

Ch. 28 · Quiz2 / 4

True / False

Monitoring and observability are the same thing.

Ch. 28 · Quiz3 / 4

Multiple choice

Sarah spins up cloud projects that bypass governance. What general pattern is this?

Ch. 28 · Quiz4 / 4

Spot the issue

Dev, QA, and Prod have slowly diverged over months. The team thinks "it's fine — everything still works." What's the silent failure mode?

Ch. 29

Friday, October 17

Phoenix slips again. Sarah's side-projects need infrastructure rework, board members are agitating to split the company, and Bill proposes pausing to synchronize environments and stand up a small SWAT team focused on revenue features. Erik hammers the Second Way: bigger batches mean larger variance and slower feedback. Lengthening release intervals doesn't reduce risk — it amplifies it.

Ch. 29

The Second Way (Feedback) Reasserted

Amplify and shorten the feedback loops from Ops (and the customer) back to Dev.

Ch. 29

Batch-Size Reduction

Smaller releases reduce variance, blast radius, and time-to-detect.

Ch. 29

Single-Piece Flow

The ideal: one unit of work moves end-to-end without queuing. The asymptote of batch-size reduction.

Ch. 29

Rework as Failure Signal

Work flowing backward (defects, missed specs) is the alarm to investigate, not the new normal to accept.

Ch. 29

SWAT Team

Small cross-functional team free of legacy entanglements — the seed of what becomes Project Unicorn.

Ch. 29 · Vocab

Feedback loop: Channel by which downstream signal reaches upstream decision-makers.
Batch size: Units of work released together; bigger batches = bigger blast radius.
Variance: Variability in outcomes; lean treats it as the enemy of flow.
Blast radius: Scope of harm caused by a single failure.

Ch. 29 · Vocab

Rework: Work redone because of defects or missed specs.

Ch. 29 · Quiz1 / 4

Multiple choice

A common executive instinct is to lengthen release intervals after a bad release ("we'll batch up more changes to be more careful"). Why is this exactly wrong?

Ch. 29 · Quiz2 / 4

True / False

A growing volume of rework is a healthy sign that quality is being caught.

Ch. 29 · Quiz3 / 4

Multiple choice

Single-piece flow is the asymptote of which concept?

Ch. 29 · Quiz4 / 4

Spot the issue

A program manager argues feedback loops aren't important because "the design was right at the start." What's the missing premise?

Ch. 30

Tuesday, October 21

Erik takes Bill back to MRP-8 and reveals his Special-Forces past. Standing on the plant floor he challenges Bill to make IT capable of **ten deploys a day**, citing Flickr and Etsy. He maps manufacturing constructs (takt time, setup-time reduction, single-piece flow) directly onto a software deployment pipeline. IT must operate at the cycle time of customer demand.

Ch. 30

Takt Time

German for "beat" — cycle time that matches customer demand. Anything slower starves the business.

Ch. 30

The Deployment Pipeline

End-to-end automated path from code check-in to production. The factory line for software.

Ch. 30

Infrastructure as Code

Environments defined as code, version-controlled and reproducible. Eliminates drift and snowflake servers.

Ch. 30

Continuous Delivery

Every change is releasable on demand because the pipeline says so — the trunk is always shippable.

Ch. 30

Setup-Time Reduction

Automating the slow, manual hand-offs (env builds, approvals, smoke tests) so release becomes a non-event.

Ch. 30

Business Agility

Not raw speed, but the ability to detect market change and respond quickly — the strategic payoff of fast deploy.

Ch. 30 · Vocab

Takt time: German *Takt* = beat; the available time per unit of demand.
Deployment pipeline: Automated stages a change passes through to reach production.
Infrastructure as Code (IaC): Provisioning and configuration via version-controlled scripts.
Continuous Delivery (CD): Engineering practice that keeps the trunk always shippable.

Ch. 30 · Vocab

Trunk-based development: Developers integrate small changes to a shared mainline frequently.

Ch. 30 · Quiz1 / 4

Multiple choice

What does "takt time" mean in Erik's mapping from manufacturing to IT?

Ch. 30 · Quiz2 / 4

True / False

Infrastructure as Code is primarily a tool choice rather than a discipline.

Ch. 30 · Quiz3 / 4

Multiple choice

Erik cites Flickr and Etsy doing ten deploys a day. Which underlying capability makes that possible?

Ch. 30 · Quiz4 / 4

Spot the issue

A team claims they "do continuous delivery" but releases require a 4-hour manual smoke-test ritual and three approval emails. What's the gap?

Ch. 31

Friday, October 24

Chris co-leads a new SWAT team aimed at the holiday-promotions opportunity. They map every deployment step's value stream and find that nearly every step has a history of failures. Brent volunteers to automate environment provisioning. You can't improve a pipeline you can't see; mapping the value stream surfaces the waste.

Ch. 31

Value-Stream Mapping (Applied)

Listing every step, touch time, and wait time from idea to production. Exposes the waste that intuition misses.

Ch. 31

Automated Environment Provisioning

Scripts — not tickets — stand up identical Dev/QA/Prod environments. Provisioning becomes a non-event.

Ch. 31

Embedded Ops in Sprints

Operations engineers embedded in the development team's sprints rather than handed work at the end.

Ch. 31

Artifact Promotion

One immutable artifact built once and promoted through environments — eliminates "rebuild for prod" surprises.

Ch. 31

Version-Control Everything

Code, configuration, and environment definitions all live in the same repo. One source of truth.

Ch. 31 · Vocab

Value-stream map: Diagram of every step that adds (or doesn't add) value to a piece of work.
Provisioning: Creating and configuring infrastructure ready for use.
Artifact: The built, deployable output of a pipeline (binary, container image, package).
Sprint: Short fixed-length iteration in agile development.

Ch. 31 · Vocab

Immutable: Cannot be modified after creation; promoted as-is.

Ch. 31 · Quiz1 / 4

Multiple choice

The SWAT team maps Phoenix's deployment value stream and finds nearly every step has a failure history. What does mapping accomplish that intuition didn't?

Ch. 31 · Quiz2 / 4

True / False

Promoting the same immutable artifact through Dev/QA/Prod is interchangeable with rebuilding the artifact in each environment.

Ch. 31 · Quiz3 / 4

Spot the issue

A team puts code in version control but keeps environment configuration in a spreadsheet that ops engineers edit by hand. What's the predictable failure?

Ch. 31 · Quiz4 / 4

Multiple choice

Embedding Ops engineers inside the development team's sprints addresses which canonical anti-pattern?

Ch. 32

Monday, November 10

The SWAT team is christened **Project Unicorn**: a decoupled codebase, its own data store, its own pipeline — a mini-Phoenix without Phoenix's accumulated baggage. Standardized OS, library, and DB images make every environment look the same. Loose coupling — architectural and organizational — is what lets you ship fast without breaking the world.

Ch. 32

Project Unicorn

A strangler/sidecar effort decoupled from Phoenix so it can move fast. The book's case study for what good architecture enables.

Ch. 32

Loose Coupling (Architecture)

Independently deployable services reduce blast radius and let teams ship without coordinating with everyone.

Ch. 32

Loose Coupling (Teams)

Small autonomous teams own a service end-to-end — precursor to microservices and two-pizza teams.

Ch. 32

Golden Images

One blessed OS+library baseline used everywhere. The death of snowflake servers.

Ch. 32

Reduced Cross-Team Dependencies

A team should be able to ship without waiting on other teams — coordination cost is what kills fast delivery.

Ch. 32 · Vocab

Loose coupling: Architectural property where components depend on minimal, stable interfaces.
Golden image: Pre-baked OS/runtime image used as a standard starting point.
Decoupled codebase: A code branch or service that can evolve and deploy independently.
Cross-functional team: Single team with all skills (Dev, Ops, QA, Security, Product) needed to ship.

Ch. 32 · Vocab

Strangler pattern: Gradually replacing a legacy system by routing new functionality around it.

Ch. 32 · Quiz1 / 4

Multiple choice

Project Unicorn is structurally decoupled from Phoenix's codebase, data store, and pipeline. What's the strategic point?

Ch. 32 · Quiz2 / 4

True / False

Loose coupling matters only at the architectural level, not at the organizational level.

Ch. 32 · Quiz3 / 4

Multiple choice

Golden images are introduced to solve which problem?

Ch. 32 · Quiz4 / 4

Spot the issue

A team's velocity is high in isolation but slow in practice because every release requires coordinating with five other teams. What architectural property would help most?

Ch. 33

Monday, November 3

Unicorn ships. Maggie's targeted-promotions email goes to ~1% of customers as a controlled trial; ~20% click through and ~6% buy — a 5x conversion lift. The team scales on cloud during the spike, automated security tests run in the pipeline, and a production glitch is caught and fixed inside a day. Small-batch, hypothesis-driven releases plus fast feedback turn IT into a revenue engine.

Ch. 33

Hypothesis-Driven Development

Every feature ships as a falsifiable hypothesis ("if we recommend X to segment Y, conversion will rise Z%").

Ch. 33

A/B Testing

Run feature variants against subsets of users and let data pick the winner — opinion is downgraded from authority to hypothesis.

Ch. 33

Canary Release

Roll out to a small slice (here, 1% of customers) before the full population. Small blast radius for unknown unknowns.

Ch. 33

Cloud Elasticity

Provision capacity on demand to absorb traffic spikes — the new variable factor of production.

Ch. 33

DevSecOps

Security checks run with every build as part of the pipeline. Security joins the flow rather than gating it.

Ch. 33

Telemetry-Driven Decisions

Product choices made from live data, not opinion. The Second Way operationalized for product.

Ch. 33 · Vocab

A/B test: Controlled experiment comparing two variants.
Canary release: Exposing a change to a small fraction of users first.
Conversion rate: Percentage of recipients who take the desired action.
Cloud elasticity: Automatic scaling up/down of cloud capacity.

Ch. 33 · Vocab

Hypothesis-driven development: Building features as experiments with measurable outcomes.

Ch. 33 · Quiz1 / 4

Multiple choice

Unicorn's promotions email is sent to ~1% of customers first. What is the canonical name and purpose of this approach?

Ch. 33 · Quiz2 / 4

True / False

Hypothesis-driven development means every feature ships as a falsifiable claim with measurable success criteria.

Ch. 33 · Quiz3 / 4

Multiple choice

Security tests run automatically in Unicorn's pipeline on every build. Which broader practice is this an instance of?

Ch. 33 · Quiz4 / 4

Spot the issue

A product manager argues that an A/B test isn't needed because "we already know what customers want." What's the missing premise?

Ch. 34

Black Friday Week

Black Friday hits hard: a Sev 1 traffic surge forces the team to scale servers and disable resource-heavy features in minutes using **feature toggles**. The quarter hits record revenue and profitability. Then Sarah announces competitors are launching build-to-order kits and top-line items drop 20% — proving that even a winning IT capability has to keep adapting. When you can deploy daily, you can respond to anything — including bad news.

Ch. 34

Feature Toggles / Flags

Config-driven switches that decouple deploy from release — code can ship dark and turn on later (or turn off under load).

Ch. 34

Andon-Cord Behavior

Anyone seeing a problem can stop the line, get help, and fix the systemic cause before resuming.

Ch. 34

Graceful Degradation

Turning off expensive features under load instead of failing entirely. Resilience over rigidity.

Ch. 34

Resilience Engineering

Designing systems and teams to absorb shocks and adapt. Reliability isn't about preventing failure; it's about surviving it.

Ch. 34

IT Enabling Competitive Response

The speed of IT becomes the speed of business strategy. When competitors launch, IT determines whether you can respond in weeks or quarters.

Ch. 34 · Vocab

Feature toggle / flag: A runtime switch that enables or disables a code path without redeploying.
Andon cord: TPS device any worker can pull to stop the line on a defect.
Graceful degradation: A system retains core function while shedding non-essential features under stress.
Resilience: Capacity to absorb disturbance and continue serving customers.

Ch. 34 · Vocab

Dark launch: Deploying code to production behind a flag, off by default.

Ch. 34 · Quiz1 / 4

Multiple choice

During Black Friday, the team disables expensive features in minutes using feature toggles. What is the deeper architectural property this demonstrates?

Ch. 34 · Quiz2 / 4

True / False

Graceful degradation under load is a sign of a fragile system that should fail hard instead.

Ch. 34 · Quiz3 / 4

Multiple choice

Competitors launch a new product and Parts Unlimited's top-line items drop 20% in days. What capability determines whether the company can respond?

Ch. 34 · Quiz4 / 4

Spot the issue

A team handles a traffic spike by frantically adding servers but has no way to shed non-essential features. The site stays up but with severe latency. What capability is missing?

Ch. 35

Friday, November 14

Sev 1s keep falling. Bill institutes **Project Narwhal**, a chaos-engineering program that randomly kills processes and instances to expose fragility before customers do. Steve offers Bill the CIO job, then a bigger one: rotation through sales, manufacturing, supply chain, and international on the track to COO. Chris becomes CIO; Sarah is gone. Erik's parting charge: write **The DevOps Cookbook**.

Ch. 35

The Third Way (Continual Learning)

Make experimentation, repetition, and learning from failure a daily ritual. Failures become learning, not punishment.

Ch. 35

Chaos Engineering

Deliberately injecting failure (Chaos Monkey / Project Narwhal) to build antifragile systems before customers discover the fragility.

Ch. 35

Game Days

Scheduled drills where teams practice responding to simulated failures. Erik's rule: "five minutes a day beats three hours once a week."

Ch. 35

Blameless Postmortems

Treat failure as learning, not punishment, so people surface problems early instead of hiding them.

Ch. 35

IT as Career Path to COO

Operational leadership now requires fluency in the IT systems that run operations. Tomorrow's general managers come up through IT.

Ch. 35

The DevOps Cookbook

Erik's in-novel name for a replicable transformation playbook — foreshadowing the real-world *DevOps Handbook* that followed.

Ch. 35 · Vocab

Chaos Monkey: Netflix-originated tool that randomly terminates production components to test resilience.
Game day: Scheduled rehearsal where teams practice responding to simulated failures.
Blameless postmortem: A retrospective focused on systemic causes rather than individual fault.
The Three Ways: Flow, Feedback, Continual Learning — the book's organizing DevOps principles.

Ch. 35 · Vocab

COO: Chief Operating Officer; the role Bill is being groomed for.

Ch. 35 · Quiz1 / 4

Multiple choice

Project Narwhal randomly kills production processes and instances. What is this practice called and what's its purpose?

Ch. 35 · Quiz2 / 4

True / False

Erik's "five minutes a day beats three hours once a week" rule prefers infrequent intensive drills over frequent small ones.

Ch. 35 · Quiz3 / 4

Multiple choice

Steve's career plan for Bill is a rotation through sales, manufacturing, supply chain, and international with Erik mentoring. What is the book's underlying argument?

Ch. 35 · Quiz4 / 4

Spot the issue

A team holds postmortems but always ends them by naming "who to blame." Over time, engineers stop reporting near-misses. What capability has the team lost?

Key Takeaways

01

IT operations obeys the same physics as a factory floor — flow, inventory, and constraints are universal.

02

Unplanned work is the silent killer; until you can see and categorize all four types of work, you cannot manage IT.

03

Throughput is set by the constraint, so any improvement made anywhere else is an illusion — protect Brent first.

04

The Three Ways (Flow, Feedback, Continual Learning) unify every DevOps tactic from Kanban to chaos engineering.

05

Reducing WIP, not adding people, is how you cut lead time — Little's Law applies just as cleanly to software as to cars.

06

Every business is an IT business; the speed of IT now sets the speed of business strategy itself.