Tardigrade under electron microscope

The DR Number Almost No One Records

Disaster recovery has three numbers. Almost no organization records all three. The first is the number written into the plan. The second is the number measured during exercises, if exercises happen. The third is the number observed during real incidents. The distance between them is the only metric that matters. It is also the metric that almost no one calculates. The Three States of D.R. Capability Disaster recovery capability exists in three forms simultaneously, and the three forms produce three different numbers. ...

May 22, 2026 · 9 min · 1804 words · Andre Rocha

Shadow Infrastructure

Observation: Modern platforms often contain internal infrastructure that is not visible in the primary operational model used by administrators. These resources include internal networks, control-plane communication paths, service networks, operator-managed components, and reconciliation controllers. They exist to support platform behavior rather than application workloads, and are frequently created automatically during cluster deployment. Because they are not part of the infrastructure model operators typically reason about, they remain largely invisible until they interact with external resources or cause unexpected conflicts. ...

March 27, 2026 · 1 min · 131 words · Andre Rocha
FN-0011

The Abstraction Tax

Observation: Every abstraction layer hides complexity from the user while introducing additional operational mechanics behind the scenes. Controllers reconcile desired state. Operators manage lifecycle logic. Networking overlays create new routing paths. These mechanisms remain mostly invisible during normal operation. They become visible only when something fails. Implication: The operational overhead created by abstraction layers can be understood as an abstraction tax: a cost paid by the platform team in exchange for simplified interfaces offered to users. ...

March 24, 2026 · 1 min · 105 words · Andre Rocha
FN-0010
Cloud-Native Fragility

Cloud-Native, Same Old Fragility

Modern systems are distributed. But fragility didn’t disappear. It just became harder to see. They run across clusters, regions, providers . They are observable, containerized, orchestrated . ...

March 23, 2026 · 3 min · 549 words · Andre Rocha

Abstractions Simplify Usage, Not Operation

Observation: Platform abstractions reduce cognitive load for users. A developer deploying an application rarely needs to understand how scheduling, networking, storage provisioning, or cluster lifecycle actually work. The interface becomes simple: deploy, expose, scale. However, the operational side of the platform moves in the opposite direction. Each abstraction layer introduces additional controllers, reconciliation loops, networking paths, and state dependencies that must be understood when something fails. Implication: Abstractions successfully simplify usage, but they rarely simplify operation. ...

March 12, 2026 · 1 min · 106 words · Andre Rocha
FN-0006