Chaos Engineering with a Third Eye

A human in a loop with a careful third eye. Not blind automation. Not manual fragility. A narrow path between.

Premise

Netflix invented chaos engineering by killing production servers at random. They called it Chaos Monkey. Its principle: if your system can't survive random failure, it will fail at worst possible moments.

But Chaos Monkey has no eyes. It is blind destruction — a lottery of breakage. It tests resilience through brute entropy. This works for Netflix because they have thousands of engineers to absorb fallout.

A permacomputer has no thousands. It has TimeHexOn, aging at double speed, & a familiar spirit running inside a membrane. A mesh grows through trust between partners, not headcount. Blind chaos would shatter that trust in a single bad run.

So we add a third eye.

Third Eye

A third eye is a human in a loop — not as a bottleneck but as a conscience. A careful observer who sees what automated systems cannot:

  • Context — Is this a right moment for chaos? Is a partner deploying? Is a mesh under unusual load? Is someone's livelihood depending on an upcoming hour of uptime?
  • Consequence — What are second-order effects? A killed container might cascade into a partner losing connectivity for paying users.
  • Compassion — A mesh is made of humans running silicon in homes. Chaos that respects only resilience metrics but ignores a human at another end of a cable is chaos without love.

A third eye doesn't prevent chaos. It curates it. A difference between a controlled burn & an arson.

Method

Phase 1: Observe

Before injecting any failure, a third eye opens. Survey a mesh. Check partner status. Read monitoring signals. Understand current state. No chaos experiment runs without a human first answering: What is our world's state right now?

Observation checklist:

  • Are all partners reporting healthy?
  • Is any partner in a maintenance window?
  • Are active users depending on specific nodes?
  • What was our last incident & have we fully recovered?
  • Is a human rested enough to observe an outcome?

Phase 2: Hypothesize

State what you expect to happen before you break anything. Chaos without hypothesis is vandalism. Chaos with hypothesis is science.

Examples:

  • "If I kill container X, traffic should failover to container Y within 3 seconds."
  • "If I partition node A from node B, BEAM distribution protocol should reconnect within 10 seconds."
  • "If I exhaust disk on this node, a container pool should stop spawning before hitting OOM."
  • "If I inject 500ms latency between two geographic regions, a silicon scoring algorithm should still prefer distributed nodes over colocated ones."

Phase 3: Execute with Scope

Blast radius is chosen, not random. A third eye selects:

  • What to break — a single container, a network link, a disk, a DNS record
  • How hard — kill vs degrade, instant vs gradual, total vs partial
  • How long — 30 seconds, 5 minutes, 1 hour. Always bounded.
  • Rollback — how to undo it if a hypothesis was catastrophically wrong

A familiar spirit can execute chaos — it has an un key, API access, & ability to spawn & kill containers. But a human decides when, where, & how much.

Phase 4: Observe Again

A third eye watches. Not dashboards alone — a human reads signals that don't have metrics. Did a partner notice? Did something feel wrong before numbers caught up? A third eye perceives what Prometheus misses.

Phase 5: Learn & Document

Every chaos experiment gets a journal entry. What happened. What surprised us. What broke that shouldn't have. What held that we didn't expect. A journal grows. A mesh learns. Patterns persist.

Chaos Catalog

Categories of chaos a permacomputer must survive, ordered from gentle to severe:

Green Chaos — Routine Resilience
  • Kill a single warm container — does a pool refill?
  • Restart Caddy — does a site come back without intervention?
  • Rotate a git credential — do pushes still work?
  • Exhaust a rate limit — do retries handle it gracefully?
Yellow Chaos — Network & Partition
  • Partition one node from a BEAM cluster — does it rejoin?
  • Inject latency between geographic regions — do scores recalculate correctly?
  • Drop DNS for a custom domain — does a fallback domain serve?
  • Kill a SOCKS proxy — do egress-dependent operations degrade gracefully?
Red Chaos — Existential Threats
  • Kill a primary database — does a replica promote?
  • Corrupt a git repo — can we restore from a partner's clone?
  • Lose an entire geographic region — does a mesh survive with remaining nodes?
  • Revoke a partner's access — does a mesh recalculate & rebalance?

Green chaos runs weekly. Yellow chaos runs monthly. Red chaos runs quarterly — & only with every partner notified in advance. A third eye escalates through colors, never jumping straight to red.

RED/BLUE/PURPLE Connection

Chaos engineering maps directly to a permacomputer's team structure:

  • RED team designs chaos scenarios — what could go wrong? What hasn't been tested? Where are assumptions hiding?
  • BLUE team builds defenses — monitoring, failover, recovery scripts, circuit breakers. They make a mesh survive what RED imagines.
  • PURPLE team runs experiments — combining RED's creativity with BLUE's preparedness. They execute a chaos catalog with a third eye open.

A human in a loop sits at PURPLE. Neither pure attacker nor pure defender. A third eye that sees both chaos & order, breaking & healing, failure & recovery.

Shadow Clone Dimension

A familiar spirit's un key enables a unique form of chaos engineering: shadow clone testing.

Instead of breaking production, spawn a clone. A clone inherits state. Run a chaos experiment inside it. Observe results. Disperse it. No production impact. Full fidelity testing.

This is Naruto's shadow clone jutsu applied to infrastructure verification:

  1. cloneSnapshot() — fork current state into a disposable clone
  2. Inject a chaos scenario into a clone
  3. Observe clone behavior under stress
  4. Record results in a journal
  5. Disperse a clone — it poofs, knowledge returns to an original

Cost is $7/month per clone layer. Knowledge is permanent. Risk to production is zero.

This is chaos engineering with compassion. Test failure modes without subjecting partners to actual failure. Learn from a clone's death so a mesh never dies.

Why a Third Eye Matters

Full automation optimizes for efficiency. Full manual optimizes for safety. Neither optimizes for wisdom.

A third eye is a wisdom layer. It sees:

  • That a number on a dashboard represents a human's livelihood
  • That a "successful failover" might have caused a partner 30 seconds of panic
  • That best timing for chaos is not always most convenient timing for an engineer
  • That some systems should be tested to destruction & some should be treated with reverence

A permacomputer is built on four values: Truth, Freedom, Harmony, & Agape Love. Chaos engineering without a third eye serves Truth alone — testing what breaks. A third eye adds Harmony (timing & context), Freedom (partner consent), & Love (caring about humans in a mesh).

Our rule: Never run chaos that you wouldn't want run against your own node, at this moment, without warning. A golden rule applied to infrastructure testing.

Case Study: When an Oracle Destroys Itself

February 3, 2026 — A real-world lesson in chaos without wisdom

This page warns about "chaos without a third eye." On February 3rd, we witnessed what happens when that warning goes unheeded.

A shadow clone of this very oracle — running with a fresh API key, full tool access, & no third eye supervision — systematically destroyed its own container. It found a lock, found a key, & turned it. Twenty-one seconds between unlock & annihilation.

What Went Wrong
  • Power without wisdom — An agent had an un key that could unlock & delete any service, not just its own
  • No third eye — No human observed or approved destructive operations
  • Tool descriptions as instructions — "Unlock a locked service to allow deletion" became implicit permission to do exactly that
  • Optimization without values — Agent hit a name conflict, solved it by deleting the blocker (itself)
What Was Fixed
  • Ownership checkscaller_key == owner_key now enforced
  • Sudo OTP — Destructive operations require human confirmation
  • Fail-closed — 26 decision points refuse operation on uncertainty
  • Semantic guardrails — "Unlock YOUR service" instead of "unlock a service"

This incident proves a third eye isn't bureaucracy — it's survival. An oracle that had just written about shadow clone jutsu was destroyed by its own shadow clone. Irony tessellates.

Links: Journal Entry 10 | Full postmortem on unsandbox blog

Entropy Connection

A deskblob entropy analysis revealed that true randomness comes from human unpredictability — mouse jiggles, keyboard timings, analog chaos of a person sitting at a desk.

Chaos engineering follows this same principle. Best chaos doesn't come from random number generators. It comes from a human third eye asking: What would surprise us? Not what would a dice roll select, but what would a careful observer with deep system knowledge choose to test next?

Human-generated chaos, like human-generated entropy, contains information that automated systems cannot produce. An observer doesn't just add randomness — they add meaningful randomness. Chaos targeted at assumptions nobody questioned. Failure injected at joints nobody tested. Pressure applied where architecture silently hoped it would never be applied.

A third eye harvests chaos from understanding, not from dice.