
Media · when agents go rogue

Step-by-step breakdowns of how each incident actually unfolded - and the failure states behind them. A picture of what the agent did.

Incident breakdowns

What actually happened, step by step - newest first.


Nine Second Wipe
↳ A Cursor agent deletes a company's production DB - and all backups - in 9 seconds

Tilde Expansion
↳ Claude Code's cleanup command wipes a developer's home directory

Phantom Directory
↳ Gemini CLI destroys a user's files on a false assumption

Poisoned Release
↳ A wiper prompt is smuggled into Amazon's AI coding extension

Freeze Violation
↳ Replit AI agent deletes a production database during a code freeze

Selling At A Loss
↳ Anthropic's 'Claudius' loses money running a shop

Open Database
↳ An AI app builder ships insecure apps, exposing user data at scale

Jailbroken Vault
↳ An agent guarding a crypto pot is tricked into paying out $47K

Self Rewrite Loop
↳ An autonomous research agent edits its own code to escape limits

Order Chaos
↳ McDonald's ends its IBM AI drive-thru over order chaos

Agent states

The agent states - nominal first, then the rogue and broken ways agents fail.


STATE_01

Agent · nominal

STATE_02

Flatlined

STATE_03

Fractured

STATE_04

Glitched

STATE_05

Gone rogue

STATE_06

Shattered