What is this?

A free, browser-based incident response simulator for engineers who work with AI systems. You face 6 randomized production incidents — latency spikes, hallucination waves, cost explosions, WebSocket dropouts — and pick the strongest fix under a 25-second timer. Every decision shifts four live metrics: latency, accuracy, cost, and reliability.

Earn bonus points for speed, streaks, and explaining your reasoning. Your final score earns you a role badge, ranging from "Needs More On-Call Reps" to "Staff AI Systems Thinker."

What you'll practice

Incident Response

Triage production alerts and pick the right mitigation under time pressure.

Tradeoff Analysis

Evaluate how each fix impacts latency, accuracy, cost, and reliability at once.

Systems Thinking

Understand cascading failures and second-order consequences of operational decisions.

Communicating Under Pressure

Articulate your reasoning clearly, the same skill tested in real on-call rotations and interviews.

Made for

AI/ML Engineers, SREs & Platform Engineers, Backend Engineers, Engineering Managers

Done playing? Try the real thing.

The on-call game tests pattern recognition. Rubduck mock interviews test how you communicate and reason through novel problems with a live AI interviewer — the skill that actually decides interview outcomes.
