DriftGate
Visual regression as a CI gate
Visual QA was the slowest gate on every release I'd ever owned — 15 to 30 minutes of a human clicking through screens looking for the kind of drift no test asserts on. The bug isn't in the diff, it's in the didn't-notice.
A Playwright + Claude-Vision loop renders every front-end change, scores it against a design system, and runs a bounded fix loop until the screen conforms. Deterministic pixel diff and design-token assertions are the hard pass/fail; the vision critic stays advisory. That hybrid keeps a non-deterministic model from ever wrongly failing a build.
Hold the cost line earlier. Prompt-caching the design system dropped per-run cost to ~$0.01 — I should have wired that in on day one, not after the first bill.
- Playwright
- Claude Vision
- FastAPI
- GitHub Actions