I recently built something I wish I'd had at my last startup. We were building on top of an LLM, and it felt like trying to nail jello to a wall. Fix one issue, cause three more. The worst part? We couldn't even measure how bad the regressions were.