This week we look at chapter 9, "Simplicity," and chapter 10, "Practical Alerting from Time-Series Data," from Site Reliability Engineering: How Google Runs Production Systems.
If you've found these episodes useful, please send me an email at [email protected] and tell me about it.
The companion discussions to this podcast happen Thursdays at 7 pm Eastern. You can join by signing up at bookclub.dev/thursdays.
In the episode, I mention some books I've enjoyed.
Prometheus and AppInsights are mentioned as tools for monitoring and alerting.
Next week we'll look at chapter 11, "Being On-Call," and chapter 12, "Effective Troubleshooting."