Senior Staff SRE @ Zendesk
Fred is a resident SLOgician and Observability Economist at Zendesk, where he works to ensure reliability is world class through the use of SLOs and Error Budgets. He previously worked with high scale operational telemetry at Circonus, and before that at Turnitin.com. Fred recently received a patent for Inverse Cumulative histograms, which Zendesk uses to power SLOs and Error Budgets.
SLIs, SLOs, and Error Budgets at Scale
How can one democratize the implementation of SLIs, SLOs, and Error Budgets to put them in the hands of a thousand engineers at once? At Zendesk we developed simple algorithms and practical approaches for implementing SLIs, SLOs, and Error Budgets at scale using a number of observability tools. This talk will show the approaches developed and how we were able to manage observability instrumentation across dozens of teams quickly in a complex ecosystem (CDN, UI, middleware, backend, queues, dbs, queues, etc). This talk is for engineers and operations folks who are putting SLIs, SLOs, and Error Budgets into practice. Attendees will come away with concrete examples of how to communicate and implement Error Budgets across multiple teams and diverse service architectures.Watch Talk