SREcon22 Americas - Modeling Alert Quality

Опубликовано: 05 Октябрь 2024
на канале: USENIX
354
2

SREcon22 Americas - Modeling Alert Quality

Moshe Zadka

What are good alerts? What are bad ones?

The difference is important for reliability. But how do you measure it? What kind of trade-offs are possible?

A model of alert quality will be presented, including parameters like cost and accuracy.

Moshe has been doing SRE since before the word existed. From build pipeline to monitoring, and from 5 engineers to 20,000, Moshe has seen SRE from many different perspectives. They are the author of "DevOps in Python."

View the full SREcon22 Americas program at https://www.usenix.org/conference/sre...