SREcon22 Europe/Middle East/Africa - SRE and ML: Why It Matters

Published: 05 October 2024
Channel: USENIX

SRE and ML: Why It Matters

Todd Underwood, Google

Machine Learning is an incredibly hyped set of technologies. It seems that ML is becoming an important part of distributed computing. I'll review whether SREs need to know anything about ML yet (probably you do—sorry!). And since ML reliability is challenging, I'll suggest some changes required for most SREs and even some significant changes to our profession. Finally, I'll review the state of using ML to automate production with an extremely skeptical eye.

View the full SREcon22 Europe/Middle East/Africa program at https://www.usenix.org/conference/sre...