Video recording of a presentation given at the JDK IO conference, organized by the Danish Java User Group (Javagruppen), at the IT University in Copenhagen in June 2018.
https://jdk.io/jdk-io-2018/speakers/2...
Nailing down bugs in distributed systems
Speaker: Kamil Szymański
Finding bugs in distributed systems is challenging.
Finding bugs in distributed systems in production is even harder because of time pressure, especially if the bug is on the critical path.
At times like that, you can't afford to guess what's wrong; you have to make decisions based on real data.
In this slide-free session, we will see how we can use live data to discover bugs more quickly and find their root causes more easily.
Moreover, we will discuss how to deal with production outages and recover from them quickly.