VirtualTam's bookmarks

    1. Blocking TCP port 53 traffic leads to very strange failures. Don't.
    2. The source you're looking at is not the code running in production.
    3. "Prod" is just another name for "staging".
  1. With so many interacting components, the number of things that can go wrong in a distributed system is enormous. You’ll never be able to prevent all possible failure modes, but you can identify many of the weaknesses in your system before they’re triggered by these events. This report introduces you to Chaos Engineering, a method of experimenting on infrastructure that lets you expose weaknesses before they become a real problem.