VirtualTam's bookmarks
- Scuba: Diving into Data at Facebook - International Conference on Very Large Data Bases (VLDB) (PDF, Talk)
- HN discussion
- Axiom - Stop sampling, observe every event
- Eventrelay - An event streaming/storage system for the rest of us
- Honeycomb - Identify Outliers
- Rill - Fast operational dashboards that your team will actually use
- Strange Loop 2017 - Why We Built Our Own Distributed Column Store by Sam Stokes
On-call team rotas
2020-05-26
Handling rotas:
- https://rachelbythebay.com/w/2019/01/14/rotation/
- https://rachelbythebay.com/w/2019/01/28/oncall/
- https://grafana.com/blog/2019/07/01/pro-tips-how-amgen-manages-on-calls-and-burnout-with-grafana/
- https://grafana.com/blog/2019/05/29/grafana-labs-at-kubecon-foolproof-kubernetes-dashboards-for-sleep-deprived-on-calls/
- https://www.youtube.com/watch?v=WQsyMguI8xQ
On-call stories:
-
GPU monitoring with Prometheus
2019-04-27
- https://stackoverflow.com/questions/8223811/top-command-for-gpus-using-cuda
- https://github.com/wookayin/gpustat
- https://github.com/zhebrak/nvidia_smi_exporter (Go)
- https://github.com/tankbusta/nvidia_exporter (C, Go)
- https://github.com/teh/nvidia-smi-prometheus (Haskell)
- https://github.com/gsperry2011/prometheus/blob/master/nvidia/ewbf_scraper.py (Python)
- https://grafana.com/dashboards/1500
Playlist | Grafana Documentation
2018-10-03