VirtualTam's bookmarks
34 bookmarks found
-
-
- https://grafana.com/docs/tempo/latest/getting-started/
- https://grafana.com/docs/tempo/latest/configuration/
- https://github.com/grafana/tempo
- https://grafana.com/blog/2020/10/27/announcing-grafana-tempo-a-massively-scalable-distributed-tracing-system/
- https://grafana.com/blog/2020/11/09/trace-discovery-in-grafana-tempo-using-prometheus-exemplars-loki-2.0-queries-and-more/
- https://grafana.com/go/observabilitycon/observability-with-logs-grafana/
- https://grafana.com/go/observabilitycon/tracing-made-simple-with-grafana/
- https://gouthamve.dev/tempo-a-game-of-trade-offs/
-
Open questions, potential solutions or workarounds:
- Loki does not support out-of-order message ingestion
- There is not Java client for Loki
- The Wiced MQTT queue might be limited depending on the volume of logs
-
On-call team rotas
2020-05-26 Handling rotas:
- https://rachelbythebay.com/w/2019/01/14/rotation/
- https://rachelbythebay.com/w/2019/01/28/oncall/
- https://grafana.com/blog/2019/07/01/pro-tips-how-amgen-manages-on-calls-and-burnout-with-grafana/
- https://grafana.com/blog/2019/05/29/grafana-labs-at-kubecon-foolproof-kubernetes-dashboards-for-sleep-deprived-on-calls/
- https://www.youtube.com/watch?v=WQsyMguI8xQ
On-call stories:
-
GPU monitoring with Prometheus
2019-04-27 - https://stackoverflow.com/questions/8223811/top-command-for-gpus-using-cuda
- https://github.com/wookayin/gpustat
- https://github.com/zhebrak/nvidia_smi_exporter (Go)
- https://github.com/tankbusta/nvidia_exporter (C, Go)
- https://github.com/teh/nvidia-smi-prometheus (Haskell)
- https://github.com/gsperry2011/prometheus/blob/master/nvidia/ewbf_scraper.py (Python)
- https://grafana.com/dashboards/1500