Monitoring¶
Observability stack — metrics, logs, dashboards, and alerting.
Grafana¶
Dashboards and visualisation. Connects to Prometheus, Loki, and InfluxDB to provide unified monitoring views across the whole stack.
Documentation · values.sops.yaml
Prometheus Stack¶
Metrics collection and alerting. Deployed via kube-prometheus-stack, includes Prometheus, Alertmanager, kube-state-metrics, and node exporters.
Documentation · values.sops.yaml
Loki¶
Log aggregation. Collects and indexes logs from all pods and ships them to Grafana for querying with LogQL.
Alloy¶
OpenTelemetry collector and telemetry pipeline. Scrapes metrics, tails logs, and forwards traces to the appropriate backends (Prometheus, Loki).
Uptime Kuma¶
Service uptime monitoring. Checks HTTP endpoints, TCP ports, and DNS records at configurable intervals and sends alerts on downtime.
ntfy¶
Self-hosted push notification server. Receives alerts from Prometheus Alertmanager, Uptime Kuma, and other services and delivers them to mobile devices.