Skip to content

Monitoring

Observability stack — metrics, logs, dashboards, and alerting.


Grafana

Dashboards and visualisation. Connects to Prometheus, Loki, and InfluxDB to provide unified monitoring views across the whole stack.

Documentation  ·  values.sops.yaml


Prometheus Stack

Metrics collection and alerting. Deployed via kube-prometheus-stack, includes Prometheus, Alertmanager, kube-state-metrics, and node exporters.

Documentation  ·  values.sops.yaml


Loki

Log aggregation. Collects and indexes logs from all pods and ships them to Grafana for querying with LogQL.

Documentation  ·  values.yaml


Alloy

OpenTelemetry collector and telemetry pipeline. Scrapes metrics, tails logs, and forwards traces to the appropriate backends (Prometheus, Loki).

Documentation  ·  values.yaml


Uptime Kuma

Service uptime monitoring. Checks HTTP endpoints, TCP ports, and DNS records at configurable intervals and sends alerts on downtime.

Documentation  ·  values.yaml


ntfy

Self-hosted push notification server. Receives alerts from Prometheus Alertmanager, Uptime Kuma, and other services and delivers them to mobile devices.

Documentation  ·  values.sops.yaml