Operations
The operations knowledge base — how we run, monitor, and fix the OpenDesk platform by hand. This is the day-2 half of the docs: the procedures an operator runs against a live cluster when automation isn't enough.
What lives here
- Runbooks — step-by-step procedures for known situations: manual upgrade dances, break-glass recovery, IdP/Keycloak fixes, certificate issues, Nextcloud/OX/Element bootstrap deadlocks, and more.
- Incidents — write-ups of past incidents and how they were resolved.
- Manual deployment — the by-hand equivalent of the deployer's steps (prerequisites, in-cluster infra, rendering & pushing manifests, certificates, Keycloak/Nextcloud/OX init, validation).
- Monitoring & operations concepts — the Betriebskonzept and how the central monitoring platform is run.
Every runbook states its symptom, when (not) to run it, its blast radius + rollback, the steps, and how to verify.
Looking for how the deployer itself is built? That's under Development.