Skip to main content

Operations

The operations knowledge base — how we run, monitor, and fix the OpenDesk platform by hand. This is the day-2 half of the docs: the procedures an operator runs against a live cluster when automation isn't enough.

What lives here

  • Runbooks — step-by-step procedures for known situations: manual upgrade dances, break-glass recovery, IdP/Keycloak fixes, certificate issues, Nextcloud/OX/Element bootstrap deadlocks, and more.
  • Incidents — write-ups of past incidents and how they were resolved.
  • Manual deployment — the by-hand equivalent of the deployer's steps (prerequisites, in-cluster infra, rendering & pushing manifests, certificates, Keycloak/Nextcloud/OX init, validation).
  • Monitoring & operations concepts — the Betriebskonzept and how the central monitoring platform is run.

Every runbook states its symptom, when (not) to run it, its blast radius + rollback, the steps, and how to verify.

Looking for how the deployer itself is built? That's under Development.