Proxmox maintenance

This is a runbook for planned maintenance on the Hackeriet Proxmox hosts in klynge001. It is documentation and procedure, not inventory. Use NetBox for canonical device, IP, cabling, and VM placement data.

Current scope

Planned hosts: host006 and host007.

Current goals:

Announcement draft

Planned Proxmox maintenance for Hackeriet

I plan to do maintenance on the Proxmox hosts host006 and host007 in the klynge001 cluster within the next few days.

Scope:

Expected impact:

DNS and service risk

Live DNS checks on 2026-05-23 showed that hackeriet.no has two authoritative nameservers:

Both authoritative nameservers served the same SOA serial when checked. DNS resolution should survive a short outage of ns0 because ns.hyp.net is external and synced. Do not treat this as service redundancy: it keeps names resolvable, but the services behind those names can still be down.

Important service dependencies observed:

Maintenance implications:

Certificate automation for internal Proxmox hostnames is documented at Proxmox ACME DNS automation.

Pre-maintenance checks

Run on both host006 and host007 before making changes:
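
A minimal sketch of such a check run, assuming the same read-only commands that the per-host procedure in this runbook names (pvecm status, pvesm status, df -h, systemctl --failed --no-pager). The function name pre_checks is hypothetical and only groups the commands so the output is easy to paste into a maintenance log.

```shell
# Run each read-only check in turn and label its output.
# pre_checks is a hypothetical helper name, not an existing tool.
pre_checks() {
    for cmd in "pvecm status" "pvesm status" "df -h" "systemctl --failed --no-pager"; do
        echo "== $cmd =="
        # Unquoted on purpose so the string splits into command and arguments.
        $cmd || echo "WARN: '$cmd' exited non-zero"
    done
}
```

Calling pre_checks on each host prints one labelled section per command. Nothing in it changes state, so it is safe to run more than once.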

On host006, also check local storage pressure:
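
One possible way to check this, sketched under the assumption that the pressure point is still the root filesystem and the backup directory /var/lib/vz/dump described in the Host006 notes. The function names and the 85 percent default limit are illustrative assumptions, not agreed thresholds.

```shell
# Extract the use percentage of the first filesystem from `df -P` output on stdin.
parse_use_pct() {
    awk 'NR==2 { sub(/%/, "", $5); print $5 }'
}

# Warn when the root filesystem is at or above a limit (default 85, an assumed value).
check_root_pressure() {
    limit="${1:-85}"
    pct=$(df -P / | parse_use_pct)
    if [ "$pct" -ge "$limit" ]; then
        echo "WARN: / is ${pct}% full (limit ${limit}%)"
        # Show how much of that is local backups, if the directory exists.
        du -sh /var/lib/vz/dump 2>/dev/null
        return 1
    fi
    echo "OK: / is ${pct}% full"
}
```

Running check_root_pressure with no argument uses the default limit; pass a number to use a different one.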

Before rebooting anything, check DNS redundancy:

The SOA serial should match.
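
A sketch of how the serial comparison could be scripted, assuming dig is installed on the machine running the check. It looks up the nameservers from the NS records instead of hard-coding them. The function names collect_serials and serials_match are hypothetical.

```shell
# Print "nameserver serial" for every authoritative nameserver of a domain.
collect_serials() {
    domain="$1"
    for ns in $(dig +short NS "$domain"); do
        # In `dig +short SOA` output the serial is the third field.
        serial=$(dig +short SOA "$domain" "@$ns" | awk '{print $3}')
        echo "$ns ${serial:-unreachable}"
    done
}

# Read "nameserver serial" lines on stdin; succeed only if all serials are identical
# and no nameserver was unreachable.
serials_match() {
    awk '{ seen[$2]++ }
         END {
             n = 0
             for (s in seen) n++
             exit ((n == 1 && !("unreachable" in seen)) ? 0 : 1)
         }'
}
```

For example, collect_serials hackeriet.no | serials_match exits with status 0 only when every nameserver answered with the same serial.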

Maintenance procedure

Work one host at a time. Do not reboot both host006 and host007 at once.

Suggested order:

  1. Start with host006 if the main concern is storage and backup health.
  2. Start with host007 if keeping the ingress services hosted on host006 stable is the priority.

For each host:

  1. Confirm cluster state with pvecm status.
  2. Confirm storage state with pvesm status and df -h.
  3. Review failed units with systemctl --failed --no-pager.
  4. Run apt update.
  5. Review apt list --upgradable.
  6. Apply updates only after reviewing the package set.
  7. Reboot only if required or clearly useful.
  8. After reboot, wait for the node to return and confirm cluster health before touching the next host.

Suggested update commands, after review:
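
A hedged sketch of an update run that forces the review step before anything is installed. It assumes apt dist-upgrade is the intended upgrade command, which is what the Proxmox VE documentation recommends over a plain apt upgrade. The helper names confirm and update_host are hypothetical.

```shell
# Ask a yes/no question on stdin; only a literal "y" counts as yes.
confirm() {
    printf '%s [y/N] ' "$1"
    read -r answer
    [ "$answer" = "y" ]
}

# Refresh package lists, show the pending set, and upgrade only after confirmation.
update_host() {
    apt update || return 1
    apt list --upgradable
    confirm "Apply these updates with dist-upgrade?" || { echo "Aborted."; return 1; }
    apt dist-upgrade
}
```

Run update_host as root on one host at a time. Answering anything other than y leaves the system unchanged apart from the refreshed package lists.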

Do not change guest VM configuration as part of host maintenance unless needed for recovery.

Post-host checks

After each host update or reboot:
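
A minimal sketch of a post-host check, assuming the quorum line in pvecm status output reads "Quorate: Yes" when the cluster is healthy. The function names are hypothetical. The storage and failed-unit commands are the same ones used in the per-host procedure.

```shell
# Succeed if `pvecm status` output on stdin reports a quorate cluster.
is_quorate() {
    grep -Eq '^Quorate:[[:space:]]+Yes'
}

# Confirm quorum first, then show storage state and failed units.
post_host_checks() {
    if pvecm status | is_quorate; then
        echo "OK: cluster is quorate"
    else
        echo "FAIL: cluster is not quorate, do not touch the next host"
        return 1
    fi
    pvesm status
    systemctl --failed --no-pager
}
```

A non-zero exit from post_host_checks is a signal to stop and investigate before moving on to the other host.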

Check DNS and key service names:
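
One way to script the name checks, sketched with dig. The list of names is deliberately left to the operator, since the key service names are not repeated here. The function name check_names is hypothetical.

```shell
# Resolve each name given as an argument and report names that return no records.
check_names() {
    rc=0
    for name in "$@"; do
        # Drop dig's ";;" diagnostic lines so an error message is not mistaken for an answer.
        addrs=$(dig +short "$name" | grep -v '^;' | tr '\n' ' ')
        if [ -n "$addrs" ]; then
            echo "OK   $name -> $addrs"
        else
            echo "FAIL $name did not resolve"
            rc=1
        fi
    done
    return $rc
}
```

Pass the service names as arguments, for example check_names hackeriet.no followed by whichever other names matter for the host that was touched.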

Check actual services, not only DNS, when the relevant host has been touched.
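
For services that speak HTTP or HTTPS, a simple probe along these lines may be enough. This is a sketch assuming curl is available. The URLs are supplied by the operator, the function name check_http is hypothetical, and non-HTTP services need their own checks.

```shell
# Request each URL and report whether it answered without an HTTP error.
check_http() {
    rc=0
    for url in "$@"; do
        # -f fails on HTTP errors, -sS stays quiet except for error messages,
        # --max-time bounds the wait so a hung service does not stall the run.
        if curl -fsS -o /dev/null --max-time 10 "$url"; then
            echo "OK   $url"
        else
            echo "FAIL $url"
            rc=1
        fi
    done
    return $rc
}
```

The function returns non-zero if any URL fails, so it can gate the decision to continue with the next host.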

Host006 notes

host006 has about 1 TB of physical storage, but Proxmox local storage is on the root filesystem. The root filesystem was previously close to full, and local backups under /var/lib/vz/dump were the main pressure point.

Known cleanup/remediation context is documented at Proxmox backups.

During maintenance, avoid casual LVM reshaping. It can put VM disks at risk and should only be done with a maintenance window and recovery plan.

Safety notes