This page documents maintenance for the Hackeriet Proxmox hosts in klynge001. It is a runbook and maintenance log.
Scope: host006 and host007 in the klynge001 Proxmox cluster.
Actions performed:
105 / blade510 / ingress511 / app-01Final state after maintenance:
Follow-up actions completed after the maintenance:
munin-node.service.lock: migrate locks on moved/stopped VMs.105 / blade510 / ingress511 / app-01601 / idp1systemctl –failed was clear on both nodes after Munin retirement.munin-node.service had been failing for months on both host006 and host007. It is now intentionally retired on these hosts.onboot: 1 from before migration. Clear stale locks only after confirming there is no active migration task.pv1 during update-grub, but rebooted successfully on the new kernel. The active LVM metadata still contains an internal pv1 label.Work one host at a time. Do not reboot both host006 and host007 at once.
Before making changes on either host:
hostname -f pveversion -v uname -r pvecm status systemctl --failed --no-pager pvesm status df -h -x tmpfs -x devtmpfs qm list cat /etc/pve/jobs.cfg apt update apt list --upgradable test -f /var/run/reboot-required && cat /var/run/reboot-required || true
For host006, also check local storage pressure:
du -sh /var/lib/vz/dump /var/lib/vz/template/iso /var/log /var/cache/apt /root/proxmox-templates /var/lib/fail2ban
Update flow for each host:
pvecm status.pvesm status and df -h.qm list.systemctl –failed –no-pager.Suggested commands after review:
apt-get -s full-upgrade apt full-upgrade
When one node is rebooted, the remaining node may temporarily lose quorum. If that happens during planned maintenance, set expected votes to 1 on the remaining node:
pvecm expected 1 pvecm status
After the rebooted node rejoins, confirm the cluster has returned to two nodes and expected votes 2:
pvecm status
Do not use this as an incident workaround without understanding which node has the correct cluster state.
After each host update or reboot:
hostname -f pveversion uname -r pvecm status systemctl --failed --no-pager pvesm status df -h -x tmpfs -x devtmpfs qm list apt list --upgradable
Check guests affected by the touched host:
qm status <vmid> qm agent <vmid> ping qm guest cmd <vmid> network-get-interfaces qm config <vmid> | sed -n '/^ipconfig/p;/^net/p' ping -c 2 <ip> nc -vz -w3 <ip> 22
Interpretation:
qm status running means the hypervisor sees the VM running.qm agent <vmid> ping means the guest OS and QEMU guest agent are responsive.ping means basic network path works, if ICMP is allowed.nc -vz -w3 <ip> 22 means SSH is listening, if SSH is expected for that guest.As of 2026-06-01, host006 and host007 are monitored in LibreNMS as Proxmox hypervisors. This replaces the old Munin host monitoring for these nodes.
LibreNMS records:
proxmox.proxmox application enabled with app instance klynge001.Host-side setup:
snmpd is installed, enabled, and running on both hosts.10.10.50.26:161/udp10.10.50.27:161/udplibrenms_klynge001.infrastructure/librenms-klynge001-snmpv3. Do not put the credential values in the wiki or NetBox./usr/local/libexec/librenms-proxmox.
extend proxmox /usr/bin/sudo /usr/local/libexec/librenms-proxmox
/etc/sudoers.d/librenms-proxmox allows the Debian-snmp user to run only that script via sudo.Firewall setup:
10.10.50.51 to the klynge001 IP set./etc/pve/firewall/cluster.fw:
IN ACCEPT -source 10.10.50.51/32 -dest +klynge001 -p udp -dport 161 -log nolog # LibreNMS SNMP polling from app-01
LibreNMS setup:
enable_proxmox is set to true.unix-agent poller module is disabled per device for host006 and host007, because this setup uses SNMP extend instead of the LibreNMS unix-agent on port 6556.Useful verification commands:
# On each Proxmox host systemctl is-active snmpd ss -lunp | grep ':161' sudo -u Debian-snmp sudo /usr/local/libexec/librenms-proxmox # From the LibreNMS container on app-01 snmpwalk -v3 -l authPriv -u librenms_klynge001 -a SHA -A '<auth password>' -x AES -X '<privacy password>' host006.hackeriet.no SNMPv2-MIB::sysName.0 snmpget -v3 -l authPriv -u librenms_klynge001 -a SHA -A '<auth password>' -x AES -X '<privacy password>' -Oqv host006.hackeriet.no .1.3.6.1.4.1.8072.1.3.2.3.1.2.7.112.114.111.120.109.111.120 lnms device:poll -m applications host006.hackeriet.no
pv1 warning.