infra:operations:proxmox-backups
Differences
This shows you the differences between two versions of the page.
| Next revision | Previous revision | ||
| infra:operations:proxmox-backups [2026/05/23 11:59] – created atluxity_idp.hackeriet.no | infra:operations:proxmox-backups [2026/05/23 13:29] (current) – atluxity_idp.hackeriet.no | ||
|---|---|---|---|
| Line 14: | Line 14: | ||
| * [[infra: | * [[infra: | ||
| - | A weekly vzdump job was observed from the cluster configuration while documenting host006. The observed job used: | + | A weekly vzdump job was observed from the cluster configuration while documenting host006 |
| * Schedule: Sunday 03:00 | * Schedule: Sunday 03:00 | ||
| Line 25: | Line 25: | ||
| Treat this as observed state, not as a reviewed backup policy. | Treat this as observed state, not as a reviewed backup policy. | ||
| - | ===== Known issue: host006 disk pressure | + | ===== Host006 storage finding |
| - | host006 | + | host006 |
| + | |||
| + | * Physical disk observed: about 1 TB NVMe. | ||
| + | * LVM volume group observed: about 953G, with 0G free. | ||
| + | * Root filesystem observed: about 94G usable, about 94% used, about 6.2G free. | ||
| + | * Proxmox local storage is on the root filesystem. | ||
| + | * / | ||
| + | * local-lvm is the large thin pool for VM disks, not file-based backup dumps. | ||
| + | |||
| + | Several recent vzdump failures on host006 had errors like: | ||
| * vma_queue_write: | * vma_queue_write: | ||
| - | Disk pressure on host006 is the first suspect for those failures. Investigate storage before changing guests. | + | Disk pressure on host006 |
| ===== Contrast: host007 ===== | ===== Contrast: host007 ===== | ||
| - | host007 | + | host007 |
| + | |||
| + | * Root filesystem observed: about 18% used. | ||
| + | * / | ||
| + | * Proxmox local storage observed around 17% used. | ||
| + | * local-lvm observed around 27% used. | ||
| + | |||
| + | This makes host006 disk-pressure failure mode much less likely on host007. | ||
| + | |||
| + | |||
| + | ===== Remediation options ===== | ||
| + | |||
| + | Possible ways to reduce recurrence risk: | ||
| + | |||
| + | * Short term: review and remove obsolete files from / | ||
| + | * Better medium term: add dedicated backup storage for host006, either mounted at / | ||
| + | * Longer term: use Proxmox Backup Server for clearer retention and deduplicated backups. | ||
| + | * Avoid casual in-place LVM reshaping; it can put VM disks at risk and should only be done with a maintenance window and recovery plan. | ||
| ===== First checks ===== | ===== First checks ===== | ||
/srv/hackeriet-wiki/dokuwiki/data/attic/infra/operations/proxmox-backups.1779537550.txt.gz · Last modified: by atluxity_idp.hackeriet.no