alright folks, let’s get real. we all have our sprawling digital fortresses, carefully constructed brick by brick. but there’s always that one piece of software, that one perfectly tuned instance, where if it so much as hiccuped, you’d be ready to throw the entire homelab out the window and start fresh in a cave. what is it for you? what’s your absolute, non-negotiable, ‘i will personally visit the datacenter if this fails’ self-hosted application? for me, it’s my media server stack. my wife would disown me. don’t let me down, arr suite.


I don’t believe you, but I’d like to be proven wrong.
I expect you have a UPS that feeds your hosts and networking equipment and something like ZFS for disk redundancy. This protects against the most common failures and is usually enough, but such a setup still has single points of failure that are not as common, not as hard to deal with through manual intervention, and quite difficult to protect with redundancy.
I would be surprised if you are protected against the following single points of failure without manual intervention:
Ceph for the Proxmox cluster, 2x 48-port switches + a 16-port 10 Gbit switch as the core, 2x NAS (technically one is the backup, and there would be a few moments of downtime as the containers restart - a different container with the same config gets pointed at the backup NAS instead).
UPS and internet are the SPoFs.