The 'if this goes down, I riot' self-hosted app

Melon Husk™@sh.itjust.works · 7 months ago

The 'if this goes down, I riot' self-hosted app

fhoekstra@feddit.nl · 7 months ago

I don’t believe you, but I’d like to be proven wrong.

I expect you have a UPS that feeds your hosts and networking equipment and something like ZFS for disk redundancy. This protects against the most common failures and is usually enough, but there are still single points of failure in such a setup, that are not as common, not as hard to deal with through manual intervention, and quite difficult to protect with redundancy.

I would be surprised if you are protected against the following single points of failure without manual intervention:

NAS machine (not just disk) failure. You would need to have a multi-node distributed storage, like Ceph, to protect against this.
Networking equipment failure. I think you can do some magic with BGP to do this, but I’m not a network engineer and I’ve never set up a redundant network.

curbstickle@anarchist.nexus · 7 months ago

Ceph for the proxmox cluster, 2x48 port switch + 16 port 10gbit as the core, 2xNAS (technically one is the backup, and there would be a few moments of downtime as the containers restart - a different container with the same config pointed to the backup NAS instead).

UPS and internet are the SPoF.

fhoekstra@feddit.nl · 7 months ago

Impressive!

octobob@lemmy.ml · edit-2 7 months ago

One thing I’d add is a whole house surge suppressor.

I saw the power lines arcing to either each other or the bamboo outside our house last week during a bad storm.

A whole house surge suppressor is only like $100, I’m gonna get one soon and install it. I saw it’s best to install it as close as possible to the main incoming power lugs, one lead on each leg of the split phase 120/240.

A UPS will protect against surges but it’s just a good idea with how many appliances and devices have circuit boards in homes these days. Like your furnace, oven, washing machine, game console, TV, etc.

I had an insane surge last winter so it’s a long time coming haha. I woke up and half my circuits were off. I measured 170v to gnd on one of the legs. Power company and fire dept had to show up to fix it.

Power is ehh not great where I live.

Edit: for your point about a NAS failure. If that were to happen, since I use unRAID, I could just throw the disks on any Linux PC and my data would be fine.