Resiliency Checklist
In order to determine the necessary skills, hardware, and good practices needed for a Stake Pool Operators to be more resilient to various unforeseen events that may take down their pools from the network we have broken the checklist into the following sections: Stake Pool Operations Recommended Skills and Resources, Resilience Options, and Redundancy
Stake Pool Operator Skills
Stake Pool Operation
Improved Resilience Options
Designated off-grid power duration (12-24 hrs)
Redundancy (data, software, infrastructure, and Hardware)
Data and Software Backup
Backup keys and passwords
Written Down on paper
Electronic backups on USB/external hard drive
Backup Configuration Files
Backup Node Software (node and cli binaries)
Backup DB snapshot
Backup Tools/software
Hardware
Spare node hardware (physical location with access)
Spare node hardware (cloud based)
Spare SSD/ hard drives
Cables
Ethernet Cables
SSD/HDD Adapters and Cables
Back Up Power Supplies for Nodes
Internet
Main ISP
Fiber
DSL
Satellite
Cable/Coaxial
Backup ISP
Cellular/4G-5G wireless
Satellite/Starlink
Secondary cloud based ISP (AWS, Azure, GCP, etc)
Secondary location ( should be out of your region) & ISP with your own hardware
Power supply
Failover
UPS
Solar panels + batteries
Generator
Last updated