As ICON becomes decentralized, it will be of utmost importance to keep nodes up and running to ensure a secure and well-performing network, as well as happy voters (they don’t get their returns if the node is down). To that end, I’d like to start a discussion on different ways we can improve resiliency. This thread opens with some high-level ideas to start the conversation, and then dives into lower-level details that are specific to AWS and Ubuntu nodes. This is by no means a comprehensive list, and I look forward to the discussion!
To begin, I’d like to post an overview of resiliency and why I believe it is important: