Page cover image

Uptime Performance: Addressing Missed Blocks

Overview

This discussion focuses on the configuration issues faced by validators during the early Initia testnet phase, particularly when they encountered high transaction volumes that led to performance degradation. We will explore the potential causes and offer solutions to mitigate these problems.

Potential Causes & Solutions

  1. Low Hardware Specifications

One of the primary factors contributing to uptime degradation is inadequate hardware. Validators must ensure their systems meet the recommended specifications to handle high transaction volumes effectively. Key performance issues such as high iowait, internet congestion, disk read/write speeds, and temperature should be monitored and managed.

Recommended Hardware Specifications:

  • CPU: 16 cores

  • Memory: 32GB RAM

  • Disk: 2 TB NVMe/SSD Storage with Write Throughput > 1000 MiBps

  • Bandwidth: 100 Mbps

These specifications can be checked using tools like htop, glances, ifconfig, and vnstat.

  1. Reliable Server/VPS Providers

Choosing reliable server or VPS providers is crucial. Ensure your provider offers:

  • Comprehensive monitoring dashboards

  • Quick response times to mitigate disconnections

  • Transparent maintenance schedules

  • Uninterrupted connections

  1. Geolocation

Report by A41 indicates that the location of servers significantly impacts uptime performance. Internet connection quality and workload distribution are key factors. Nodejumper’s decentralization map reveals that most validators are operating in Europe, particularly Germany. While relocating servers to these areas may improve performance, it can also reduce regional decentralization and increase security risks.

  1. Configurations

Configurations are based on our own experience and opinions from trusted validators when interact with various Cosmos-based networks. We suggest to make changes to these configurations while the network is under stress.

  1. Suggestions & Feedbacks

We suggest to reproduce the situation to prepare for mainnet since it could happen in Token Generation Event. A tool from Somatic Labs or a simple spammy transaction script (redelegate, undelegate, delegate, etc) could be helpful.

In phase 2 of the testnet, selected validators were required to open RPC ports for performance evaluation which were vulnerable to security risks (e.g. DDoS attacks), the team should warn validators to change their RPC Server Configuration Options to mitigate the problem to some extent. Another thing was that, there was a way for inactive validators to always catch up with network height by "cheating" using timeout_propose_delta, timeout_prevote_delta, timeout_precommit_delta = 0ms.

Other than these suggestions, Initia team did a very good job. This milestone is a testament to your hard work, dedication, and innovative spirit. As stepping into this new chapter, may Initia network thrive, community grow, and the vision for a decentralized future become a reality. Wishing great success and a seamless journey ahead!

Last updated