Polkadot Project
Background
Infrastructure team is running several infrastructure components of Polkadot and Kusama Mainnet. These are network critical services and require high availability and responsive monitoring.
List of Services
- Snapshot Service
- Kusama Validator (1KV Program) - On-hold
Deployment or Upgrade Steps
We use Ansible to deploy the services in a containerized environment. Once the server is initialized, we use the same command for the first time deployment and subsequent upgrades.
Deployment type: Recreate
- Get the latest image tag from here
- Update the tag for the respective host in polkadot-execution.
- If it is a snapshot host, make sure the snapshot service is not running and there is no active export process.
- Dry-Run the respective ansible command with
--diff --checkto make sure what changes will be applied. - Re-run the actual command to apply the changes.
- Check the deployment and make sure the service is up.
- Raise the PR and ask the team for approval.
Runbook for Troubleshooting
We track actionable alerts, and each alert has steps to fix them.
Checkout Polkadot runbook here.