Reliability
Features and best practices for reliable cluster operations
This section covers features and best practices for maintaining reliable EKS Anywhere cluster operations and upgrades.
Reliability Features
EKS Anywhere includes the following mechanisms to support reliable cluster operations:
- Admission Webhook Protection: Prevents custom admission webhooks from interfering with system operations and cluster upgrades
- Cluster Validation: Pre-flight checks identify potential issues before cluster creation or upgrades
- Support Bundle Collection: Collects diagnostic data for troubleshooting
Best Practices
For reliable cluster operations:
- Enable admission webhook protection for production clusters
- Keep clusters up to date with the latest EKS Anywhere releases
- Monitor cluster health and resource utilization
- Test upgrades in non-production environments before applying them to production clusters
- Maintain backup and disaster recovery procedures
Preventing custom admission webhooks from interfering with cluster operations