Run a reliable Consul cluster
This document outlines implementation resources for maintaining a reliable Consul cluster. When you implement proper reliability measures, you ensure high availability, fault tolerance, and consistent performance of your Consul infrastructure.
The following sections cover architecture, monitoring, resource management, recovery, and resilience.
Architecture
Learn about Consul Community Edition and Enterprise architecture and best practices for building reliable Consul clusters.
- Improving a Consul cluster resilience
- Recommendations for operating Consul clusters at scale
Monitoring
Monitor Consul to collect telemetry data to view performance, audits, and infrastructure usage to ensure Consul is reliable.
- Monitor service-to-service communication with Envoy
- Monitor Raft metrics and logs for WAL
- Monitor Consul components
- Monitoring and alerts recommendations for Consul
- Monitor key metrics for Consul agent telemetry
Resource management
Efficiently manage your Consul infrastructure, scaling, and performance.
- Optimize Consul server performance for overall throughput and health.
- Collect metrics with Consul Dataplane Telemetry.
- Learn about capacity planning recommendations when deploying and maintaining a Consul cluster in production.
- Monitor Raft metrics and logs for WAL.
Recovery
Recover Consul in the case of cluster degradation through the use of regular backups.
- General considerations for Consul disaster recovery
- Disaster recovery for Consul clusters
- Disaster recovery for federated primary Consul datacenter
- Disaster recovery for Consul on Kubernetes
- Disaster recovery for Consul on multi-cluster deployments
Resilience
Run a resilient Consul cluster to avoid application downtime.
- Understand fault tolerance in Consul clusters
- Provide fault-tolerance with Consul Redundancy Zones
- Failover configuration overview
- Gain insight into service mesh events and errors with Consul proxy access logs
Next steps
In this document, you learned about the HashiCorp resources for implementing and running a reliable Consul cluster. The following are implementation guides on the other HashiCorp products.