About This Architecture
Horizontal Pod Autoscaler (HPA) architecture that scales deployment replicas based on CPU and custom metrics. The metrics server feeds utilization data to the HPA controller, which adjusts replica count within configured min/max bounds.