Kubernetes 101 : Scaling your pods automatically - Horizontal Pod Autoscaler -



Below is a Yaml file representing a deployment for two nginx pod replicas, with defined resource limits and requests:

Remark:

A container can use more than the requested CPU, but it can't go beyond the set limit.


The below command runs the Autoscaler on the deployment "nginx-dep":


The Autoscaler will add or remove pods - number of pods will be between "2" and "10" pods at all times -, while maintaining a CPU consumption of "50%" of CPU resources, as stated above in the "--cpu-percent" parameter.


We solicit the nginx pods - by sending them requests - then, we check the Autoscaler output parameter "TARGET = 50% (half) out of the 50% is used":


Now, we could check our deployment, using the below command:


We see that the deployment is running three pods now - it added one pod -.

The Yaml file of our Autoscaler will look like the below:


Comments

Leave as a comment:

Archive