Autoscaling an application server

Autoscaling is a fundamental component of computing in the cloud. To understand autoscaling, you need to understand the concepts of vertical and horizontal scaling. With vertical scaling, a single machine is upgraded to a more powerful instance by adding more CPU power, more RAM, or more disk capacity. This is effective up to a point, but eventually the complexity and cost of ever-larger machines make it impractical. With horizontal scaling, an application workload is spread out over several smaller machines, and adding new machines provides a nearly linear increase in the load that the application can handle. Strictly speaking, adding machines is called scaling out and removing machines that are no longer needed is called scaling in, although in practice these are often loosely referred to as scaling up and scaling down.
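The arithmetic behind horizontal scaling can be sketched in a few lines of Python. This is only an illustration, not an AWS API: the function name, its parameters, and the assumption that every machine handles the same fixed number of requests per second are all hypothetical.

```python
import math


def desired_instances(load_rps: float,
                      capacity_per_instance_rps: float,
                      min_size: int = 1,
                      max_size: int = 10) -> int:
    """Return how many identical machines are needed for a given load.

    Assumes (hypothetically) that each machine serves a fixed number of
    requests per second and that load spreads evenly across machines.
    """
    if capacity_per_instance_rps <= 0:
        raise ValueError("capacity_per_instance_rps must be positive")
    # Round up: a fractional machine still requires a whole machine.
    needed = math.ceil(load_rps / capacity_per_instance_rps)
    # Clamp to the configured minimum and maximum group size.
    return max(min_size, min(max_size, needed))


if __name__ == "__main__":
    for load in (0, 450, 5000):
        print(load, "req/s ->", desired_instances(load, 100), "instances")
```

The clamp to a minimum and maximum mirrors the idea that a group never shrinks below a floor or grows past a ceiling, whatever the load.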

EC2 autoscaling provides not only the ability to scale up and down in response to application load but also redundancy. An autoscaling group maintains the number of instances you ask for and replaces instances that fail, so capacity remains available. Even in the unlikely event of an Availability Zone (AZ) outage, the autoscaling group will ensure that instances are available to run your application, provided you have configured it to provision instances in all AZs.
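To make the redundancy argument concrete, the following Python sketch models a group that spreads its instances evenly across zones and redistributes them when one zone fails. It is a simplified model under stated assumptions, not a description of how EC2 actually places instances; the function names and the zone names are illustrative only.

```python
from typing import Dict, List


def spread_across_zones(desired_capacity: int, zones: List[str]) -> Dict[str, int]:
    """Distribute instances as evenly as possible across the given zones."""
    if not zones:
        raise ValueError("at least one zone is required")
    base, extra = divmod(desired_capacity, len(zones))
    # The first `extra` zones each receive one additional instance.
    return {zone: base + (1 if i < extra else 0) for i, zone in enumerate(zones)}


def rebalance_after_outage(placement: Dict[str, int], failed_zone: str) -> Dict[str, int]:
    """Re-create the same total capacity using only the surviving zones."""
    healthy = [zone for zone in placement if zone != failed_zone]
    if not healthy:
        raise ValueError("no healthy zones remain")
    return spread_across_zones(sum(placement.values()), healthy)


if __name__ == "__main__":
    # Zone names here are placeholders for illustration.
    placement = spread_across_zones(7, ["zone-a", "zone-b", "zone-c"])
    print("normal:", placement)
    print("after zone-a outage:", rebalance_after_outage(placement, "zone-a"))
```

With a single zone configured, the second function has nowhere to move capacity and raises an error, which is the situation that provisioning across several AZs is meant to avoid.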

Autoscaling also allows you to pay for only the EC2 capacity you need because underutilized servers can be automatically deprovisioned.
