web-devproductivity
Auto-scaling
Auto-scaling is the ability of a system to automatically adjust the number of running instances or allocated resources based on current demand. When traffic spikes, new instances are provisioned; when demand drops, excess resources are released to save costs. Cloud platforms like AWS, GCP, and Azure offer auto-scaling groups with configurable policies based on CPU usage, request count, queue depth, or custom metrics.
#web-dev#productivity