Configuring Auto-Scaling Policies
Overview
Configure auto-scaling policies for cloud workloads across AWS Auto Scaling Groups, GCP Managed Instance Groups, Azure VMSS, and Kubernetes Horizontal Pod Autoscaler (HPA). Generate scaling configurations based on CPU, memory, request rate, or custom metrics with appropriate thresholds, cooldown periods, and scale-in protection.
Prerequisites
- Cloud provider CLI installed and authenticated (aws, gcloud, or az)
- For Kubernetes HPA: kubectl configured with cluster access and metrics-server deployed
- Baseline performance data for the target workload (average CPU, memory, request rate)
- Understanding of traffic patterns (steady, bursty, scheduled)
- IAM permissions to create/modify scaling policies and CloudWatch/Stackdriver alarms
Instructions
- Identify the scaling target: EC2 Auto Scaling Group, GCP MIG, Azure VMSS, or Kubernetes Deployment
- Analyze current workload metrics to establish baseline utilization and peak patterns
- Define scaling boundaries: minimum instances/pods, maximum instances/pods, desired count
- Select scaling metric(s): CPU utilization, memory, request count, queue depth, or custom metrics
- Set target thresholds: scale-out trigger (e.g., CPU > 70%), scale-in trigger (e.g., CPU < 30%)
- Configure cooldown periods to prevent flapping (typically 300s scale-out, 600s scale-in)
- Add scale-in protection for stateful workloads or leader nodes if needed
- Generate the scaling policy configuration in the appropriate format (Terraform, YAML, or CLI commands)
- Validate by simulating load and confirming scaling events fire correctly
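As one concrete instance of the steps above, a Kubernetes HPA manifest might look like the sketch below. It assumes a Deployment named web in the current namespace; the replica bounds, 70% CPU target, and stabilization windows are illustrative values that should be tuned to your baseline metrics. HPA has no cooldown setting in the ASG sense; behavior.scaleDown.stabilizationWindowSeconds plays the equivalent anti-flapping role.

```yaml
# Sketch only: target Deployment "web" and all numeric values are assumptions.
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: web-hpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: web
  minReplicas: 2          # scaling floor
  maxReplicas: 10         # scaling ceiling
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: 70   # scale out when average CPU exceeds 70%
  behavior:
    scaleUp:
      stabilizationWindowSeconds: 60    # react quickly to load spikes
    scaleDown:
      stabilizationWindowSeconds: 600   # longer window to prevent flapping
```

Apply with kubectl apply -f and confirm behavior with kubectl get hpa, which shows current versus target utilization once metrics-server is reporting.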
Output
- Terraform HCL for AWS ASG scaling policies with CloudWatch alarms
- Kubernetes HPA manifests (YAML) with resource or custom metric targets
- GCP autoscaler configurations for Managed Instance Groups
- Scaling policy JSON/YAML for Azure VMSS
- CloudWatch or Stackdriver alarm definitions tied to scaling actions
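For the Terraform HCL output, a target-tracking policy for an AWS ASG could be sketched as follows. The ASG resource name (aws_autoscaling_group.web) and the 70% CPU target are assumptions for illustration; target tracking creates its own CloudWatch alarms, so no separate alarm resource is needed for this policy type.

```hcl
# Sketch only: assumes an existing ASG resource named aws_autoscaling_group.web.
resource "aws_autoscaling_policy" "cpu_target" {
  name                   = "cpu-target-tracking"
  autoscaling_group_name = aws_autoscaling_group.web.name
  policy_type            = "TargetTrackingScaling"

  target_tracking_configuration {
    predefined_metric_specification {
      predefined_metric_type = "ASGAverageCPUUtilization"
    }
    target_value     = 70.0   # keep average CPU near 70%
    disable_scale_in = false  # set true to add scale-in protection
  }
}
```

Target tracking is usually preferable to step scaling for steady workloads because AWS manages the alarm thresholds and scaling math; step scaling remains useful when you need asymmetric or multi-threshold behavior.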
Error Handling
| Error | Cause | Solution |
|---|---|---|
| No scaling activity despite high load | Metric not reaching threshold or cooldown active | Verify metric source in CloudWatch/Stackdriver; check cooldown timer with describe-scaling-activities |
| Scaling too aggressively (flapping) | Cooldown too short or threshold too sensitive | Increase cooldown period and widen the gap between scale-out and scale-in thresholds |
| Max capacity reached | Instance/pod limit hit during traffic spike | Raise max_size or implement request queuing as a backpressure mechanism |
Ready to use auto-scaling-configurator?