Scaling Policies

Scaler’s auto-scaling is controlled by the scheduler policy engine. Pick the engine with policy_engine_type, then configure it with policy_content.

Available policy engines:

Policy Engine

Description

Best For

simple

General-purpose engine for most deployments.

  • One manager or a small manager set

  • Homogeneous workloads: even_load + vanilla

  • Capability-aware workloads: capability + capability

waterfall_v1

Priority-based multi-manager scaling.

  • Strict tier preference across managers

  • Cost tiers (local/on-prem first, cloud overflow second)

  • Latency tiers where preferred resources fill first

  • Deterministic scale-up and scale-down order