Spot instances
Spot instances are not guaranteed to be available. The chances of getting spot instances can be improved by providing instance_distribution
, a list of alternative instance types to the primary instance_type
you specified. If left blank, Cortex will only include the primary instance type in the instance_distribution
. When using instance_distribution
, use the instance type with the fewest compute resources as your primary instance_type
. Note that the default value for max_price
is the on-demand price of the primary instance type, but you may wish to set this to the on-demand price of the most expensive instance type in your instance_distribution
.
Spot instances can be mixed with on-demand instances by configuring on_demand_base_capacity
and on_demand_percentage_above_base_capacity
. on_demand_base_capacity
enforces the minimum number of nodes that will be fulfilled by on-demand instances as your cluster is scaling up. on_demand_percentage_above_base_capacity
defines the percentage of instances that will be on-demand after the base capacity has been fulfilled (the rest being spot instances). instance_pools
is the number of pools per availability zone to allocate your instances from. See here for more details.
Even if multiple instances are specified in your instance_distribution
on-demand instances are mixed, there is still a possibility of running into scale up issues when attempting to spin up spot instances. Spot instance requests may not be fulfilled for several reasons. Spot instance pricing fluctuates, therefore the max_price
may be lower than the current spot pricing rate. Another possibility could be that the availability zones of the cluster ran out of spot instances. The addition of another on-demand node group to node_groups
with a lower priority (by having a higher index in the node_groups
list) can mitigate the impact of unfulfilled spot requests by enabling the cluster to spin up on-demand instances if spot instance requests are not fulfilled within 5 minutes.
There is a spot instance limit associated with your AWS account for each instance family in each region. You can check your current limit and request an increase here (set the region in the upper right corner to your desired region, type "spot" in the search bar, and click on the quota that matches your instance type). Note that the quota values indicate the number of vCPUs available, not the number of instances; different instances have a different numbers of vCPUs, which can be seen here.
Example spot configuration
Only spot instances
3 on-demand base capacity with 0% on-demand above base capacity
0 on-demand base capacity with 50% on-demand above base capacity
Last updated