Multi-instance
Cortex can be configured to provision different instance types to improve workload performance and reduce cloud infrastructure spend.
Best practices
Spot node groups should have a higher priority than on-demand node groups.
CPU node groups should have higher priorities than GPU/Inferentia node groups.
Node groups with small instance types should have higher priorities than node groups with large instance types.
Examples
CPU spot cluster, with on-demand backup
On-demand cluster supporting CPU, GPU, and Inferentia
Spot cluster supporting CPU and GPU (with on-demand backup)
CPU spot cluster with multiple instance types and on-demand backup
Last updated