Get started

Create a cluster on your AWS account

# install the CLI
pip install cortex

# create a cluster
cortex cluster up cluster.yaml

Run machine learning workloads at scale

# deploy machine learning APIs
cortex deploy apis.yaml
  • RealtimeAPI - create HTTP/gRPC APIs that respond to prediction requests in real-time.

  • AsyncAPI - create APIs that respond to prediction requests asynchronously.

  • BatchAPI - create APIs that run distributed batch inference jobs.

  • TaskAPI - create APIs that run training or fine-tuning jobs.

Last updated