Get started
Create a cluster on your AWS account
Client installation - customize your client installation.
Cluster configuration - optimize your cluster for your workloads.
Environments - manage multiple clusters.
Run machine learning workloads at scale
RealtimeAPI - create HTTP/gRPC APIs that respond to prediction requests in real time.
AsyncAPI - create APIs that respond to prediction requests asynchronously.
BatchAPI - create APIs that run distributed batch inference jobs.
TaskAPI - create APIs that run training or fine-tuning jobs.