This document is relevant for: Inf1
Bring Your Own Neuron Container to SageMaker Hosting (Inf1) — Archived
Warning
This document is archived. The Inf1 SageMaker BYOC developer flow is no longer maintained as a standalone page. It is provided here for reference only.
The original page described how to compile a model on an EC2 Inf1 instance
or in a SageMaker Notebook, and then deploy it to SageMaker Hosting on an
ml.inf1 instance using a custom container.
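As a rough illustration of the deployment half of that flow, the sketch below builds the request payloads for the three SageMaker API calls that a custom-container deployment typically involves (`create_model`, `create_endpoint_config`, `create_endpoint`). It is not taken from the original page. The helper name `build_hosting_requests`, the `-config` and `-endpoint` name suffixes, the `AllTraffic` variant name, and the default `ml.inf1.xlarge` instance type are all assumptions chosen for illustration; the image URI, S3 path, and role ARN are values you would supply yourself.

```python
from typing import Any, Dict


def build_hosting_requests(
    name: str,
    image_uri: str,
    model_data_url: str,
    role_arn: str,
    instance_type: str = "ml.inf1.xlarge",  # assumed default, pick your own size
) -> Dict[str, Dict[str, Any]]:
    """Build keyword arguments for the three SageMaker hosting calls.

    Hypothetical helper: it only assembles dictionaries and never calls AWS.
    """
    if not instance_type.startswith("ml.inf1."):
        raise ValueError("expected an ml.inf1 instance type")

    config_name = f"{name}-config"  # naming scheme is an assumption
    return {
        # Registers the custom container image and the compiled model artifact.
        "create_model": {
            "ModelName": name,
            "PrimaryContainer": {
                "Image": image_uri,
                "ModelDataUrl": model_data_url,
            },
            "ExecutionRoleArn": role_arn,
        },
        # Describes the instance type and count that will serve the model.
        "create_endpoint_config": {
            "EndpointConfigName": config_name,
            "ProductionVariants": [
                {
                    "VariantName": "AllTraffic",
                    "ModelName": name,
                    "InitialInstanceCount": 1,
                    "InstanceType": instance_type,
                }
            ],
        },
        # Creates the endpoint from the configuration above.
        "create_endpoint": {
            "EndpointName": f"{name}-endpoint",
            "EndpointConfigName": config_name,
        },
    }


# With boto3 installed and credentials configured, the payloads could be used as:
#   import boto3
#   sm = boto3.client("sagemaker")
#   reqs = build_hosting_requests(name, image_uri, model_data_url, role_arn)
#   sm.create_model(**reqs["create_model"])
#   sm.create_endpoint_config(**reqs["create_endpoint_config"])
#   sm.create_endpoint(**reqs["create_endpoint"])
```

Keeping payload construction separate from the AWS calls lets the configuration be inspected or unit tested without credentials.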
For current guidance, see:
Compile with Framework API and Deploy on EC2 Inf1 for the EC2 Inf1 deployment flow.
The AWS guide to Adapting Your Own Inference Container for the SageMaker Hosting BYOC pattern.
Tutorial How to Build and Run a Neuron Container for building a Neuron container image.
The BYOC HuggingFace pretrained BERT container to SageMaker tutorial for an end-to-end BYOC example.