Deploy Neuron Container on Elastic Container Service (ECS)¶
You can use the Neuron version of the AWS Deep Learning Containers to run inference on Amazon Elastic Container Service (ECS). In this developer flow, you set up an ECS cluster with inf1 instances, create a task description for your inference service and deploy it to your cluster. This developer flow assumes:
- Set up an Amazon ECS cluster:
Follow the instructions on Setting up Amazon ECS for Deep Learning Containers
- Define an Inference Task:
Use the instruction on the DLC Inference on ECS Tutorial to define a task and create a service for the appropriate framework.
When creating tasks for inf1 instances on ECS, be aware of the considerations and requirements listed in Working with inference workloads on Amazon ECS.
Use the container image created using How to Build a Neuron Container as the
imagein your task definition.
Before deploying your task definition to your ECS cluster, make sure to push the image to ECR. Refer to Pushing a Docker image for more information.