This document is relevant for: Inf1, Inf2, Trn1, Trn1n

Deploy Containers with Neuron#

In this section you will find resources to help you use containers for your accelerated deep learning model acceleration on top of Inferentia and Trainium enabled instances.

The section is organized based on the target deployment environment and use case. In most cases, it is recommended to use a preconfigured Deep Learning Container (DLC) from AWS. Each DLC is pre-configured to have all of the Neuron components installed and is specific to the chosen ML Framework.

Locate Neuron DLC image

Introduction

The Pytorch Neuron DLC images are published to ECR Public, which is the recommended URL to use for most cases. If you are working within AWS SageMaker, you should use the Amazon ECR URL instead of the Amazon ECR Public one because of the restriction of Sagemaker. TensorFlow DLCs are not updated with the latest release. For earlier releases please check here.

Neuron DLC images in Amazon ECR Public

Framework	Neuron Package	Job Type	Supported EC2 Instance Types	Python Version Options	ECR Public Repo URL	Image Details	Other Packages
PyTorch 2.1.2	aws-neuronx-tools, neuronx_distributed, torch-neuronx, transformers-neuronx	inference	trn1 and inf2	3.10 (py310)	https://gallery.ecr.aws/neuron/pytorch-inference-neuronx	https://github.com/aws-neuron/deep-learning-containers#pytorch-inference-neuronx	torchserve
PyTorch 2.1.2	aws-neuronx-tools, neuronx_distributed, torch-neuronx	training	trn1 and inf2	3.10 (py310)	https://gallery.ecr.aws/neuron/pytorch-training-neuronx	https://github.com/aws-neuron/deep-learning-containers#pytorch-training-neuronx
PyTorch 1.13.1	aws-neuronx-tools, torch-neuron	inference	inf1	3.10 (py310)	https://gallery.ecr.aws/neuron/pytorch-inference-neuron	https://github.com/aws-neuron/deep-learning-containers#pytorch-inference-neuron	torchserve
PyTorch 1.13.1	aws-neuronx-tools, neuronx_distributed, torch-neuronx, transformers-neuronx	inference	trn1 and inf2	3.10 (py310)	https://gallery.ecr.aws/neuron/pytorch-inference-neuronx	https://github.com/aws-neuron/deep-learning-containers#pytorch-inference-neuronx	torchserve
PyTorch 1.13.1	aws-neuronx-tools, neuronx_distributed, torch-neuronx	training	trn1 and inf2	3.10 (py310)	https://gallery.ecr.aws/neuron/pytorch-training-neuronx	https://github.com/aws-neuron/deep-learning-containers#pytorch-training-neuronx

Latest Neuron DLC images in Amazon ECR

Find latest Neuron DLC images.

Locate specific Neuron DLC release in Amazon ECR

In the DLC release page do a search for Neuron to get the ECR repo location of specific Neuron DLC release.

This document is relevant for: Inf1, Inf2, Trn1, Trn1n

AWS Neuron Documentation

Deploy Containers with Neuron

Deploy Containers with Neuron#