This document is relevant for: Inf1, Inf2, Trn1, Trn2

PyTorch Neuron

PyTorch Neuron unlocks high-performance and cost-effective deep learning acceleration on AWS Trainium-based and AWS Inferentia-based Amazon EC2 instances.

The PyTorch Neuron plugin architecture enables native PyTorch models to be accelerated on Neuron devices, so you can use your existing framework application and get started easily with minimal code changes.

For help selecting a framework type for inference, see Comparison of torch-neuron (Inf1) versus torch-neuronx (Inf2 & Trn1) for Inference.

PyTorch NeuronX

PyTorch NeuronX for training on Trn1 and Trn2
PyTorch NeuronX for inference on Inf2, Trn1, and Trn2

PyTorch Neuron

PyTorch Neuron for inference on Inf1
