Welcome to AWS Neuron

AWS Neuron is the SDK used to run deep learning workloads on AWS Inferentia- and AWS Trainium-based instances. It supports customers throughout the end-to-end ML development lifecycle: building new models, training and optimizing them, and deploying them to production. To learn about the model architectures currently supported on Inf1 and Trn1 instances, see Model Architecture Fit Guidelines. To learn about upcoming capabilities, see the Roadmap.

AWS Neuron includes a deep learning compiler, runtime, and tools that are natively integrated into TensorFlow, PyTorch, and Apache MXNet (incubating). The EC2 Trn1 instances are optimized for the highest performance and best price-performance training in AWS. The EC2 Inf1 instances are designed for high-performance deep learning inference applications. With Neuron, customers can quickly start using Inf1 and Trn1 instances through services such as Amazon SageMaker, Amazon Elastic Container Service (ECS), Amazon Elastic Kubernetes Service (EKS), AWS Batch, and AWS ParallelCluster.

Check Announcements, and see Neuron 2.6.0 (12/12/2022) for the latest release.

Get Started with PyTorch Neuron
Get Started with TensorFlow Neuron
Neuron Quick Links