Welcome to AWS Neuron

AWS Neuron is the software development kit (SDK) used to run deep learning and generative AI workloads on AWS Inferentia- and AWS Trainium-powered Amazon EC2 instances (Amazon EC2 Inf1, Inf2, Trn1, and Trn2 instances). It includes a compiler, runtime, training and inference libraries, and profiling tools. Neuron supports customers across the end-to-end ML development lifecycle, including building and deploying deep learning and AI models.

For more information about the latest AWS Neuron release, see Neuron 2.21.0 Beta (12/03/2024) and check the Announcements page.

For a list of AWS Neuron model samples and tutorials on Amazon EC2 Inf1, Inf2, Trn1, and Trn2 instances, see Model samples and tutorials.

Get Started with Neuron
Neuron Quick Links