AWS Neuron Documentation#
AWS Neuron is a software stack that enables high-performance deep learning and generative AI workloads on AWS Inferentia and AWS Trainium instances. Neuron provides a complete machine learning development experience with compiler optimization, runtime efficiency, and comprehensive tooling.
For more details, see What is AWS Neuron? and What’s New in AWS Neuron?
For the latest release notes, see AWS Neuron Release Notes
Join our Beta program
Get early access to new Neuron features and tools! Fill out this form and apply to join our Beta program.
Looking to dive into Neuron development? Follow these links:
Learn more about AWS Neuron#
Select a card below to read more about these features:
Developer Tools
Profile and monitor your models as you develop, build, test, and deploy them with Neuron’s developer tools.
Neuron Kernel Interface
Low-level programming interface for custom kernel development on Trainium and Inferentia with direct hardware access.
Other Neuron features:
AWS and the AWS logo are trademarks of Amazon Web Services, Inc. or its affiliates. All rights reserved.