About the AWS Neuron SDK#
AWS Neuron is a software development kit (SDK) enabling high-performance deep learning acceleration using AWS Inferentia and Trainium, AWS’s custom designed machine learning accelerators. It enables you to develop, profile, and deploy high-performance machine learning workloads on AWS Inferentia and Trainium instances.
The AWS Neuron SDK includes:
Neuron Compiler - Compiles high-level, framework-based models for optimal performance on Neuron devices
Neuron Kernel Interface (NKI) - Provides direct compiler access to Neuron device capabilities
Neuron Runtime - Executes compiled models on Neuron devices
ML Framework integration - Deep support for PyTorch and JAX
Training and inference libraries - Distributable training and inference libraries for large-scale models
Deployment support - Integration with AWS services like SageMaker, EC2, EKS, and ECS
Developer tools - Profiling, monitoring, and debugging utilities
For a full list of AWS Neuron features, see What is AWS Neuron?
What is “NeuronX”?#
“NeuronX” refers to the next-generation AWS Neuron SDK, which provides enhanced capabilities for both inference and training on AWS Inferentia and Trainium instances. NeuronX includes:
Support for the latest versions of PyTorch and JAX
Advanced compiler optimizations for improved performance
Enhanced distributed training libraries for large-scale models
Improved profiling and debugging tools
Ongoing feature development and support for new instance types
Learn about AWS Neuron#
Neuron AWS Neuron Deep Learning Containers
Deploy the Neuron SDK using pre-built Docker deep learning containers (DLCs)
Resources#
Support#
AWS Neuron Support
Contact us#
For support, submit a request with AWS Neuron Github issues or visit the Neuron AWS forums for an answer.
If you want to request a feature or report a critical issue, you can contact us directly at aws-neuron-support@amazon.com
.