This document is relevant for: Inf1, Inf2, Trn1, Trn2, Trn3
Get Started with AWS Neuron#
Get up and running with AWS Neuron SDK in minutes. These quickstarts guide you through your first training or inference workload on Inferentia and Trainium instances.
Note
First time using AWS Neuron? These quickstarts assume you have:
An active AWS account with EC2 access
Basic familiarity with your chosen ML framework (PyTorch, JAX, or TensorFlow)
SSH access to launch and connect to EC2 instances
For detailed installation instructions, see the Setup Guide.
Choose Your Path#
Select the quickstart that matches your use case:
Specialized Quickstarts#
Deploy large language models for production inference
Online serving (OpenAI-compatible API)
Time: ~20 minutes
Inf2 Trn1
Framework-Specific Guides#
Need framework-specific setup instructions?
Additional Resources#
Model samples and tutorials - Pre-tested model samples and tutorials
Amazon EC2 - Detailed EC2 deployment workflows
Neuron Containers - Use Deep Learning Containers
Neuron Quick Links - Quick links to all Neuron documentation
Neuron GitHub Samples - GitHub sample repositories
Legacy Quick-Start Pages (Inf1)#
Warning
The following pages are for legacy Inf1 instances only. For new projects, use the quickstarts above for Inf2, Trn1, Trn2, or Trn3.
Get Started with PyTorch Neuron - PyTorch on Inf1
Get Started with TensorFlow Neuron - TensorFlow on Inf1
Get Started with Apache MXNet Neuron - MXNet on Inf1
This document is relevant for: Inf1, Inf2, Trn1, Trn2, Trn3