AWS Neuron
latest
Getting Started
QuickStart
PyTorch
TensorFlow
Apache MXNet (Incubating)
Tutorials
Performance
What’s New
Learning Neuron
Neuron Features
Neuron Developer Flows
Containers
Application Notes
Performance
General
PyTorch Neuron
Apache MXNet Neuron
Models
Inferentia Model Architecture Fit
Neuron Components
Introducing Neuron Runtime 2.x (libnrt.so)
Neuron FAQ
Neuron SDK
Setup Guide
Neuron Compiler
Neuron Runtime
Neuron Tools
NeuronPerf (Beta)
Release Details
Roadmap
Support
AWS Neuron
»
Application Notes
Edit on GitHub
×
We want your feedback about Neuron SDK!
Let us know by taking the
Neuron survey
Application Notes
¶
Performance
¶
General
¶
Performance Tuning
Mixed precision and performance-accuracy tuning
Parallel Execution using NEURON_RT_NUM_CORES
PyTorch Neuron
¶
Running inference on variable input shapes with bucketing
Data Parallel Inference on Torch Neuron
Apache MXNet Neuron
¶
Flexible Execution Group (FlexEG) in Neuron-MXNet
Models
¶
Inferentia Model Architecture Fit
Neuron Components
¶
Introducing Neuron Runtime 2.x (libnrt.so)
Read the Docs
v: latest
Versions
latest
v1.19.2
v1.19.1
v1.19.0
v1.18.0
v1.17.2
v1.17.1
v1.17.0
v1.16.3
v1.16.2
v1.16.1
v1.16.0
Downloads
pdf
On Read the Docs
Project Home
Builds