AWS Neuron Logo
v1.16.1

Getting Started

  • QuickStart
  • PyTorch
  • TensorFlow
  • Apache MXNet (Incubating)
  • Tutorials
  • Performance
  • What’s New

Learning Neuron

  • Neuron Features
  • Neuron Developer Flows
  • Containers
  • Application Notes
    • Performance
      • General
      • PyTorch Neuron
      • Apache MXNet Neuron
    • Models
      • Inferentia Model Architecture Fit
    • Neuron Components
      • Introducing Neuron Runtime 2.x (libnrt.so)
  • Neuron FAQ

Neuron SDK

  • Setup Guide
  • Neuron Compiler
  • Neuron Runtime
  • Neuron Tools
  • Release Details
  • Roadmap
  • Support
AWS Neuron
  • »
  • Application Notes
  • Edit on GitHub

Application Notes¶

Performance¶

General¶

  • Performance Tuning
  • Mixed precision and performance-accuracy tuning
  • Parallel Execution using NEURONCORE_GROUP_SIZES

PyTorch Neuron¶

  • Running inference on variable input shapes with bucketing
  • Data Parallel Inference on Torch Neuron

Apache MXNet Neuron¶

  • Flexible Execution Group (FlexEG) in Neuron-MXNet

Models¶

  • Inferentia Model Architecture Fit

Neuron Components¶

  • Introducing Neuron Runtime 2.x (libnrt.so)
Next Previous

© Copyright 2021, Amazon Web Services. Revision 83eb5422.

Built with Sphinx using a theme provided by Read the Docs.
Read the Docs v: v1.16.1
Versions
latest
v1.16.1
v1.16.0
v1.15.2
1.15.1
1.15.0
1.14.2
1.14.1
1.14.0
1.13.0
1.12.2
1.12.1
v1.12.0
1.11.0
Downloads
On Read the Docs
Project Home
Builds