This document is relevant for: Inf1

AWS Neuron architecture guides#

Review and understand the hardware architecture of AWS Neuron instances, including Amazon Elastic Compute Cloud (EC2) Trn and Inf instance types, AWS Inferentia and Trainium chips, and NeuronCore processing units. The documentation covers system specifications, memory hierarchies, interconnect topologies, and architectural considerations for machine learning workloads.

About Neuron Hardware#

AWS Neuron hardware consists of custom-designed machine learning accelerators optimized for deep learning workloads. This section covers the architecture and capabilities of AWS Inferentia and Trainium chips, their NeuronCore processing units, and the EC2 instances that host them.

Trainium Architecture#

AWS Trainium3

Third-generation training accelerator chip

AWS Trainium2

Second-generation training accelerator chip

AWS Trainium

First-generation training accelerator chip

Inferentia Architecture#

AWS Inferentia2

Second-generation inference accelerator chip

AWS Inferentia

First-generation inference accelerator chip

NeuronCore Architecture#

NeuronCores are fully independent, heterogeneous compute units that power the Trainium, Trainium2, Trainium3, Inferentia, and Inferentia2 chips.

NeuronCore v4

Processing unit architecture for Trainium3

NeuronCore v3

Processing unit architecture for Trainium2

NeuronCore v2

Processing unit architecture for Inferentia2 and Trainium

NeuronCore v1

Processing unit architecture for Inferentia

Neuron AWS EC2 Platform Architecture#

Overviews of the AWS Inf and Trn instance architectures and the UltraServer architecture.

Inf1 Architecture

Inf1 instance architecture and specifications

Inf2 Architecture

Inf2 instance architecture and specifications

Trn1 Architecture

Trn1 instance architecture and specifications

Trn2 Architecture

Trn2 instance architecture and specifications

Trn3 Architecture

Trn3 instance architecture and specifications