This document is relevant for: Inf1
AWS Neuron architecture guides#
Review and understand the hardware architecture of AWS Neuron instances, including AWS Elastic Compute Cloud (EC2) Trn and Inf instance types, AWS Inferentia and Trainium chips, and NeuronCore processing units. The documentation covers system specifications, memory hierarchies, interconnect topologies, and architectural considerations for machine learning workloads.
About Neuron Hardware#
AWS Neuron hardware consists of custom-designed machine learning accelerators optimized for deep learning workloads. This section covers the architecture and capabilities of AWS Inferentia and Trainium chips, their NeuronCore processing units, and the EC2 instances that host them.
Trainium Architecture#
Inferentia Architecture#
NeuronCore Architecture#
NeuronCores are fully-independent heterogenous compute-units that power Tranium, Tranium2, Inferentia, and Inferentia2 chips.
Neuron AWS EC2 Platform Architecture#
Overviews of the AWS Inf and Trn instance and UltraServer architectures.
This document is relevant for: Inf1