This document is relevant for: Inf1

AWS Neuron architecture guides#

Review and understand the hardware architecture of AWS Neuron instances, including AWS Elastic Compute Cloud (EC2) Trn and Inf instance types, AWS Inferentia and Trainium chips, and NeuronCore processing units. The documentation covers system specifications, memory hierarchies, interconnect topologies, and architectural considerations for machine learning workloads.

AWS Neuron hardware architecture#

Trainium and Inferentia architecture overview

Architecture overview of EC2 Trn and Inf instances with system specifications and connectivity details

NeuronDevices architecture

Deep dive into AWS Inferentia and Trainium chip architecture and capabilities

NeuronCores architecture

Detailed NeuronCore architecture including versions, features, and performance characteristics

This document is relevant for: Inf1