Neuron 2.28.0 is released! Check the What's New and Release Notes for more details.

AWS Neuron Dynamic Resource Allocation (DRA) on Kubernetes: Support files

Contents

This document is relevant for: Inf1, Inf2, Trn1, Trn2, Trn3

AWS Neuron Dynamic Resource Allocation (DRA) on Kubernetes: Support files#

This page provides templates supporting AWS Neuron Dynamic Resource Allocation (DRA) on Kubernetes. You can view and download these files from the links below.

Resource Claim Specifications#

Example resource claim templates and pod specifications demonstrating different Neuron device allocation patterns for various workload requirements.

File Name	Description	Download
1x4-connected-devices.yaml	Resource claim template for allocating 4 connected Neuron devices with topology constraints for optimal performance.	`Download`
2-node-inference-us.yaml	Multi-node inference configuration for distributed workloads across 2 Trainium nodes.	`Download`
4-node-inference-us.yaml	Large-scale inference setup for distributed workloads spanning 4 Trainium nodes.	`Download`
all-devices.yaml	Resource claim template that allocates all available Neuron devices on a trn2.48xlarge instance.	`Download`
lnc-setting-trn2.yaml	Logical NeuronCore configuration template optimized for Trainium2 instances.	`Download`
specific-driver-version.yaml	Example configuration for requesting specific Neuron driver versions in resource claims.	`Download`
us-and-lnc-config.yaml	Example configuration for requesting UltraServer node with Logical NeuronCore configuration.	`Download`

This document is relevant for: Inf1, Inf2, Trn1, Trn2, Trn3