This document is relevant for: Inf1, Inf2, Trn1, Trn2, Trn3

AWS Neuron Dynamic Resource Allocation (DRA) on Kubernetes: Support files

This page provides templates supporting AWS Neuron Dynamic Resource Allocation (DRA) on Kubernetes. You can view and download these files from the links below.

Resource Claim Specifications

The following example resource claim templates and pod specifications demonstrate Neuron device allocation patterns for different workload requirements.

| File Name | Description | Download |
| --- | --- | --- |
| 1x4-connected-devices.yaml | Resource claim template for allocating 4 connected Neuron devices with topology constraints for optimal performance. | Download |
| 2-node-inference-us.yaml | Multi-node inference configuration for distributed workloads across 2 Trainium nodes. | Download |
| 4-node-inference-us.yaml | Large-scale inference setup for distributed workloads spanning 4 Trainium nodes. | Download |
| all-devices.yaml | Resource claim template that allocates all available Neuron devices on a trn2.48xlarge instance. | Download |
| lnc-setting-trn2.yaml | Logical NeuronCore configuration template optimized for Trainium2 instances. | Download |
| specific-driver-version.yaml | Example configuration for requesting a specific Neuron driver version in a resource claim. | Download |
| us-and-lnc-config.yaml | Example configuration for requesting an UltraServer node with a Logical NeuronCore configuration. | Download |
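
For orientation, the general shape of a Kubernetes resource claim template and a pod that consumes it might look like the sketch below. It is not the content of any file listed above. It assumes the `resource.k8s.io/v1beta1` API (the DRA schema differs between Kubernetes versions), and the device class name `neuron.example.com`, the object names, and the container image are hypothetical placeholders. Topology constraints and other Neuron-specific settings are deliberately left out; take those from the downloadable files.

```yaml
# Illustrative sketch only; not one of the downloadable files.
# Assumption: resource.k8s.io/v1beta1. Adjust apiVersion and schema to your cluster.
apiVersion: resource.k8s.io/v1beta1
kind: ResourceClaimTemplate
metadata:
  name: four-neuron-devices             # hypothetical name
spec:
  spec:
    devices:
      requests:
      - name: neurons
        deviceClassName: neuron.example.com   # hypothetical; use the class installed by your DRA driver
        allocationMode: ExactCount
        count: 4                        # request exactly 4 devices
---
apiVersion: v1
kind: Pod
metadata:
  name: neuron-workload                 # hypothetical name
spec:
  resourceClaims:
  - name: neurons                       # pod-level claim generated from the template
    resourceClaimTemplateName: four-neuron-devices
  containers:
  - name: app
    image: example.com/my-neuron-app:latest   # hypothetical image
    resources:
      claims:
      - name: neurons                   # gives this container access to the claimed devices
```

In the same API version, a request for every matching device on a node would use `allocationMode: All` in place of `ExactCount` and `count`.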
