This document is relevant for: Inf1, Inf2, Trn1, Trn2, Trn3
AWS Neuron Dynamic Resource Allocation (DRA) on Kubernetes: Support files
This page provides templates supporting AWS Neuron Dynamic Resource Allocation (DRA) on Kubernetes. You can view and download these files from the links below.
Resource Claim Specifications
The following example resource claim templates and pod specifications demonstrate different Neuron device allocation patterns for a range of workload requirements.
| File Name | Description | Download |
|---|---|---|
| 1x4-connected-devices.yaml | Resource claim template for allocating 4 connected Neuron devices with topology constraints for optimal performance. | |
| 2-node-inference-us.yaml | Multi-node inference configuration for distributed workloads across 2 Trainium nodes. | |
| 4-node-inference-us.yaml | Large-scale inference setup for distributed workloads spanning 4 Trainium nodes. | |
| all-devices.yaml | Resource claim template that allocates all available Neuron devices on a trn2.48xlarge instance. | |
| lnc-setting-trn2.yaml | Logical NeuronCore configuration template optimized for Trainium2 instances. | |
| specific-driver-version.yaml | Example configuration for requesting specific Neuron driver versions in resource claims. | |
| us-and-lnc-config.yaml | Example configuration for requesting an UltraServer node with Logical NeuronCore configuration. | |
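
As a rough illustration of the general shape of a resource claim template and a pod that consumes it, the sketch below follows the upstream Kubernetes DRA API. It is not the content of any file listed above. The API version depends on what your cluster serves, and the device class name `neuron.example.com`, the object names, and the container image are placeholders assumed for illustration. Topology constraints are omitted because the attribute names published by the Neuron DRA driver are not given on this page; take the real values from the downloadable templates.

```yaml
# Hypothetical sketch only; not the contents of any file listed above.
apiVersion: resource.k8s.io/v1beta1   # assumption: adjust to the DRA API version your cluster serves
kind: ResourceClaimTemplate
metadata:
  name: four-neuron-devices           # placeholder name
spec:
  spec:
    devices:
      requests:
      - name: neurons
        deviceClassName: neuron.example.com   # placeholder; use the device class installed by the Neuron DRA driver
        allocationMode: ExactCount
        count: 4                      # request exactly four devices
---
apiVersion: v1
kind: Pod
metadata:
  name: neuron-workload               # placeholder name
spec:
  containers:
  - name: app
    image: registry.example.com/my-neuron-app:latest   # placeholder image
    resources:
      claims:
      - name: neuron-claim            # refers to the entry in spec.resourceClaims below
  resourceClaims:
  - name: neuron-claim
    resourceClaimTemplateName: four-neuron-devices     # a ResourceClaim is generated per pod from this template
```

In this API version, setting `allocationMode: All` (and dropping `count`) asks for every matching device on the node, which is presumably the pattern a template such as `all-devices.yaml` relies on.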