This document is relevant for: Inf1, Inf2, Trn1, Trn2, Trn3

Tutorials

Profiling a vLLM Inference Workload

Learn how to capture and analyze device-level and system-level profiles for vLLM inference workloads on AWS Trainium.

Profiling an NKI Kernel

Learn how to profile an NKI kernel with Neuron Explorer.

Track System Resource Utilization during Training with Neuron Monitor

Learn how to monitor resource utilization using neuron-monitor, Prometheus and Grafana while running a multi-layer perceptron MNIST model on Trainium using PyTorch Neuron.
