This document is relevant for: Inf1, Inf2, Trn1, Trn2, Trn3
Tutorials
Profiling a vLLM Inference Workload
Learn how to capture and analyze device-level and system-level profiles for vLLM inference workloads on AWS Trainium.
Profiling a NKI Kernel
Learn how to profile a NKI kernel with Neuron Explorer.
Track System Resource Utilization during Training with Neuron Monitor
Learn how to monitor resource utilization with neuron-monitor, Prometheus, and Grafana while training a multi-layer perceptron on MNIST on Trainium using PyTorch Neuron.