This document is relevant for: Inf1, Inf2, Trn1, Trn2
NeuronPerf Terminology#
Model Inputs - An individual input or
listof inputs - Example:inputs = [(torch.ones((batch_size, 5))) for batch_size in batch_sizes]- Each input is associated with thebatch_sizesspecified, in the same order - Each input is fed individually to a corresponding model - If an input is provided as atuple, it will be destructured tomodel(*input)to support multiple args - See NeuronPerf Framework Notes for framework-specific requirements
- Latency
Time to execute a single
model(input)Typically measured in milliseconds
- Model
Your data model; varies by framework. See NeuronPerf Framework Notes
Models may be wrapped by submodules (
torch,tensorflow,mxnet) as callables
- Model Index
A JSON file that tracks compiled model artifacts
- Model Inputs
A
tupleof inputs passed to a model, i.e. a single complete exampleExample:
input = (torch.ones((5, 3, 224, 224)),)
- Throughput
Inferences / second
This document is relevant for: Inf1, Inf2, Trn1, Trn2