Inference
This document is relevant for: Inf1
, Inf2
, Trn1
, Trn1n
Inference#
Note
For help selecting a framework type, see:
Comparison of torch-neuron (Inf1) versus torch-neuronx (Inf2 & Trn1) for Inference
Tutorials (torch-neuronx
)
HuggingFace pretrained BERT tutorial [html] [notebook]
TorchServe tutorial [html]
LibTorch C++ tutorial (for torch-neuron and torch-neuronx) [html]
Torchvision ResNet50 tutorial [html] [notebook]
Additional Examples (torch-neuronx
)
API Reference Guide (torch-neuronx
)
Developer Guide (torch-neuronx
)
Misc (torch-neuronx
)
Transformers Neuron (transformers-neuronx
)
Tutorials (torch-neuron
)
ResNet-50 tutorial [html] [notebook]
PyTorch YOLOv4 tutorial [html] [notebook]
HuggingFace pretrained BERT tutorial [html] [notebook]
Bring your own HuggingFace pretrained BERT container to Sagemaker Tutorial [html] [notebook]
LibTorch C++ tutorial [html]
TorchServe tutorial [html]
HuggingFace MarianMT tutorial [html] [notebook]
BERT TorchServe tutorial [html]
NeuronCore Pipeline tutorial [html] [notebook]
Additional Examples (torch-neuron
)
API Reference Guide (torch-neuron
)
Developer Guide (torch-neuron
)
This document is relevant for: Inf1
, Inf2
, Trn1
, Trn1n