Inference
This document is relevant for: Inf1, Inf2, Trn1, Trn1n
Inference#
Note
For help selecting a framework type, see:
Comparison of torch-neuron (Inf1) versus torch-neuronx (Inf2 & Trn1) for Inference
Tutorials (torch-neuronx)
HuggingFace pretrained BERT tutorial [html] [notebook]
TorchServe tutorial [html]
LibTorch C++ tutorial (for torch-neuron and torch-neuronx) [html]
Torchvision ResNet50 tutorial [html] [notebook]
Additional Examples (torch-neuronx)
API Reference Guide (torch-neuronx)
Developer Guide (torch-neuronx)
Misc (torch-neuronx)
Tutorials (torch-neuron)
ResNet-50 tutorial [html] [notebook]
PyTorch YOLOv4 tutorial [html] [notebook]
HuggingFace pretrained BERT tutorial [html] [notebook]
Bring your own HuggingFace pretrained BERT container to Sagemaker Tutorial [html] [notebook]
LibTorch C++ tutorial [html]
TorchServe tutorial [html]
HuggingFace MarianMT tutorial [html] [notebook]
BERT TorchServe tutorial [html]
NeuronCore Pipeline tutorial [html] [notebook]
Additional Examples (torch-neuron)
API Reference Guide (torch-neuron)
Developer Guide (torch-neuron)
This document is relevant for: Inf1, Inf2, Trn1, Trn1n