What’s New
Contents
This document is relevant for: Inf1
, Inf2
, Trn1
, Trn1n
What’s New#
Table of contents
Neuron 2.10.0 (05/01/2023)#
Table of contents
What’s New#
This release introduces new features, performance optimizations, minor enhancements and bug fixes. This release introduces the following:
What’s New |
Details |
Instances |
---|---|---|
Initial support for computer vision models inference |
|
Inf2, Trn1/Trn1n |
Profiling support in PyTorch Neuron( |
Inf2, Trn1/Trn1n |
|
New Features and Performance Enhancements in transformers-neuronx |
|
Inf2, Trn1/Trn1n |
Support models larger than 2GB in TensorFlow 2.x Neuron ( |
|
Trn1/Trn1n, Inf2 |
Support models larger than 2GB in TensorFlow 2.x Neuron ( |
|
Inf1 |
Performance Enhancements in PyTorch C++ Custom Operators (Experimental) |
|
Trn1/Trn1n |
Weight Deduplication Feature (Inf1) |
|
Inf1 |
|
|
Trn1/Trn1n , Inf2 |
Announcing end of support for tensorflow-neuron 2.7 & mxnet-neuron 1.5 versions |
Inf1 |
|
Minor enhancements and bug fixes. |
Trn1/Trn1n , Inf2, Inf1 |
|
Release Artifacts |
Trn1/Trn1n , Inf2, Inf1 |
For more detailed release notes of the new features and resolved issues, see Neuron Components Release Notes.
To learn about the model architectures currently supported on Inf1, Inf2, Trn1 and Trn1n instances, please see Model Architecture Fit Guidelines.
Neuron Components Release Notes#
Inf1, Trn1/Trn1n and Inf2 common packages#
Component |
Instance/s |
Package/s |
Details |
---|---|---|---|
Neuron Runtime |
Trn1/Trn1n, Inf1, Inf2 |
|
|
Neuron Runtime Driver |
Trn1/Trn1n, Inf1, Inf2 |
|
|
Neuron System Tools |
Trn1/Trn1n, Inf1, Inf2 |
|
|
Containers |
Trn1/Trn1n, Inf1, Inf2 |
|
|
NeuronPerf (Inference only) |
Trn1/Trn1n, Inf1, Inf2 |
|
|
TensorFlow Model Server Neuron |
Trn1/Trn1n, Inf1, Inf2 |
|
Trn1/Trn1n and Inf2 only packages#
Component |
Instance/s |
Package/s |
Details |
---|---|---|---|
PyTorch Neuron |
Trn1/Trn1n, Inf2 |
|
|
TensorFlow Neuron |
Trn1/Trn1n, Inf2 |
|
|
Neuron Compiler (Trn1/Trn1n, Inf2 only) |
Trn1/Trn1n, Inf2 |
|
|
Collective Communication library |
Trn1/Trn1n, Inf2 |
|
|
Neuron Custom C++ Operators |
Trn1/Trn1n, Inf2 |
|
|
|
Trn1/Trn1n, Inf2 |
|
Note
In next releases aws-neuronx-tools
and aws-neuronx-runtime-lib
will add support for Inf1.
Inf1 only packages#
Component |
Instance/s |
Package/s |
Details |
---|---|---|---|
PyTorch Neuron |
Inf1 |
|
|
TensorFlow Neuron |
Inf1 |
|
|
Apache MXNet (Incubating) |
Inf1 |
|
|
Neuron Compiler (Inf1 only) |
Inf1 |
|
Release Artifacts#
Trn1 packages
List of packages in Neuron 2.10.0:
Component Package
Collective Communication Library aws-neuronx-collectives-2.13.7.0
Driver aws-neuronx-dkms-2.9.4.0
CustomOps aws-neuronx-gpsimd-customop-0.3.1.0
CustomOps Tools aws-neuronx-gpsimd-tools-0.3.0.0
General aws-neuronx-runtime-discovery-2.9
Runtime Library aws-neuronx-runtime-lib-2.13.6.0
System Tools aws-neuronx-tools-2.10.1.0
General libneuronxla-0.5.207
General neuronx_hwm-2.6.0.0
PyTorch torch_xla-1.13.1
Compiler neuronx-cc-2.6.0.19
Kubernetes Plugin aws-neuronx-k8-plugin-2.13.4.0
Kubernetes Scheduler aws-neuronx-k8-scheduler-2.13.4.0
OCI aws-neuronx-oci-hook-2.2.0.0
TensorFlow tensorflow-neuronx-2.10.1.2.1.0
TensorFlow tensorflow-neuronx-2.7.4.2.1.0
TensorFlow tensorflow-neuronx-2.8.4.2.1.0
TensorFlow tensorflow-neuronx-2.9.3.2.1.0
TensorFlow Model Server tensorflow-model-server-neuronx-1.15.0.2.8.1.0
TensorFlow Model Server tensorflow-model-server-neuronx-2.10.1.2.8.1.0
TensorFlow Model Server tensorflow-model-server-neuronx-2.7.4.2.8.1.0
TensorFlow Model Server tensorflow-model-server-neuronx-2.8.4.2.8.1.0
TensorFlow Model Server tensorflow-model-server-neuronx-2.9.3.2.8.1.0
TensorBoard tensorboard-plugin-neuronx-2.5.26.0
PyTorch torch-neuronx-1.13.1.1.7.0
Transformers Neuron transformers-neuronx-0.3.32
Compiler islpy-2021.1
Inf2 packages
List of packages in Neuron 2.10.0:
Component Package
Collective Communication Library aws-neuronx-collectives-2.13.7.0
Driver aws-neuronx-dkms-2.9.4.0
CustomOps aws-neuronx-gpsimd-customop-0.3.1.0
CustomOps Tools aws-neuronx-gpsimd-tools-0.3.0.0
General aws-neuronx-runtime-discovery-2.9
Runtime Library aws-neuronx-runtime-lib-2.13.6.0
System Tools aws-neuronx-tools-2.10.1.0
General libneuronxla-0.5.207
General neuronx_hwm-2.6.0.0
PyTorch torch_xla-1.13.1
Compiler neuronx-cc-2.6.0.19
Kubernetes Plugin aws-neuronx-k8-plugin-2.13.4.0
Kubernetes Scheduler aws-neuronx-k8-scheduler-2.13.4.0
OCI aws-neuronx-oci-hook-2.2.0.0
TensorFlow tensorflow-neuronx-2.10.1.2.1.0
TensorFlow tensorflow-neuronx-2.7.4.2.1.0
TensorFlow tensorflow-neuronx-2.8.4.2.1.0
TensorFlow tensorflow-neuronx-2.9.3.2.1.0
TensorFlow Model Server tensorflow-model-server-neuronx-1.15.0.2.8.1.0
TensorFlow Model Server tensorflow-model-server-neuronx-2.10.1.2.8.1.0
TensorFlow Model Server tensorflow-model-server-neuronx-2.7.4.2.8.1.0
TensorFlow Model Server tensorflow-model-server-neuronx-2.8.4.2.8.1.0
TensorFlow Model Server tensorflow-model-server-neuronx-2.9.3.2.8.1.0
TensorBoard tensorboard-plugin-neuronx-2.5.26.0
PyTorch torch-neuronx-1.13.1.1.7.0
Transformers Neuron transformers-neuronx-0.3.32
Compiler islpy-2021.1
Inf1 packages
List of packages in Neuron 2.10.0:
Component Package
Driver aws-neuronx-dkms-2.9.4.0
System Tools aws-neuronx-tools-2.10.1.0
Compiler neuron-cc-1.15.0
Kubernetes Plugin aws-neuronx-k8-plugin-2.13.4.0
Kubernetes Scheduler aws-neuronx-k8-scheduler-2.13.4.0
OCI aws-neuronx-oci-hook-2.2.0.0
TensorFlow tensorflow-neuron-1.15.5.2.8.1.0
TensorFlow tensorflow-neuron-2.10.1.2.8.1.0
TensorFlow tensorflow-neuron-2.7.4.2.8.1.0
TensorFlow tensorflow-neuron-2.8.4.2.8.1.0
TensorFlow tensorflow-neuron-2.9.3.2.8.1.0
TensorFlow Model Server tensorflow-model-server-neuronx-1.15.0.2.8.1.0
TensorFlow Model Server tensorflow-model-server-neuronx-2.10.1.2.8.1.0
TensorFlow Model Server tensorflow-model-server-neuronx-2.7.4.2.8.1.0
TensorFlow Model Server tensorflow-model-server-neuronx-2.8.4.2.8.1.0
TensorFlow Model Server tensorflow-model-server-neuronx-2.9.3.2.8.1.0
PyTorch torch-neuron-1.10.2.2.7.1.0
PyTorch torch-neuron-1.11.0.2.7.1.0
PyTorch torch-neuron-1.12.1.2.7.1.0
PyTorch torch-neuron-1.13.1.2.7.1.0
PyTorch torch-neuron-1.9.1.2.7.1.0
MXNet mxnet_neuron-1.5.1.1.10.39.0
MXNet mx_neuron-1.8.0.2.4.1.0
Perf Tools neuronperf-1.8.0.0
Runtime Library libnrt.so (Version 2.12.23.0)
Compiler dmlc_nnvm-1.15.0.0
Compiler dmlc_topi-1.15.0.0
Compiler dmlc_tvm-1.15.0.0
Compiler inferentia_hwm-1.14.1
Compiler islpy-2021.1
Previous Releases#
This document is relevant for: Inf1
, Inf2
, Trn1
, Trn1n