This document is relevant for: Inf1, Inf2, Trn1, Trn1n

What’s New#

Neuron 2.10.0 (05/01/2023)#

What’s New#

This release introduces new features, performance optimizations, minor enhancements, and bug fixes. The highlights are:

What’s New

Details

Instances

Initial support for computer vision model inference

  • Added Stable Diffusion 2.1 model script for Text to Image Generation

  • Added VGG model script for Image Classification Task

  • Added UNet model script for Image Segmentation Task

  • See the aws-neuron-samples repository for these scripts

Inf2, Trn1/Trn1n

Profiling support in PyTorch Neuron (torch-neuronx) for Inference with TensorBoard

Inf2, Trn1/Trn1n

New Features and Performance Enhancements in transformers-neuronx

Inf2, Trn1/Trn1n

Support models larger than 2GB in TensorFlow 2.x Neuron (tensorflow-neuronx)

Trn1/Trn1n, Inf2

Support models larger than 2GB in TensorFlow 2.x Neuron (tensorflow-neuron)

Inf1

Performance Enhancements in PyTorch C++ Custom Operators (Experimental)

Trn1/Trn1n

Weight Deduplication Feature (Inf1)

  • Support for sharing weights when loading multiple instances of the same model on different NeuronCores.

  • See more at Neuron Runtime Configuration

Inf1

nccom-test - Collective Communication Benchmarking Tool

  • Supports benchmarking sweeps on various Neuron Collective Communication operations. See NCCOM-TEST (Beta) for more details.

Trn1/Trn1n, Inf2

Announcing end of support for tensorflow-neuron 2.7 & mxnet-neuron 1.5 versions

Inf1

Minor enhancements and bug fixes.

Trn1/Trn1n, Inf2, Inf1

Release Artifacts

Trn1/Trn1n, Inf2, Inf1

For more detailed release notes of the new features and resolved issues, see Neuron Components Release Notes.

To learn about the model architectures currently supported on Inf1, Inf2, Trn1 and Trn1n instances, please see Model Architecture Fit Guidelines.

Neuron Components Release Notes#

Inf1, Trn1/Trn1n and Inf2 common packages#

Component                         Instance/s                Package/s
Neuron Runtime                    Trn1/Trn1n, Inf1, Inf2    Trn1/Trn1n, Inf2: aws-neuronx-runtime-lib (.deb, .rpm)
                                                            Inf1: Runtime is linked into the ML frameworks packages
Neuron Runtime Driver             Trn1/Trn1n, Inf1, Inf2    aws-neuronx-dkms (.deb, .rpm)
Neuron System Tools               Trn1/Trn1n, Inf1, Inf2    aws-neuronx-tools (.deb, .rpm)
Containers                        Trn1/Trn1n, Inf1, Inf2    aws-neuronx-k8-plugin (.deb, .rpm)
                                                            aws-neuronx-k8-scheduler (.deb, .rpm)
                                                            aws-neuronx-oci-hook (.deb, .rpm)
NeuronPerf (Inference only)       Trn1/Trn1n, Inf1, Inf2    neuronperf (.whl)
TensorFlow Model Server Neuron    Trn1/Trn1n, Inf1, Inf2    tensorflow-model-server-neuronx (.deb, .rpm)

Trn1/Trn1n and Inf2 only packages#

Component                                  Instance/s          Package/s
PyTorch Neuron                             Trn1/Trn1n, Inf2    torch-neuronx (.whl)
TensorFlow Neuron                          Trn1/Trn1n, Inf2    tensorflow-neuronx (.whl)
Neuron Compiler (Trn1/Trn1n, Inf2 only)    Trn1/Trn1n, Inf2    neuronx-cc (.whl)
Collective Communication library           Trn1/Trn1n, Inf2    aws-neuronx-collectives (.deb, .rpm)
Neuron Custom C++ Operators                Trn1/Trn1n, Inf2    aws-neuronx-gpsimd-customop (.deb, .rpm)
                                                               aws-neuronx-gpsimd-tools (.deb, .rpm)
transformers-neuronx                       Trn1/Trn1n, Inf2

Note

In upcoming releases, aws-neuronx-tools and aws-neuronx-runtime-lib will add support for Inf1.

Release Artifacts#

Trn1 packages

List of packages in Neuron 2.10.0:

Component                           Package                                           
Collective Communication Library    aws-neuronx-collectives-2.13.7.0 
Driver                              aws-neuronx-dkms-2.9.4.0 
CustomOps                           aws-neuronx-gpsimd-customop-0.3.1.0 
CustomOps Tools                     aws-neuronx-gpsimd-tools-0.3.0.0 
General                             aws-neuronx-runtime-discovery-2.9 
Runtime Library                     aws-neuronx-runtime-lib-2.13.6.0 
System Tools                        aws-neuronx-tools-2.10.1.0 
General                             libneuronxla-0.5.207 
General                             neuronx_hwm-2.6.0.0 
PyTorch                             torch_xla-1.13.1 
Compiler                            neuronx-cc-2.6.0.19 
Kubernetes Plugin                   aws-neuronx-k8-plugin-2.13.4.0 
Kubernetes Scheduler                aws-neuronx-k8-scheduler-2.13.4.0 
OCI                                 aws-neuronx-oci-hook-2.2.0.0 
TensorFlow                          tensorflow-neuronx-2.10.1.2.1.0 
TensorFlow                          tensorflow-neuronx-2.7.4.2.1.0 
TensorFlow                          tensorflow-neuronx-2.8.4.2.1.0 
TensorFlow                          tensorflow-neuronx-2.9.3.2.1.0 
TensorFlow Model Server             tensorflow-model-server-neuronx-1.15.0.2.8.1.0 
TensorFlow Model Server             tensorflow-model-server-neuronx-2.10.1.2.8.1.0 
TensorFlow Model Server             tensorflow-model-server-neuronx-2.7.4.2.8.1.0 
TensorFlow Model Server             tensorflow-model-server-neuronx-2.8.4.2.8.1.0 
TensorFlow Model Server             tensorflow-model-server-neuronx-2.9.3.2.8.1.0 
TensorBoard                         tensorboard-plugin-neuronx-2.5.26.0 
PyTorch                             torch-neuronx-1.13.1.1.7.0 
Transformers Neuron                 transformers-neuronx-0.3.32 
Compiler                            islpy-2021.1

Inf2 packages

List of packages in Neuron 2.10.0:

Component                           Package                                           
Collective Communication Library    aws-neuronx-collectives-2.13.7.0 
Driver                              aws-neuronx-dkms-2.9.4.0 
CustomOps                           aws-neuronx-gpsimd-customop-0.3.1.0 
CustomOps Tools                     aws-neuronx-gpsimd-tools-0.3.0.0 
General                             aws-neuronx-runtime-discovery-2.9 
Runtime Library                     aws-neuronx-runtime-lib-2.13.6.0 
System Tools                        aws-neuronx-tools-2.10.1.0 
General                             libneuronxla-0.5.207 
General                             neuronx_hwm-2.6.0.0 
PyTorch                             torch_xla-1.13.1 
Compiler                            neuronx-cc-2.6.0.19 
Kubernetes Plugin                   aws-neuronx-k8-plugin-2.13.4.0 
Kubernetes Scheduler                aws-neuronx-k8-scheduler-2.13.4.0 
OCI                                 aws-neuronx-oci-hook-2.2.0.0 
TensorFlow                          tensorflow-neuronx-2.10.1.2.1.0 
TensorFlow                          tensorflow-neuronx-2.7.4.2.1.0 
TensorFlow                          tensorflow-neuronx-2.8.4.2.1.0 
TensorFlow                          tensorflow-neuronx-2.9.3.2.1.0 
TensorFlow Model Server             tensorflow-model-server-neuronx-1.15.0.2.8.1.0 
TensorFlow Model Server             tensorflow-model-server-neuronx-2.10.1.2.8.1.0 
TensorFlow Model Server             tensorflow-model-server-neuronx-2.7.4.2.8.1.0 
TensorFlow Model Server             tensorflow-model-server-neuronx-2.8.4.2.8.1.0 
TensorFlow Model Server             tensorflow-model-server-neuronx-2.9.3.2.8.1.0 
TensorBoard                         tensorboard-plugin-neuronx-2.5.26.0 
PyTorch                             torch-neuronx-1.13.1.1.7.0 
Transformers Neuron                 transformers-neuronx-0.3.32 
Compiler                            islpy-2021.1

Inf1 packages

List of packages in Neuron 2.10.0:

Component                           Package                                           
Driver                              aws-neuronx-dkms-2.9.4.0 
System Tools                        aws-neuronx-tools-2.10.1.0 
Compiler                            neuron-cc-1.15.0 
Kubernetes Plugin                   aws-neuronx-k8-plugin-2.13.4.0 
Kubernetes Scheduler                aws-neuronx-k8-scheduler-2.13.4.0 
OCI                                 aws-neuronx-oci-hook-2.2.0.0 
TensorFlow                          tensorflow-neuron-1.15.5.2.8.1.0 
TensorFlow                          tensorflow-neuron-2.10.1.2.8.1.0 
TensorFlow                          tensorflow-neuron-2.7.4.2.8.1.0 
TensorFlow                          tensorflow-neuron-2.8.4.2.8.1.0 
TensorFlow                          tensorflow-neuron-2.9.3.2.8.1.0 
TensorFlow Model Server             tensorflow-model-server-neuronx-1.15.0.2.8.1.0 
TensorFlow Model Server             tensorflow-model-server-neuronx-2.10.1.2.8.1.0 
TensorFlow Model Server             tensorflow-model-server-neuronx-2.7.4.2.8.1.0 
TensorFlow Model Server             tensorflow-model-server-neuronx-2.8.4.2.8.1.0 
TensorFlow Model Server             tensorflow-model-server-neuronx-2.9.3.2.8.1.0 
PyTorch                             torch-neuron-1.10.2.2.7.1.0 
PyTorch                             torch-neuron-1.11.0.2.7.1.0 
PyTorch                             torch-neuron-1.12.1.2.7.1.0 
PyTorch                             torch-neuron-1.13.1.2.7.1.0 
PyTorch                             torch-neuron-1.9.1.2.7.1.0 
MXNet                               mxnet_neuron-1.5.1.1.10.39.0 
MXNet                               mx_neuron-1.8.0.2.4.1.0 
Perf Tools                          neuronperf-1.8.0.0 
Runtime Library                     libnrt.so (Version 2.12.23.0)
Compiler                            dmlc_nnvm-1.15.0.0 
Compiler                            dmlc_topi-1.15.0.0 
Compiler                            dmlc_tvm-1.15.0.0 
Compiler                            inferentia_hwm-1.14.1 
Compiler                            islpy-2021.1
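
The package entries in the lists above are written as a name followed by a hyphen and a version (for example, torch-neuronx-1.13.1.1.7.0). The following is a minimal sketch, not part of the Neuron tooling, of how such an entry could be split into its name and version and turned into a pip-style pin for the wheel packages. The helper names split_package and pip_pin are hypothetical, and the sketch assumes that the version is always the trailing run of dot-separated numbers after the last hyphen.

```python
import re

# Assumption: an entry looks like "<name>-<version>", where <version> is the
# trailing run of dot-separated numbers after the last hyphen.
_NAME_VERSION = re.compile(r"^(?P<name>.+)-(?P<version>\d+(?:\.\d+)*)$")


def split_package(entry):
    """Split an entry such as 'neuronx-cc-2.6.0.19' into (name, version).

    Hypothetical helper for illustration only.
    """
    match = _NAME_VERSION.match(entry.strip())
    if match is None:
        raise ValueError(f"not in name-version form: {entry!r}")
    return match.group("name"), match.group("version")


def pip_pin(entry):
    """Format an entry as a pip-style exact pin, e.g. 'neuronx-cc==2.6.0.19'.

    Only meaningful for packages marked (.whl) in the component tables.
    """
    name, version = split_package(entry)
    return f"{name}=={version}"


if __name__ == "__main__":
    # Entries copied from the Trn1 and Inf2 lists above.
    for entry in ("torch-neuronx-1.13.1.1.7.0", "neuronx-cc-2.6.0.19"):
        print(pip_pin(entry))
```

Entries that do not follow this pattern, such as "libnrt.so (Version 2.12.23.0)" in the Inf1 list, raise a ValueError in this sketch and would need to be handled separately.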