Posts tagged announce-end-of-support
Announcing end-of-support for implicit asynchronous mode in Neuron Runtime
- 21 May 2026
A future release of the Neuron SDK will remove support for implicit asynchronous
mode, including the NEURON_RT_ASYNC_EXEC_MAX_INFLIGHT_REQUESTS environment
variable.
Customers using the implicit request async APIs must migrate their code and calls to the new Neuron Runtime async APIs. For details on the new APIs, see Neuron Runtime Async APIs: Motivation and Overview.
Announcing end-of-support for --model-type=transformer-inference compiler flag
- 15 September 2023
Starting with Neuron release 2.14, the --model-type=transformer-inference compiler flag is deprecated.
Neuron SDK users using the --model-type=transformer-inference compiler flag are highly encouraged to migrate to the --model-type=transformer compiler flag.