This document is relevant for: Inf2, Trn1, Trn1n

Neuron Calculator

Number of NeuronCores needed for LLM Inference

The Neuron Calculator estimates how many NeuronCores an LLM needs for inference. Enter the model configuration below. Each text field accepts multiple values; press Enter after adding each value.

Model: Custom LLM Model, opt-66b, meta-llama/Llama-2-7b, meta-llama/Llama-2-13b
Instance Type: Inf2, Trn1
Data Type: BF16 / FP16
Batch Size
Max Sequence Length
Embedding Dimension
Number of Attention Heads
Number of Layers
Tensor Parallel Degree Constraint: flexible tensor parallelism (TP) is not supported for certain models, such as GPT-J and GPT-NeoX, in transformers-neuronx. When this constraint is enabled, a TP degree is flagged as invalid if the number of attention heads is not divisible by it.

For each submitted configuration, the calculator reports the following columns:

Batch Size | Max Seq Length | Embedding Dimension | Num Attention Heads | Num Layers | Memory Footprint (GB) | TP Degree (NeuronCores) | Instances Recommended

Results can be revised with Edit Model Configuration or cleared with Reset Calculator.
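The exact formulas behind the calculator are not shown on this page. The sketch below is a rough, illustrative way to estimate the memory footprint (weights plus KV cache) and the smallest tensor-parallel degree that fits it, assuming 2 bytes per parameter for BF16/FP16, roughly 16 GB of usable memory per NeuronCore, an assumed set of valid TP degrees per instance family, and a default vocabulary size of 32,000. The names ModelConfig, memory_footprint_gb, and min_tp_degree are hypothetical helpers, not part of any Neuron library, and the numbers they produce will differ from the calculator's output.

```python
"""Illustrative estimate of NeuronCores needed for LLM inference.

A minimal sketch of the kind of arithmetic the calculator performs;
the formulas, per-core memory figure, and TP-degree sets are assumptions.
"""

from dataclasses import dataclass

BYTES_PER_PARAM = 2          # BF16 / FP16
NEURONCORE_HBM_GB = 16       # assumed usable memory per NeuronCore
# Assumed valid tensor-parallel degrees per instance family.
VALID_TP = {"Inf2": [1, 2, 4, 8, 12, 24], "Trn1": [1, 2, 8, 32]}


@dataclass
class ModelConfig:
    batch_size: int
    max_seq_len: int
    embed_dim: int
    num_heads: int
    num_layers: int
    vocab_size: int = 32000  # assumed default; affects embedding weights only


def memory_footprint_gb(cfg: ModelConfig) -> float:
    """Rough weights + KV-cache footprint for a decoder-only transformer."""
    # Weights: embeddings plus roughly 12 * d^2 parameters per layer
    # (4 * d^2 for attention projections, 8 * d^2 for the MLP).
    params = cfg.vocab_size * cfg.embed_dim + cfg.num_layers * 12 * cfg.embed_dim ** 2
    weights_bytes = params * BYTES_PER_PARAM
    # KV cache: K and V tensors per layer, per token, per batch element.
    kv_cache_bytes = (2 * cfg.batch_size * cfg.max_seq_len
                      * cfg.num_layers * cfg.embed_dim * BYTES_PER_PARAM)
    return (weights_bytes + kv_cache_bytes) / 1024 ** 3


def min_tp_degree(cfg: ModelConfig, instance: str, strict_heads: bool = True) -> int:
    """Smallest supported TP degree whose pooled memory fits the model."""
    need_gb = memory_footprint_gb(cfg)
    for tp in VALID_TP[instance]:
        if strict_heads and cfg.num_heads % tp != 0:
            continue  # flexible TP unsupported: heads must divide evenly
        if tp * NEURONCORE_HBM_GB >= need_gb:
            return tp
    raise ValueError("Model does not fit on a single instance at any TP degree")


if __name__ == "__main__":
    # Roughly Llama-2-7B-shaped configuration.
    llama_7b = ModelConfig(batch_size=1, max_seq_len=2048,
                           embed_dim=4096, num_heads=32, num_layers=32)
    print(f"Estimated footprint: {memory_footprint_gb(llama_7b):.1f} GB")
    print(f"Minimum TP degree on Inf2: {min_tp_degree(llama_7b, 'Inf2')}")
```

The strict_heads flag mirrors the Tensor Parallel Degree Constraint checkbox: when set, TP degrees that do not evenly divide the number of attention heads are skipped rather than reported.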