Index

_
__init__() (nki.isa.nc_version method)

A
abs (C++ function)
abs() (in module nki.language)
abs_out (C++ function)
accessor (C++ function), [1]
activation() (in module nki.isa)
activation_reduce() (in module nki.isa)
add (C++ function), [1]
add() (in module nki.language)
add_out (C++ function), [1]
affine_range() (in module nki.language)
affine_select() (in module nki.isa)
all() (in module nki.language)
all_reduce() (in module nki.language)
alloc()
    (in module nki.compiler.psum)
    (in module nki.compiler.sbuf)
allocated_fused_rms_norm_qkv() (in module nki.kernels)
allocated_fused_self_attn_for_SD_small_head_size() (in module nki.kernels)
arange() (in module nki.language)
arctan() (in module nki.language)
assert_shape() (nki.tensor method)
astype() (nki.tensor method)
atomic_rmw() (in module nki.language)
auto_alloc()
    (in module nki.compiler.psum)
    (in module nki.compiler.sbuf)

B
baremetal() (in module nki)
benchmark()
    built-in function
benchmark() (in module nki)
BF16
bfloat16 (in module nki.language)
bitwise_and (C++ function), [1], [2]
bitwise_and() (in module nki.language)
bitwise_and_out (C++ function), [1], [2]
bitwise_not (C++ function)
bitwise_not_out (C++ function)
bitwise_or (C++ function), [1], [2]
bitwise_or() (in module nki.language)
bitwise_or_out (C++ function), [1], [2]
bitwise_xor() (in module nki.language)
bn_aggr() (in module nki.isa)
bn_stats() (in module nki.isa)
broadcast_to() (nki.tensor method)
built-in function
    benchmark()
    compile()
    get_reports()
    model_index.append()
    model_index.copy()
    model_index.create()
    model_index.filter()
    model_index.load()
    model_index.move()
    model_index.save()
    print_reports()
    torch.neuron.DataParallel()
    torch.neuron.DataParallel.disable_dynamic_batching(), [1]
    torch_neuron.trace()
    torch_neuronx.analyze()
    torch_neuronx.async_load()
    torch_neuronx.bucket_model_trace()
    torch_neuronx.DataParallel()
    torch_neuronx.dynamic_batch()
    torch_neuronx.experimental.multicore_context()
    torch_neuronx.experimental.neuron_cores_context()
    torch_neuronx.experimental.profiler.profile()
    torch_neuronx.experimental.profiler.profile.start()
    torch_neuronx.experimental.set_multicore()
    torch_neuronx.experimental.set_neuron_cores()
    torch_neuronx.lazy_load()
    torch_neuronx.move_trace_to_device()
    torch_neuronx.PartitionerConfig()
    torch_neuronx.replace_weights()
    torch_neuronx.trace()
    write_csv()
    write_json()

C
CCE
ceil (C++ function)
ceil() (in module nki.language)
ceil_out (C++ function)
cFP8
clamp (C++ function)
clamp_out (C++ function)
close (C++ function), [1]
Collective Communication Engine
compile()
    built-in function
copy() (in module nki.language)
cos (C++ function)
cos() (in module nki.language)
cos_out (C++ function)
CustomOps

D
device_print() (in module nki.language)
div (C++ function), [1]
div_out (C++ function), [1]
divide() (in module nki.language)
dma_copy() (in module nki.isa)
dma_engine (in module nki.isa)
DP
DPr
dropout()
    (in module nki.isa)
    (in module nki.language)
ds() (in module nki.language)
dtype (nki.tensor property)

E
empty (C++ function)
enable_stack_allocator() (in module nki.compiler)
equal() (in module nki.language)
erf() (in module nki.language)
erf_dx() (in module nki.language)
exp (C++ function)
exp() (in module nki.language)
exp_out (C++ function)
expand_dims()
    (in module nki.language)
    (nki.tensor method)
eye (C++ function)

F
fill_ (C++ function)
flash_attn_bwd() (in module nki.kernels)
flash_fwd() (in module nki.kernels)
FLOAT32_TO_FLOAT16 (torch_neuron.Optimization attribute)
float8_e4m3 (in module nki.language)
float8_e5m2 (in module nki.language)
floor (C++ function)
floor() (in module nki.language)
floor_out (C++ function)
force_auto_alloc() (in module nki.compiler)
FP16
FP32
full (C++ function)
full() (in module nki.language)
fused_self_attn_for_SD_small_head_size() (in module nki.kernels)

G
gelu() (in module nki.language)
gelu_apprx_tanh() (in module nki.language)
gelu_dx() (in module nki.language)
get_accessor_coherence_policy (C++ function)
get_cpu_count (C++ function)
get_cpu_id (C++ function)
get_dst_tensor (C++ function)
get_nc_version() (in module nki.isa)
get_reports()
    built-in function
GPSIMD Engine
gpsimd_engine (in module nki.isa)
GpSimdE
greater() (in module nki.language)
greater_equal() (in module nki.language)

H
HBM
hbm (in module nki.language)
High Bandwidth Memory

I
Inf1
Inf2
Inferentia
invert() (in module nki.language)
iota() (in module nki.isa)
itemsize (nki.tensor property)

J
jit() (in module nki)

L
left_shift() (in module nki.language)
less() (in module nki.language)
less_equal() (in module nki.language)
load() (in module nki.language)
load_transpose2d() (in module nki.language)
local_gather() (in module nki.isa)
log (C++ function)
log() (in module nki.language)
log10 (C++ function)
log10_out (C++ function)
log2 (C++ function)
log2_out (C++ function)
log_out (C++ function)
logical_and() (in module nki.language)
logical_not() (in module nki.language)
logical_or() (in module nki.language)
logical_xor() (in module nki.language)
loop_reduce() (in module nki.language)

M
matmul() (in module nki.language)
max() (in module nki.language)
maximum() (in module nki.language)
mean() (in module nki.language)
memset() (in module nki.isa)
mgrid (in module nki.language)
min() (in module nki.language)
minimum() (in module nki.language)
mish() (in module nki.language)
mod_alloc()
    (in module nki.compiler.psum)
    (in module nki.compiler.sbuf)
model_index.append()
    built-in function
model_index.copy()
    built-in function
model_index.create()
    built-in function
model_index.filter()
    built-in function
model_index.load()
    built-in function
model_index.move()
    built-in function
model_index.save()
    built-in function
module
    placement
mul (C++ function), [1]
mul_out (C++ function), [1]
multiply() (in module nki.language)

N
NC
nc (in module nki.language)
nc_matmul() (in module nki.isa)
nc_transpose() (in module nki.isa)
nc_version (class in nki.isa)
ND
ndarray() (in module nki.language)
ndim (nki.tensor property)
negative() (in module nki.language)
Neuron Device
Neuron Kernel Interface
neuron-cc
    neuron-cc command line option, [1], [2]
neuron-cc command line option
    neuron-cc, [1], [2]
neuron-ls
    neuron-ls command line option
neuron-ls command line option
    neuron-ls
neuron-monitor
    neuron-monitor command line option
neuron-monitor command line option
    neuron-monitor
neuron-profile
    neuron-profile command line option, [1]
neuron-profile command line option
    neuron-profile, [1]
NeuronCore, [1]
NeuronCore-v1
NeuronCore-v2
NeuronDevice
NeuronLink
NeuronLink-v1
NeuronLink-v2
neuronx-cc
    neuronx-cc command line option, [1], [2]
neuronx-cc command line option
    neuronx-cc, [1], [2]
NKI
not_equal() (in module nki.language)
nrt_add_tensor_to_tensor_set (C function)
nrt_allocate_tensor_set (C function)
nrt_close (C function)
nrt_destroy_tensor_set (C function)
nrt_execute (C function)
nrt_execute_repeat (C function)
nrt_free_model_tensor_info (C function)
nrt_get_model_instance_count (C function)
nrt_get_model_nc_count (C function)
nrt_get_model_tensor_info (C function)
nrt_get_tensor_from_tensor_set (C function)
nrt_get_total_nc_count (C function)
nrt_get_version (C function)
nrt_get_visible_nc_count (C function)
nrt_init (C function)
nrt_load (C function)
nrt_load_collectives (C function)
nrt_profile_start (C function)
nrt_profile_stop (C function)
nrt_tensor_allocate (C function)
nrt_tensor_allocate_empty (C function)
nrt_tensor_allocate_slice (C function)
nrt_tensor_attach_buffer (C function)
nrt_tensor_free (C function)
nrt_tensor_get_size (C function)
nrt_tensor_get_va (C function)
nrt_tensor_read (C function)
nrt_tensor_write (C function)
nrt_unload (C function)
num_programs() (in module nki.language)
NxD Core
NxD Training

O
ones (C++ function)
ones() (in module nki.language)
operator= (C++ function), [1]

P
par_dim (in module nki.language)
Partial Sum Buffer
placement
    module
pow (C++ function), [1], [2]
pow_out (C++ function), [1], [2]
power() (in module nki.language)
PP
PPr
print_reports()
    built-in function
private_hbm (in module nki.language)
prod() (in module nki.language)
program_id() (in module nki.language)
program_ndim() (in module nki.language)
PSUM
psum (in module nki.language)

R
rand() (in module nki.language)
random_seed() (in module nki.language)
read (C++ function)
read_stream_accessor (C++ function)
reciprocal() (in module nki.isa)
relu() (in module nki.language)
reshape() (nki.tensor method)
resize_nearest_fixed_dma_kernel() (in module nki.kernels)
right_shift() (in module nki.language)
rms_norm() (in module nki.language)
RNE
rsqrt() (in module nki.language)
RT

S
SBUF
sbuf (in module nki.language)
Scalar Engine
scalar_engine (in module nki.isa)
scalar_tensor_tensor() (in module nki.isa)
ScalarE
select_and_scatter_kernel() (in module nki.kernels)
sequential_range() (in module nki.language)
set_accessor_coherence_policy (C++ function)
shape (nki.tensor property)
shared_constant() (in module nki.language)
shared_hbm (in module nki.language)
shared_identity_matrix() (in module nki.language)
sigmoid() (in module nki.language)
sign() (in module nki.language)
silu() (in module nki.language)
silu_dx() (in module nki.language)
simulate_kernel() (in module nki)
sin (C++ function)
sin() (in module nki.language)
sin_out (C++ function)
skip_middle_end_transformations() (in module nki.compiler)
softmax() (in module nki.language)
softplus() (in module nki.language)
spmd_dim (in module nki.language)
sqrt() (in module nki.language)
square() (in module nki.language)
SR
State Buffer
static_range() (in module nki.language)
store() (in module nki.language)
sub (C++ function), [1]
sub_out (C++ function), [1]
subtract() (in module nki.language)
sum() (in module nki.language)
Sync Engine

T
tan (C++ function)
tan() (in module nki.language)
tan_out (C++ function)
tanh() (in module nki.language)
tcm_accessor (C++ function), [1]
tcm_to_tensor (C++ function)
tensor (class in nki)
Tensor Engine
tensor_copy() (in module nki.isa)
tensor_copy_dynamic_src() (in module nki.isa)
tensor_engine (in module nki.isa)
tensor_partition_reduce() (in module nki.isa)
tensor_reduce() (in module nki.isa)
tensor_scalar() (in module nki.isa)
tensor_scalar_reduce() (in module nki.isa)
tensor_tensor() (in module nki.isa)
tensor_tensor_scan() (in module nki.isa)
tensor_to_tcm (C++ function)
TensorE
TF32
tfloat32 (in module nki.language)
tile_size (class in nki.language)
torch.neuron.DataParallel()
    built-in function
torch.neuron.DataParallel.disable_dynamic_batching()
    built-in function, [1]
torch::neuron::tcm_free (C++ function)
torch::neuron::tcm_malloc (C++ function)
torch_neuron.experimental.multicore_context() (in module placement)
torch_neuron.experimental.neuron_cores_context() (in module placement)
torch_neuron.experimental.set_multicore() (in module placement)
torch_neuron.experimental.set_neuron_cores() (in module placement)
torch_neuron.Optimization (built-in class)
torch_neuron.trace()
    built-in function
torch_neuronx.analyze()
    built-in function
torch_neuronx.async_load()
    built-in function
torch_neuronx.bucket_model_trace()
    built-in function
torch_neuronx.BucketModelConfig (built-in class)
torch_neuronx.DataParallel()
    built-in function
torch_neuronx.dynamic_batch()
    built-in function
torch_neuronx.experimental.multicore_context()
    built-in function
torch_neuronx.experimental.neuron_cores_context()
    built-in function
torch_neuronx.experimental.profiler.profile()
    built-in function
torch_neuronx.experimental.profiler.profile.start()
    built-in function
torch_neuronx.experimental.set_multicore()
    built-in function
torch_neuronx.experimental.set_neuron_cores()
    built-in function
torch_neuronx.lazy_load()
    built-in function
torch_neuronx.move_trace_to_device()
    built-in function
torch_neuronx.PartitionerConfig()
    built-in function
torch_neuronx.replace_weights()
    built-in function
torch_neuronx.trace()
    built-in function
TP
TPr
Trainium/Inferentia2
transpose() (in module nki.language)
Trn1
trunc() (in module nki.language)

U
unknown_engine (in module nki.isa)

V
var() (in module nki.language)
Vector Engine
vector_engine (in module nki.isa)
VectorE
view() (nki.tensor method)

W
where() (in module nki.language)
write (C++ function)
write_csv()
    built-in function
write_json()
    built-in function
write_stream_accessor (C++ function)

Z
zeros (C++ function)
zeros() (in module nki.language)
zeros_like() (in module nki.language)