Index _ | A | B | C | D | E | F | G | H | I | J | L | M | N | O | P | R | S | T | V | W | Z _ __init__() (nki.isa.nc_version method) A abs (C++ function) abs() (in module nki.language) abs_out (C++ function) accessor (C++ function), [1] activation() (in module nki.isa) activation_reduce() (in module nki.isa) add (C++ function), [1] add() (in module nki.language) add_out (C++ function), [1] affine_range() (in module nki.language) affine_select() (in module nki.isa) all() (in module nki.language) all_reduce() (in module nki.language) alloc() (in module nki.compiler.psum) (in module nki.compiler.sbuf) arange() (in module nki.language) arctan() (in module nki.language) assert_shape() (nki.tensor method) astype() (nki.tensor method) atomic_rmw() (in module nki.language) auto_alloc() (in module nki.compiler.psum) (in module nki.compiler.sbuf) B baremetal() (in module nki) benchmark() built-in function benchmark() (in module nki) BF16 bfloat16 (in module nki.language) bitwise_and (C++ function), [1], [2] bitwise_and() (in module nki.language) bitwise_and_out (C++ function), [1], [2] bitwise_not (C++ function) bitwise_not_out (C++ function) bitwise_or (C++ function), [1], [2] bitwise_or() (in module nki.language) bitwise_or_out (C++ function), [1], [2] bitwise_xor() (in module nki.language) bn_aggr() (in module nki.isa) bn_stats() (in module nki.isa) broadcast_to() (in module nki.language) (nki.tensor method) built-in function benchmark() compile() get_reports() model_index.append() model_index.copy() model_index.create() model_index.filter() model_index.load() model_index.move() model_index.save() print_reports() torch.neuron.DataParallel() torch.neuron.DataParallel.disable_dynamic_batching(), [1] torch_neuron.trace() torch_neuronx.analyze() torch_neuronx.async_load() torch_neuronx.bucket_model_trace() torch_neuronx.DataParallel() torch_neuronx.dynamic_batch() torch_neuronx.experimental.multicore_context() torch_neuronx.experimental.neuron_cores_context() torch_neuronx.experimental.profiler.profile() torch_neuronx.experimental.profiler.profile.start() torch_neuronx.experimental.set_multicore() torch_neuronx.experimental.set_neuron_cores() torch_neuronx.lazy_load() torch_neuronx.move_trace_to_device() torch_neuronx.PartitionerConfig() torch_neuronx.replace_weights() torch_neuronx.trace() write_csv() write_json() C CCE ceil (C++ function) ceil() (in module nki.language) ceil_out (C++ function) cFP8 clamp (C++ function) clamp_out (C++ function) close (C++ function), [1] Collective Communication Engine compile() built-in function copy() (in module nki.language) cos (C++ function) cos() (in module nki.language) cos_out (C++ function) CustomOps D device_print() (in module nki.language) dge_mode (class in nki.isa) div (C++ function), [1] div_out (C++ function), [1] divide() (in module nki.language) dma_copy() (in module nki.isa) DP DPr dropout() (in module nki.isa) (in module nki.language) ds() (in module nki.language) dtype (nki.tensor property) E empty (C++ function) empty_like() (in module nki.language) enable_stack_allocator() (in module nki.compiler) engine (class in nki.isa) equal() (in module nki.language) erf() (in module nki.language) erf_dx() (in module nki.language) exp (C++ function) exp() (in module nki.language) exp_out (C++ function) expand_dims() (in module nki.language) (nki.tensor method) eye (C++ function) F fill_ (C++ function) FLOAT32_TO_FLOAT16 (torch_neuron.Optimization attribute) float8_e4m3 (in module nki.language) float8_e5m2 (in module nki.language) floor (C++ function) floor() (in module nki.language) floor_out (C++ function) fmod() (in module nki.language) force_auto_alloc() (in module nki.compiler) FP16 FP32 fp32 (class in nki.language) full (C++ function) full() (in module nki.language) G gelu() (in module nki.language) gelu_apprx_tanh() (in module nki.language) gelu_dx() (in module nki.language) get_accessor_coherence_policy (C++ function) get_cpu_count (C++ function) get_cpu_id (C++ function) get_dst_tensor (C++ function) get_nc_version() (in module nki.isa) get_reports() built-in function GPSIMD Engine GpSimdE greater() (in module nki.language) greater_equal() (in module nki.language) H HBM hbm (in module nki.language) High Bandwidth Memory I Inf1 Inf2 Inferentia invert() (in module nki.language) iota() (in module nki.isa) itemsize (nki.tensor property) J jit() (in module nki) L left_shift() (in module nki.language) less() (in module nki.language) less_equal() (in module nki.language) load() (in module nki.language) load_transpose2d() (in module nki.language) local_gather() (in module nki.isa) log (C++ function) log() (in module nki.language) log10 (C++ function) log10_out (C++ function) log2 (C++ function) log2_out (C++ function) log_out (C++ function) logical_and() (in module nki.language) logical_not() (in module nki.language) logical_or() (in module nki.language) logical_xor() (in module nki.language) loop_reduce() (in module nki.language) M matmul() (in module nki.language) max() (in module nki.language) max8() (in module nki.isa) maximum() (in module nki.language) mean() (in module nki.language) memset() (in module nki.isa) mgrid (in module nki.language) min() (in module nki.language) minimum() (in module nki.language) mish() (in module nki.language) mod() (in module nki.language) mod_alloc() (in module nki.compiler.psum) (in module nki.compiler.sbuf) model_index.append() built-in function model_index.copy() built-in function model_index.create() built-in function model_index.filter() built-in function model_index.load() built-in function model_index.move() built-in function model_index.save() built-in function module placement mul (C++ function), [1] mul_out (C++ function), [1] multiply() (in module nki.language) N NC nc (in module nki.language) nc_find_index8() (in module nki.isa) nc_match_replace8() (in module nki.isa) nc_matmul() (in module nki.isa) nc_stream_shuffle() (in module nki.isa) nc_transpose() (in module nki.isa) nc_version (class in nki.isa) ND ndarray() (in module nki.language) ndim (nki.tensor property) negative() (in module nki.language) Neuron Device Neuron Kernel Interface neuron-cc neuron-cc command line option, [1], [2] neuron-cc command line option neuron-cc, [1], [2] neuron-ls neuron-ls command line option neuron-ls command line option neuron-ls neuron-monitor neuron-monitor command line option neuron-monitor command line option neuron-monitor neuron-profile neuron-profile command line option, [1] neuron-profile command line option neuron-profile, [1] NeuronCore, [1] NeuronCore-v1 NeuronCore-v2 NeuronCore-v3 NeuronDevice NeuronLink NeuronLink-v1 NeuronLink-v2 NeuronLink-v3 neuronx-cc neuronx-cc command line option, [1], [2] neuronx-cc command line option neuronx-cc, [1], [2] NKI not_equal() (in module nki.language) nrt_add_tensor_to_tensor_set (C function) nrt_allocate_tensor_set (C function) nrt_close (C function) nrt_destroy_tensor_set (C function) nrt_execute (C function) nrt_execute_repeat (C function) nrt_free_model_tensor_info (C function) nrt_get_model_instance_count (C function) nrt_get_model_nc_count (C function) nrt_get_model_tensor_info (C function) nrt_get_tensor_from_tensor_set (C function) nrt_get_total_nc_count (C function) nrt_get_version (C function) nrt_get_visible_nc_count (C function) nrt_init (C function) nrt_load (C function) nrt_load_collectives (C function) nrt_profile_start (C function) nrt_profile_stop (C function) nrt_tensor_allocate (C function) nrt_tensor_allocate_empty (C function) nrt_tensor_allocate_slice (C function) nrt_tensor_attach_buffer (C function) nrt_tensor_free (C function) nrt_tensor_get_size (C function) nrt_tensor_get_va (C function) nrt_tensor_read (C function) nrt_tensor_write (C function) nrt_unload (C function) num_programs() (in module nki.language) NxD Core NxD Inference NxD Training O ones (C++ function) ones() (in module nki.language) operator= (C++ function), [1] P par_dim (in module nki.language) Partial Sum Buffer placement module pow (C++ function), [1], [2] pow_out (C++ function), [1], [2] power() (in module nki.language) PP PPr print_reports() built-in function private_hbm (in module nki.language) prod() (in module nki.language) profile() (in module nki) program_id() (in module nki.language) program_ndim() (in module nki.language) PSUM psum (in module nki.language) R rand() (in module nki.language) random_seed() (in module nki.language) range_select() (in module nki.isa) read (C++ function) read_stream_accessor (C++ function) reciprocal() (in module nki.isa) (in module nki.language) reduce_cmd (class in nki.isa) relu() (in module nki.language) reshape() (nki.tensor method) right_shift() (in module nki.language) rms_norm() (in module nki.language) RNE rsqrt() (in module nki.language) RT S SBUF sbuf (in module nki.language) Scalar Engine scalar_tensor_tensor() (in module nki.isa) ScalarE sequential_range() (in module nki.language) set_accessor_coherence_policy (C++ function) shape (nki.tensor property) shared_constant() (in module nki.language) shared_hbm (in module nki.language) shared_identity_matrix() (in module nki.language) sigmoid() (in module nki.language) sign() (in module nki.language) silu() (in module nki.language) silu_dx() (in module nki.language) simulate_kernel() (in module nki) sin (C++ function) sin() (in module nki.language) sin_out (C++ function) skip_middle_end_transformations() (in module nki.compiler) softmax() (in module nki.language) softplus() (in module nki.language) spmd_dim (in module nki.language) sqrt() (in module nki.language) square() (in module nki.language) SR State Buffer static_range() (in module nki.language) store() (in module nki.language) sub (C++ function), [1] sub_out (C++ function), [1] subtract() (in module nki.language) sum() (in module nki.language) Sync Engine T tan (C++ function) tan() (in module nki.language) tan_out (C++ function) tanh() (in module nki.language) tcm_accessor (C++ function), [1] tcm_to_tensor (C++ function) tensor (class in nki) Tensor Engine tensor_copy() (in module nki.isa) tensor_copy_dynamic_dst() (in module nki.isa) tensor_copy_dynamic_src() (in module nki.isa) tensor_copy_predicated() (in module nki.isa) tensor_partition_reduce() (in module nki.isa) tensor_reduce() (in module nki.isa) tensor_scalar() (in module nki.isa) tensor_scalar_reduce() (in module nki.isa) tensor_tensor() (in module nki.isa) tensor_tensor_scan() (in module nki.isa) tensor_to_tcm (C++ function) TensorE TF32 tfloat32 (in module nki.language) tile_size (class in nki.language) torch.neuron.DataParallel() built-in function torch.neuron.DataParallel.disable_dynamic_batching() built-in function, [1] torch::neuron::tcm_free (C++ function) torch::neuron::tcm_malloc (C++ function) torch_neuron.experimental.multicore_context() (in module placement) torch_neuron.experimental.neuron_cores_context() (in module placement) torch_neuron.experimental.set_multicore() (in module placement) torch_neuron.experimental.set_neuron_cores() (in module placement) torch_neuron.Optimization (built-in class) torch_neuron.trace() built-in function torch_neuronx.analyze() built-in function torch_neuronx.async_load() built-in function torch_neuronx.bucket_model_trace() built-in function torch_neuronx.BucketModelConfig (built-in class) torch_neuronx.DataParallel() built-in function torch_neuronx.dynamic_batch() built-in function torch_neuronx.experimental.multicore_context() built-in function torch_neuronx.experimental.neuron_cores_context() built-in function torch_neuronx.experimental.profiler.profile() built-in function torch_neuronx.experimental.profiler.profile.start() built-in function torch_neuronx.experimental.set_multicore() built-in function torch_neuronx.experimental.set_neuron_cores() built-in function torch_neuronx.lazy_load() built-in function torch_neuronx.move_trace_to_device() built-in function torch_neuronx.PartitionerConfig() built-in function torch_neuronx.replace_weights() built-in function torch_neuronx.trace() built-in function TP TPr Trainium/Inferentia2 Trainium2 transpose() (in module nki.language) Trn1 Trn2 trunc() (in module nki.language) V var() (in module nki.language) Vector Engine VectorE view() (nki.tensor method) W where() (in module nki.language) write (C++ function) write_csv() built-in function write_json() built-in function write_stream_accessor (C++ function) Z zeros (C++ function) zeros() (in module nki.language) zeros_like() (in module nki.language)