This document is relevant for: Inf1, Inf2, Trn1, Trn2

  • Make sure Neuron device plugin is running

  • Enable the kube-scheduler with option to use configMap for scheduler policy. In your cluster.yml Please update the spec section with the following

    spec:
      kubeScheduler:
      usePolicyConfigMap: true
    
  • Launch the cluster

    kops create -f cluster.yml
    kops create secret --name neuron-test-1.k8s.local sshpublickey admin -i ~/.ssh/id_rsa.pub
    kops update cluster --name neuron-test-1.k8s.local --yes
    
  • Install the neuron-scheduler-extension [Registers neuron-scheduler-extension with kube-scheduler]

    helm upgrade --install neuron-helm-chart oci://public.ecr.aws/neuron/neuron-helm-chart \
        --set "scheduler.enabled=true" \
        --set "scheduler.customScheduler.enabled=false" \
        --set "scheduler.defaultScheduler.enabled=true" \
        --set "npd.enabled=false"
    

This document is relevant for: Inf1, Inf2, Trn1, Trn2