System-wide Configuration

The slurm.conf file can be generated in the following link https://slurm.schedmd.com/configurator.html. Read carefully each of the options, in particular, pick linear for the resource selection.

- name: Ensure the folder /etc/slurm folder exists
  ansible.builtin.file:
    path: /etc/slurm
    state: directory
    owner: slurm
    group: slurm
    mode: "0755"

- name: Copy slurm config files
  ansible.builtin.copy:
    src: "{{ item }}"
    dest: /etc/slurm/{{ item }}
    owner: root
    group: root
    mode: u=rw,g=r,o=r
    force: true
  with_items:
    - slurm.conf

The most relevant options are, for Temple’s hardware:

ClusterName="MASTERX"
SlurmctldHost=master
AuthType=auth/munge
MpiDefault=none
ProctrackType=proctrack/cgroup
ReturnToService=0
SlurmctldPidFile=/var/run/slurmctld.pid
SlurmctldPort=6817
SlurmdPidFile=/var/run/slurmd.pid
SlurmdPort=6818
SlurmdSpoolDir=/var/spool/slurmd
SlurmUser=slurm
StateSaveLocation=/var/spool/slurmctld
SwitchType=switch/none
InactiveLimit=0
KillWait=30
MinJobAge=300
SlurmctldTimeout=120
SlurmdTimeout=300
Waittime=0
SchedulerType=sched/backfill
SlurmctldDebug=info
SlurmctldLogFile=/var/log/slurmctld.log
SlurmdDebug=info
SlurmdLogFile=/var/log/slurmd.log

NodeName=c0[1-3] RealMemory=11000  CPUs=12 Sockets=2 CoresPerSocket=6 ThreadsPerCore=1 State=UNKNOWN
PartitionName=normal Nodes=ALL Default=YES MaxTime=INFINITE State=UP