System-wide Configuration
The slurm.conf
file can be generated in the following link https://slurm.schedmd.com/configurator.html. Read carefully each of the options, in particular, pick linear
for the
resource selection.
- name: Ensure the folder /etc/slurm folder exists
ansible.builtin.file:
path: /etc/slurm
state: directory
owner: slurm
group: slurm
mode: "0755"
- name: Copy slurm config files
ansible.builtin.copy:
src: "{{ item }}"
dest: /etc/slurm/{{ item }}
owner: root
group: root
mode: u=rw,g=r,o=r
force: true
with_items:
- slurm.conf
The most relevant options are, for Temple’s hardware:
ClusterName="MASTERX"
SlurmctldHost=master
AuthType=auth/munge
MpiDefault=none
ProctrackType=proctrack/cgroup
ReturnToService=0
SlurmctldPidFile=/var/run/slurmctld.pid
SlurmctldPort=6817
SlurmdPidFile=/var/run/slurmd.pid
SlurmdPort=6818
SlurmdSpoolDir=/var/spool/slurmd
SlurmUser=slurm
StateSaveLocation=/var/spool/slurmctld
SwitchType=switch/none
InactiveLimit=0
KillWait=30
MinJobAge=300
SlurmctldTimeout=120
SlurmdTimeout=300
Waittime=0
SchedulerType=sched/backfill
SlurmctldDebug=info
SlurmctldLogFile=/var/log/slurmctld.log
SlurmdDebug=info
SlurmdLogFile=/var/log/slurmd.log
NodeName=c0[1-3] RealMemory=11000 CPUs=12 Sockets=2 CoresPerSocket=6 ThreadsPerCore=1 State=UNKNOWN
PartitionName=normal Nodes=ALL Default=YES MaxTime=INFINITE State=UP