Lecture VI
In this video lecture, the presenter discusses the basics of using Slurm, a cluster management and job scheduling system. Slurm is used on supercomputers like HPC Bura to manage jobs and allocate resources. The lecture explains that Slurm acts as a workload manager, assigning resources based on priority and availability. The presenter introduces various Slurm commands used for job scheduling, job management, and observing available resources. The lecture demonstrates the use of commands such as sinfo, squeue, scontrol, and snode to gather information about partitions, nodes, and running jobs.
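As a rough illustration of the query commands mentioned above, the following shell sketch collects a few of them into one helper. The function name cluster_overview is hypothetical and not from the lecture; the options shown are standard Slurm client options, but the output depends entirely on how a given cluster is configured.

```shell
#!/bin/sh
# cluster_overview: hypothetical helper that prints a quick summary of the cluster.
# It falls back to a notice when the Slurm client tools are not installed.
cluster_overview() {
    if ! command -v sinfo >/dev/null 2>&1; then
        echo "Slurm client tools not found on this machine"
        return 0
    fi

    # Partitions and the state of their nodes.
    sinfo

    # Jobs in the queue that belong to the current user.
    squeue -u "${USER:-$(id -un)}"

    # Detailed configuration of every partition.
    scontrol show partition
}

cluster_overview
```

On a login node, sinfo shows which partitions exist and whether their nodes are idle, allocated, or down, while squeue shows where a job sits in the queue.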
The lecture then turns to resources and modules. The presenter demonstrates how to request resources through a Slurm script and explains the concept of modules, which manage different versions of compilers and libraries. Modules can be loaded and unloaded to set up the environment that a particular code needs.
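A minimal sketch of what such a Slurm script might look like is given below. It is not the script from the lecture: the job name, resource values, and the module name gcc are placeholder assumptions, and a real cluster may also require a partition or account option. The sketch writes the batch script to a file called job.sh so that it can be inspected before submission.

```shell
#!/bin/sh
# Write a minimal batch script to job.sh. All values are placeholders.
cat > job.sh <<'EOF'
#!/bin/bash
#SBATCH --job-name=demo          # name shown in the queue
#SBATCH --nodes=1                # number of nodes requested
#SBATCH --ntasks=4               # number of tasks (processes)
#SBATCH --time=00:10:00          # wall-clock limit, hh:mm:ss
#SBATCH --output=demo-%j.out     # %j expands to the job ID

# Start from a clean environment, then load what the code needs.
module purge
module load gcc                  # hypothetical module name; check "module avail"

# Run one copy of the command per task.
srun hostname
EOF

echo "wrote job.sh; submit it with: sbatch job.sh"
```

The lines beginning with #SBATCH are read by Slurm as resource requests, and the rest runs as an ordinary shell script on the allocated node. In an interactive session, module avail lists the modules a site provides, module list shows what is currently loaded, and module unload removes a single module again.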