services:cluster:start
                Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| services:cluster:start [2025/02/07 14:33] – [Slurm documentation] drop link to outdated GPU documentation behrmj87 | services:cluster:start [2025/07/21 10:50] (current) – hoffmac00 | ||
|---|---|---|---|
| Line 8: | Line 8: | ||
| - Your ZEDAT account username | - Your ZEDAT account username | ||
| - | - The group you are using the system for (e.g. ag-netz,ag-imhof,...) | + | - The group you are using the system for (e.g. AG Netz, AG Eisert, AG Franke…) | 
| - | - The software you are using for your simulations | + | - The software you are using for your numerics | 
| - Software that you happen to know so well that other HPC users within the department may ask you for help. | - Software that you happen to know so well that other HPC users within the department may ask you for help. | ||
| - | - A self-contained example job that is typical for the workload you will be using the HPC systems for, ideally **with a small README** describing how to run it **and a job script**. If possible scale it so it runs between a few minutes and an hour at maximum. | + | - A self-contained example job that is typical for the workload you will be using the HPC systems for. | 
| - | - If you are no longer a member of the physics department, we would like to get an estimate on how much longer you will need access to the systems (e.g. to finish some paper) | + | - If you are no longer a member of the physics department, we would like to get an estimate on how much longer you will need access to the systems (e.g. to finish some paper). | 
| + | The example must contain: | ||
| + | |||
| + | - A small README | ||
| + | - a Slurm job script, and | ||
| + | - the program that is run in the example and/or all input files needed to run it, this includes data files and definitions for the environment the job is to run in (e.g. a '' | ||
| + | |||
| + | If possible: | ||
| + | |||
| + | - The example should have an option to scale it so it runs between a few minutes and an hour at maximum, so that it can be used for benchmarking. | ||
| + | |||
| + | If you can't answer the questions for your example, these steps can help you answer them | ||
| + | |||
| + | - If you have written the code yourself, what dependecies does it have (e.g. Python libraries you import)? | ||
| + | - How long does your example run? | ||
| + | - How many CPUs and how much memory does the example need? | ||
| + | - Can the  example' | ||
| ===== Slurm documentation ===== | ===== Slurm documentation ===== | ||
| - | * [[important|Important notes]] on cluster | + | Read this for an introduction to Slurm queuing system, if you haven' | 
| * Start with the [[slurm|Introduction to the Slurm HPC cluster]]. | * Start with the [[slurm|Introduction to the Slurm HPC cluster]]. | ||
| + | |||
| + | Read this for some important notes on the specifics of our clusters. | ||
| + | |||
| + | * [[important|Important notes]] on cluster usage | ||
| + | |||
| + | These are more specialised topics: | ||
| + | |||
| * Using [[interactivesessions|interactive sessions]] with the queuing system. | * Using [[interactivesessions|interactive sessions]] with the queuing system. | ||
| * Here is a [[nodes|list of special nodes]] that are currently not part of slurm. | * Here is a [[nodes|list of special nodes]] that are currently not part of slurm. | ||
| Line 29: | Line 53: | ||
| ^ Hosts ^ Nodes ^ Cores/Node ^ RAM/Core ^ RAM/Node ^ CPU features ^ GPU ^ on-GPU RAM ^ #Cores ^ #RAM ^ #GPU ^ | ^ Hosts ^ Nodes ^ Cores/Node ^ RAM/Core ^ RAM/Node ^ CPU features ^ GPU ^ on-GPU RAM ^ #Cores ^ #RAM ^ #GPU ^ | ||
| - | | @#cfc:** sheldon cluster** - FB Physik - Location: Takustraße 7 - OS: Debian/ | + | | @#cfc:** sheldon-ng cluster** - FB Physik - Location: Takustraße 7 - OS: Debian/ | 
| | @# | | @# | ||
| | @# | | @# | ||
| | @# | | @# | ||
| | @# | | @# | ||
| - | | @# | + | | @# | 
| - | | @# | + | | @# | 
| | @# | | @# | ||
| - | | @# | + | | @# | 
| - | | @# | + | | @# | 
| | @# | | @# | ||
| | @# | | @# | ||
| - | | @# | + | | @# | 
| - | | @# | + | | @# | 
| | @# | | @# | ||
| - | | @# | + | | @# | 
| | @# | | @# | ||
| - | | @# | + | | @# | 
| - | (07.02.2025) | + | (21.07.2025) | 
| {{: | {{: | ||
| {{: | {{: | ||
services/cluster/start.1738938810.txt.gz · Last modified: 2025/02/07 14:33 by behrmj87
                
                