User Tools

Site Tools


services:cluster:localstorage

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
services:cluster:localstorage [2013/11/08 09:52] – created dregerservices:cluster:localstorage [2013/11/08 10:26] (current) dreger
Line 1: Line 1:
-====== Use local storage on the computer nodes ======+====== Use local storage on the compute nodes ======
  
-If many jobs write to or read from the NFS server for the cluster home at the same time, the server can get very slow and even crash. Therefore it's very important that all users try to use local storage available to the nodes if possible. In most cases this will also speed up your jobs. In order to do so you have to tell the queuing system the amount of local disk space you want to reserve for your job. The queuing system will create a directory named ''/local_scratch/$PBS_JOBID'' on the nodes. After the computation has finished you must copy the results you want to keep from the local disks back to your home directory. +If many jobs write to or read from the NFS server for the cluster home at the same time, the server can get very slow and even crash. Therefore it's very important that all users try to use local storage available to the nodes if possible. In most cases this will also speed up your jobs. In order to do so you have to tell the queuing system the amount of local disk space you want to reserve for your job. The queuing system will create a directory named ''/local_scratch/$PBS_JOBID'' on the nodes. After the computation has finished you must copy the results you want to keep from the local disks back to your home directory. Please try to copy only input and output files you really need, refrain from using ''*'' everywhere. 
- +===== Example for TORQUE aka sheldon cluster =====
-===== Exmaple for TORQUE aka sheldon cluster =====+
  
 <code> <code>
Line 36: Line 35:
 </code> </code>
  
 +Example run using this jobfile:
 +
 +<xterm>
 +dreger@sheldon:~/test-file> **ls**
 +input.dat  jobfile1
 +dreger@sheldon:~/test-file> **qsub jobfile1**
 +656781.torque.physik.fu-berlin.de
 +dreger@sheldon:~/test-file> **ls**
 +input.dat  jobfile1  local-file.e656781  local-file.o656781  result.out
 +dreger@sheldon:~/test-file> **grep ^# local-file.o656781**
 +# found local storage at /local_scratch/656781.torque.physik.fu-berlin.de. copying data from /home/dreger/local-file to /local_scratch/656781.torque.physik.fu-berlin.de.
 +# maximum file size is: 10485760
 +dreger@sheldon:~/test-file> **cat result.out**
 +aee97cb3ad288ef0add6c6b5b5fae48a  input.dat
 +</xterm>
services/cluster/localstorage.1383904347.txt.gz · Last modified: 2013/11/08 09:52 by dreger

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki