.. _example.static: schedMain Example A: static files ==================================== .. graphviz:: digraph exampleStatic { //graph [label="NREL MatDB Data Flow", labelloc=t, fontsize=30]; rank=source; legendx [shape=none, margin=0, label=<
Task Dependencies
>]; rankdir = TB; node [color=blue, shape=box, fontsize=10]; edge [fontsize=11]; //URL="index.html"; // default for entire graph alpha0 [shape=none, margin=0, label=<
alpha.sh in aaDir
* Specified in initWork
* Starts immediately (no prereqs)
* At end writes file aaDir/alpha.status.ok
>]; beta0 [shape=none, margin=0, label=<
beta.py in bbDir0
* Specified in aaDir/alpha.postOkWork
* Starts after alpha.sh ends
* At end writes file bbDir0/beta.status.ok
>]; beta1 [shape=none, margin=0, label=<
beta.py in bbDir1
* Specified in aaDir/alpha.postOkWork
* Starts after alpha.sh ends
* At end writes file bbDir1/beta.status.ok
>]; beta2 [shape=none, margin=0, label=<
beta.py in bbDir2
* Specified in aaDir/alpha.postOkWork
* Starts after alpha.sh ends
* At end writes file bbDir2/beta.status.ok
>]; gamma0 [shape=none, margin=0, label=<
gamma.py in ccDir
* Specified in aaDir/alpha.postOkWork
* Starts after all 3 beta.py end
* At end writes file ccDir/gamma.status.ok
>]; legendx -> alpha0 [style=invis]; alpha0 -> beta0 alpha0 -> beta1 alpha0 -> beta2 beta0 -> gamma0 beta1 -> gamma0 beta2 -> gamma0 } To run this example, pick a name of some directory for testing, say testa. Then:: cp -r schedMain/example.static testa cd testa .../schedMain.py -globalDir global -ancDir . -initWork initWork -delaySec 1 -redoAll n The command line parameters are: ============= ======== =================================================== -bugLev Debug level. Typically 0, 1, or 5. -hostType System type: hostLocal or peregrine or ... -globalDir Dir containing global info, including subdir cmd -ancDir An ancestor dir of all dirs to be processed -initWork File containing the initial work list -delaySec Schedule loop delay, seconds -redoAll n/y: on restart, redo all even if prior run was ok -useReadOnly n/y: only print status; do not start tasks ============= ======== =================================================== The possible task status values are: **init** The task is on the work list but not yet started. Either it was just added to the work list, and soon will start, or it has unsatisfied prerequisites. **submit** The task has been submitted to the HPC via qsub, msub, or similar, but has not yet been recognized by the HPC. **wait** The task has been submitted to the HPC via qsub, msub, or similar, but has not yet started. **start** The task has started **ok** The task finished successfully and wrote the file taskName.status.ok. **error** The task finished but with an error. Generally the error message is in file taskName.status.error. In this example you should see output like the following. The "#" notes are mine, after the fact:: # This is the initial work list. Here schedMain has # just read the file initWork. schedMain task counts: init:1 execName jobId new status npre taskDir -------- ----- --- ------ ---- ------- alpha.sh None new init * 0 aaDir scheduleTasks: start task: alpha.sh taskDir: aaDir # After alpha.sh completes, the work list is as follows. # SchedMain noticed the file alpha.status.ok, read alpha.postOkWork, # and added the new tasks to the work list. # The "npre" column is the number of unsatisfied prerequisites. # Here gamma cannot start until the 3 betas complete. # The "*" indicates that task is ready to start. schedMain task counts: init:4 ok:1 execName jobId new status npre taskDir -------- ----- --- ------ ---- ------- alpha.sh None new ok 0 aaDir beta.py None new init * 0 bbDir0 beta.py None new init * 0 bbDir1 beta.py None new init * 0 bbDir2 gamma.py None new init 3 ccDir # Schedmain starts all the ready tasks -- the three betas. # Gamma cannot start yet since its prerequisites, in gamma.preWork, # are the betas. scheduleTasks: start task: beta.py taskDir: bbDir0 scheduleTasks: start task: beta.py taskDir: bbDir1 scheduleTasks: start task: beta.py taskDir: bbDir2 # As soon as the betas start they finish, # and finally gamma's pre-requisites are satisfied. scheduleTasks: start task: gamma.py taskDir: ccDir # All done. # If schedMain ends and some tasks have "init" status, # most likely it's because their prerequistites aren't # satisfied -- perhaps some prior task failed. schedMain task counts: ok:5 execName jobId new status npre taskDir -------- ----- --- ------ ---- ------- alpha.sh None new ok 0 aaDir beta.py None new ok 0 bbDir0 beta.py None new ok 0 bbDir1 beta.py None new ok 0 bbDir2 gamma.py None new ok 0 ccDir The "new" notation means that the task actually ran. If you start the scheduler again in this directory, with the same command as above, you will see:: schedMain task counts: ok:5 execName jobId new status npre taskDir -------- ----- --- ------ ---- ------- alpha.sh None ok 0 aaDir beta.py None ok 0 bbDir0 beta.py None ok 0 bbDir1 beta.py None ok 0 bbDir2 gamma.py None ok 0 ccDir Notice the lack of "new" flags. The scheduler found the x.status.ok file for each of the tasks and concluded the task did not need to be rerun. If you want to force the scheduler to rerun all tasks even if they completed OK, specify the command line flag ``-redoAll y``.