Incorrect Handling of Running Jobs
Example:
$ cin_seff -j 10382893 -v 1000
PARSED VARIABLES:
jobids==10382893
consumed_budget==True
node_distribution==False
verbosity==1000
Command:
sacct -P -n -a --format JobID,User,Account,State,AllocCPUS,REQMEM,TotalCPU,ElapsedRaw,MaxRSS,ExitCode,NodeList,AllocNodes,Elapsed,Partition,QOS -j 10382893
OUTPUT:
10382893|<user>|cin_sanity|RUNNING|4096|61600G|00:00.008|2557||0:0|lrdn[0001-0002,0004-0005,0008-0027,0030-0032,0036-0038,0042-0044,0046-0047,0049-0053,0055,0060,0063-0065,0067-0068,0070,0072-0075,0078-0079,0082-0086,0088-0090,0092,0094-0095,0099,0101-0102,0105,0108-0111,0113,0115-0117,0119-0121,0124-0128,0131,0134-0137,0145,0147,0152,0157,0159,0161,0163,0165,0167,0169-0172,0174,0176,0178-0180,0182-0185,0187-0189,0195-0197,0202-0203,0206-0210,0292,0305,0409]|128|00:42:37|boost_usr_prod|boost_qos_bprod
10382893.batch||cin_sanity|RUNNING|32||00:00:00|2557||0:0|lrdn0001|1|00:42:37||
10382893.extern||cin_sanity|RUNNING|4096||00:00:00|2557||0:0|lrdn[0001-0002,0004-0005,0008-0027,0030-0032,0036-0038,0042-0044,0046-0047,0049-0053,0055,0060,0063-0065,0067-0068,0070,0072-0075,0078-0079,0082-0086,0088-0090,0092,0094-0095,0099,0101-0102,0105,0108-0111,0113,0115-0117,0119-0121,0124-0128,0131,0134-0137,0145,0147,0152,0157,0159,0161,0163,0165,0167,0169-0172,0174,0176,0178-0180,0182-0185,0187-0189,0195-0197,0202-0203,0206-0210,0292,0305,0409]|128|00:42:37||
10382893.0||cin_sanity|COMPLETED|32||00:00.008|0|0|0:0|lrdn0001|1|00:00:00||
10382893.1||cin_sanity|RUNNING|4096||00:00:00|2549||0:0|lrdn[0001-0002,0004-0005,0008-0027,0030-0032,0036-0038,0042-0044,0046-0047,0049-0053,0055,0060,0063-0065,0067-0068,0070,0072-0075,0078-0079,0082-0086,0088-0090,0092,0094-0095,0099,0101-0102,0105,0108-0111,0113,0115-0117,0119-0121,0124-0128,0131,0134-0137,0145,0147,0152,0157,0159,0161,0163,0165,0167,0169-0172,0174,0176,0178-0180,0182-0185,0187-0189,0195-0197,0202-0203,0206-0210,0292,0305,0409]|128|00:42:29||
Traceback (most recent call last):
File "/leonardo/prod/opt/tools/cintools/1.0/none/bin/cin_seff", line 561, in <module>
main()
File "/leonardo/prod/opt/tools/cintools/1.0/none/bin/cin_seff", line 540, in main
raise Exception(f"Step termited correctly (exit codfe {step_exitcode}), but no MAXRSS recorded.")
Exception: Step termited correctly (exit codfe 0:0), but no MAXRSS recorded.
Edited by Daniele Di Bari