
Jobsub ID 38120.11@dunegpschedd02.fnal.gov

Jobsub ID: 38120.11@dunegpschedd02.fnal.gov
Workflow ID: 2501
Stage ID: 1
User name: pgranger@fnal.gov
HTCondor Group: group_dune
Requested:
  Processors: 1
  GPU: No
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 18000 (5 hours)
Submitted time: 2025-09-11 10:00:34
Site: US_Wisconsin
Entry: HCCHTPC_US_Wisconsin_osg01_rhel7
Last heartbeat: 2025-09-11 10:27:55
From worker node:
  Hostname: e4093
  cpuinfo: AMD EPYC 7763 64-Core Processor
  OS release: Scientific Linux release 7.9 (Nitrogen)
  Processors: 1
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 82800 (23 hours)
  GPU:
  Inner Apptainer?: True
Job state: jobscript_error
Started: 2025-09-11 10:03:14
Input files:
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74481118_149_20231201T111344Z_gen_g4_detsim_hitreco__20240507T193519Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50574832_534_20231203T125537Z_gen_g4_detsim_hitreco__20240508T051616Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74517585_1127_20231204T074243Z_gen_g4_detsim_hitreco__20240509T203139Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6374820_200_20231201T082310Z_gen_g4_detsim_hitreco__20240507T203012Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6494524_379_20231208T070130Z_gen_g4_detsim_hitreco__20240510T054750Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50620606_97_20231207T094207Z_gen_g4_detsim_hitreco__20240510T042041Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6390196_789_20231201T210741Z_gen_g4_detsim_hitreco__20240507T223703Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_66006610_535_20231202T203204Z_gen_g4_detsim_hitreco__20240508T065952Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50568178_468_20231202T171110Z_gen_g4_detsim_hitreco__20240507T204823Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50574782_405_20231203T092814Z_gen_g4_detsim_hitreco__20240509T195032Z_reco2.root
Jobscript:
  Exit code: 1
  Real time: 0m (0s)
  CPU time: 0m (0s = 0%)
  Max RSS bytes: 0 (0 MiB)
Outputting started:
Output files:
Finished: 2025-09-11 10:27:55
Saved logs: justin-logs:38120.11-dunegpschedd02.fnal.gov.logs.tgz

Jobscript log (last 10,000 characters)

0.000321122   0.00156801    0.00695809     0.0012984    0.00117714       100    
reco:opslicer:OpSlicer                                0.000185818    0.238148       1.02665      0.188586      0.192728        100    
reco:rns:RandomNumberSaver                            1.9406e-05    5.75876e-05   0.000404859   4.96075e-05   4.19922e-05      100    
reco:anglereconue:NuAngularReco                       0.00017625    0.00139361     0.0213376    0.000422797   0.00292937       100    
reco:anglereconumu:NuAngularReco                      8.6492e-05     0.0010593     0.0207432    0.000197976   0.00271788       100    
reco:anglereconuepfps:NuAngularReco                   0.000116458   0.00127681     0.0206788    0.000314925   0.00278342       100    
reco:anglereconumupfps:NuAngularReco                  0.000103154   0.00119455     0.0227834    0.000260734   0.00303851       100    
reco:anglerecohits:NuAngularReco                      0.000109745   0.00186074     0.024667     0.000692999   0.00341818       100    
[art]:TriggerResults:TriggerResultInserter             9.588e-06    1.63902e-05   7.0032e-05    1.37355e-05   9.35122e-06      100    
end_path:cafmaker:CAFMaker                            0.00228079     0.0212804     0.283489     0.00782685     0.0389189       100    
========================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 3655.5 MB
  Peak resident set size usage (VmHWM): 1978.98 MB
====================================================================================================
%MSG-s ArtException:  PostEndJob 11-Sep-2025 05:26:28 CDT ModuleEndJob
---- FileOpenError BEGIN
  ---- FatalRootError BEGIN
    Fatal Root Error: TNetXNGFile::Open
    [FATAL] Socket timeout
    ROOT severity: 3000
  ---- FatalRootError END
  
  RootInputFileSequence::initFile(): Input file root://xrootd1.esc.qmul.ac.uk:1094//dune/RSE/fardet-hd/c7/f7/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50574832_534_20231203T125537Z_gen_g4_detsim_hitreco__20240508T051616Z_reco2.root was not found or could not be opened.
---- FileOpenError END
%MSG
lar exit code 1
=== Start last 100 lines of lar log file ===
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Classifier summary: 
Output 0: 0.000875976, 0.993281, 0.00584299, 
Output 1: 0.992014, 0.00780783, 8.43234e-05, 9.41979e-05, 
Output 2: 0.998911, 0.00103929, 3.19691e-05, 1.74853e-05, 


Running Ophitfinder with InputDigiType = 'recob'
Found hits: 1238!
Begin processing the 95th record. run: 74481118 subRun: 1 event: 14995 at 11-Sep-2025 05:18:16 CDT
Boundary wire vector sizes: 1172, 980, 1127
minwire 0: 180
minwire 1: 1591
minwire 2: 142
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Classifier summary: 
Output 0: 0.992452, 0.00194354, 0.00560392, 
Output 1: 0.239868, 0.63862, 0.107344, 0.0141673, 
Output 2: 0.848044, 0.14178, 0.00956445, 0.00061178, 


Running Ophitfinder with InputDigiType = 'recob'
Found hits: 1869!
Begin processing the 96th record. run: 74481118 subRun: 1 event: 14996 at 11-Sep-2025 05:18:25 CDT
Boundary wire vector sizes: 1873, 1355, 1648
minwire 0: 1223
minwire 1: 187
minwire 2: 1119
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Classifier summary: 
Output 0: 0.997051, 0.000453488, 0.00249598, 
Output 1: 0.290483, 0.523954, 0.121952, 0.0636104, 
Output 2: 0.746923, 0.214872, 0.0337472, 0.0044577, 


Running Ophitfinder with InputDigiType = 'recob'
Found hits: 2213!
Begin processing the 97th record. run: 74481118 subRun: 1 event: 14997 at 11-Sep-2025 05:18:33 CDT
Boundary wire vector sizes: 125, 65, 107
minwire 0: 768
minwire 1: 2743
minwire 2: 225
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Classifier summary: 
Output 0: 0.0010346, 0.99396, 0.00500499, 
Output 1: 0.994343, 0.00539443, 0.000130402, 0.000131978, 
Output 2: 0.99816, 0.00177076, 4.81101e-05, 2.08691e-05, 


Running Ophitfinder with InputDigiType = 'recob'
Found hits: 756!
Begin processing the 98th record. run: 74481118 subRun: 1 event: 14998 at 11-Sep-2025 05:18:40 CDT
Boundary wire vector sizes: 38, 46, 39
minwire 0: 2079
minwire 1: 1075
minwire 2: 1707
Used alternate method to get min and max wires due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max wires due to vertex determination failure: 238, 537
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max wires due to vertex determination failure: 194, 493
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Classifier summary: 
Output 0: 0.00292746, 0.00181128, 0.995261, 
Output 1: 0.953468, 0.046039, 0.000400944, 9.2243e-05, 
Output 2: 0.99893, 0.00104337, 2.32245e-05, 3.49243e-06, 


Running Ophitfinder with InputDigiType = 'recob'
Found hits: 1451!
Begin processing the 99th record. run: 74481118 subRun: 1 event: 14999 at 11-Sep-2025 05:18:46 CDT

Running Ophitfinder with InputDigiType = 'recob'
Found hits: 708!
Begin processing the 100th record. run: 74481118 subRun: 1 event: 15000 at 11-Sep-2025 05:18:51 CDT
Boundary wire vector sizes: 103, 71, 71
minwire 0: 1936
minwire 1: 382
minwire 2: 2259
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max wires due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Classifier summary: 
Output 0: 0.944493, 0.000745605, 0.0547616, 
Output 1: 0.125497, 0.673691, 0.194623, 0.0061893, 
Output 2: 0.925648, 0.0726208, 0.00169508, 3.643e-05, 


Running Ophitfinder with InputDigiType = 'recob'
Found hits: 1013!
Art has completed and will exit with status 1.
=== End last 100 lines of lar log file ===
processed files
root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/fardet-hd/df/94/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74481118_149_20231201T111344Z_gen_g4_detsim_hitreco__20240507T193519Z_reco2.root
root://xrootd1.esc.qmul.ac.uk:1094//dune/RSE/fardet-hd/c7/f7/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50574832_534_20231203T125537Z_gen_g4_detsim_hitreco__20240508T051616Z_reco2.root
root://se1.farm.particle.cz:1094//dune/RSE/fardet-hd/b0/c0/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74517585_1127_20231204T074243Z_gen_g4_detsim_hitreco__20240509T203139Z_reco2.root
root://se1.farm.particle.cz:1094//dune/RSE/fardet-hd/e7/e5/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6374820_200_20231201T082310Z_gen_g4_detsim_hitreco__20240507T203012Z_reco2.root
root://se1.farm.particle.cz:1094//dune/RSE/fardet-hd/19/a6/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6494524_379_20231208T070130Z_gen_g4_detsim_hitreco__20240510T054750Z_reco2.root
root://otter12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/fardet-hd/e1/bc/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50620606_97_20231207T094207Z_gen_g4_detsim_hitreco__20240510T042041Z_reco2.root
root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/fardet-hd/ff/d0/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6390196_789_20231201T210741Z_gen_g4_detsim_hitreco__20240507T223703Z_reco2.root
root://ccxrootdegee.in2p3.fr:1094/pnfs/in2p3.fr/data/dune/disk/fardet-hd/3c/e9/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_66006610_535_20231202T203204Z_gen_g4_detsim_hitreco__20240508T065952Z_reco2.root
root://se1.farm.particle.cz:1094//dune/RSE/fardet-hd/96/eb/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50568178_468_20231202T171110Z_gen_g4_detsim_hitreco__20240507T204823Z_reco2.root
root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/fardet-hd/ee/8f/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50574782_405_20231203T092814Z_gen_g4_detsim_hitreco__20240509T195032Z_reco2.root
.:
total 4860
-rw-r--r-- 1 slot1_32 slot1_32 1911155 Sep 11 05:26 flatcaf.root
-rw-r--r-- 1 slot1_32 slot1_32 1452469 Sep 11 05:26 caf.root
-rw-r--r-- 1 slot1_32 slot1_32 1452469 Sep 11 05:26 caf_fd_hd_atmo_2501_20250911T100346Z.root
-rw-r--r-- 1 slot1_32 slot1_32   73507 Sep 11 05:26 caf_20250911T100346Z.log
-rw-r--r-- 1 slot1_32 slot1_32   39696 Sep 11 05:26 jobscript.log
-rw-r--r-- 1 slot1_32 slot1_32    1979 Sep 11 05:26 caf_20250911T100346Z.file
-rw-r--r-- 1 slot1_32 slot1_32    1979 Sep 11 05:26 caf_20250911T100346Z.pfns
-rw-r--r-- 1 slot1_32 slot1_32    1979 Sep 11 05:03 file.list
-rw-r--r-- 1 slot1_32 slot1_32    1979 Sep 11 05:26 justin-processed-pfns.txt
-rw-r--r-- 1 slot1_32 slot1_32    1377 Sep 11 05:03 all-input-dids.txt
-rw-r--r-- 1 slot1_32 slot1_32    1377 Sep 11 05:26 caf_20250911T100346Z.did
-rw-r--r-- 1 slot1_32 slot1_32    1377 Sep 11 05:03 did.list
-rw-r--r-- 1 slot1_32 slot1_32       0 Sep 11 05:04 debugprod.log
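
The jobscript log above reports a FileOpenError: ROOT gave "[FATAL] Socket timeout" while opening one input file over xrootd, and lar then exited with code 1. When many jobs in a workflow fail this way, it can help to pull the failing URLs out of the saved logs automatically. The following is a minimal sketch, not part of justIN or art; the function names failed_input_files and root_fatal_messages are hypothetical, and the patterns assume the message wording stays exactly as it appears in this log.

```python
import re

# Wording copied from the art exception in the log above. This is an
# assumption: other art versions may phrase the message differently.
_OPEN_FAILURE = re.compile(
    r"RootInputFileSequence::initFile\(\): Input file (\S+) "
    r"was not found or could not be opened\."
)

# ROOT messages tagged [FATAL], for example "Socket timeout".
_ROOT_FATAL = re.compile(r"\[FATAL\]\s*(.*)")


def failed_input_files(log_text):
    """Return the URLs of input files that art reported it could not open."""
    return _OPEN_FAILURE.findall(log_text)


def root_fatal_messages(log_text):
    """Return the text following each [FATAL] tag, stripped of whitespace."""
    return [msg.strip() for msg in _ROOT_FATAL.findall(log_text)]


if __name__ == "__main__":
    # Excerpt of the exception block from the jobscript log above.
    excerpt = (
        "    Fatal Root Error: TNetXNGFile::Open\n"
        "    [FATAL] Socket timeout\n"
        "  RootInputFileSequence::initFile(): Input file "
        "root://xrootd1.esc.qmul.ac.uk:1094//dune/RSE/fardet-hd/c7/f7/"
        "atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50574832_534_"
        "20231203T125537Z_gen_g4_detsim_hitreco__20240508T051616Z_reco2.root "
        "was not found or could not be opened.\n"
    )
    for url in failed_input_files(excerpt):
        print("could not open:", url)
    for msg in root_fatal_messages(excerpt):
        print("ROOT fatal:", msg)
```

Grouping the extracted URLs by hostname across several job logs would show whether the timeouts cluster on a single storage endpoint or are spread across sites.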
justIN time: 2025-09-19 05:16:24 UTC       justIN version: 01.05.00