Jobsub ID 46668.79@dunegpschedd01.fnal.gov

Jobsub ID: 46668.79@dunegpschedd01.fnal.gov
Workflow ID: 2501
Stage ID: 1
User name: pgranger@fnal.gov
HTCondor Group: group_dune
Requested:
  Processors: 1
  GPU: No
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 18000 (5 hours)
Submitted time: 2025-09-15 14:29:46
Site: BR_CBPF
Entry: DUNE_BR_CBPF_ce01
Last heartbeat: 2025-09-15 17:45:52
From worker node:
  Hostname: wn39
  cpuinfo: Intel(R) Xeon(R) CPU E5-2640 v4 @ 2.40GHz
  OS release: Scientific Linux release 7.9 (Nitrogen)
  Processors: 1
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 257400 (71 hours)
  GPU:
  Inner Apptainer?: True
Job state: jobscript_error
Started: 2025-09-15 14:30:51
Input files:
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50624901_339_20231207T124441Z_gen_g4_detsim_hitreco__20240510T044316Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74481119_460_20231201T120833Z_gen_g4_detsim_hitreco__20240507T194144Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74568471_590_20231207T160915Z_gen_g4_detsim_hitreco__20240510T031655Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50231162_285_20231118T190608Z_gen_g4_detsim_hitreco__20240503T052221Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74553259_277_20231207T132251Z_gen_g4_detsim_hitreco__20240510T063201Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50515286_377_20231201T144647Z_gen_g4_detsim_hitreco__20240507T190510Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6472771_212_20231208T085416Z_gen_g4_detsim_hitreco__20240510T061915Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74574126_684_20231208T025557Z_gen_g4_detsim_hitreco__20240510T041414Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74505548_172_20231202T141048Z_gen_g4_detsim_hitreco__20240508T055416Z_reco2.root
fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6472771_166_20231208T085219Z_gen_g4_detsim_hitreco__20240510T061116Z_reco2.root
Jobscript:
  Exit code: 1
  Real time: 0m (0s)
  CPU time: 0m (0s = 0%)
  Max RSS bytes: 0 (0 MiB)
Outputting started:
Output files:
Finished: 2025-09-15 17:45:52
Saved logs: justin-logs:46668.79-dunegpschedd01.fnal.gov.logs.tgz

Jobscript log (last 10,000 characters)

gularReco                       0.000187532   0.00309207     0.145796     0.000664923    0.0096287       700    
reco:anglereconumu:NuAngularReco                      0.00010485    0.00267402     0.140784     0.000401195   0.00933388       700    
reco:anglereconuepfps:NuAngularReco                   0.000156849   0.00290278     0.144992     0.000574092   0.00953575       700    
reco:anglereconumupfps:NuAngularReco                  0.000143299   0.00283797     0.147703     0.000528925   0.00953126       700    
reco:anglerecohits:NuAngularReco                      0.000165143   0.00385857     0.148563     0.00124581     0.0103433       700    
[art]:TriggerResults:TriggerResultInserter            1.2623e-05    2.49195e-05   8.4596e-05    2.4014e-05    6.33046e-06      700    
end_path:cafmaker:CAFMaker                            0.00174498     0.0383625      1.07511     0.00939687     0.102896        700    
========================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 3821.5 MB
  Peak resident set size usage (VmHWM): 2151.44 MB
====================================================================================================
%MSG-s ArtException:  PostEndJob 15-Sep-2025 14:41:37 -03 ModuleEndJob
---- FileOpenError BEGIN
  ---- FatalRootError BEGIN
    Fatal Root Error: TNetXNGFile::Open
    [FATAL] Socket timeout
    ROOT severity: 3000
  ---- FatalRootError END
  
  RootInputFileSequence::initFile(): Input file root://xrootd1.esc.qmul.ac.uk:1094//dune/RSE/fardet-hd/bf/3e/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6472771_212_20231208T085416Z_gen_g4_detsim_hitreco__20240510T061915Z_reco2.root was not found or could not be opened.
---- FileOpenError END
%MSG
lar exit code 1
=== Start last 100 lines of lar log file ===
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Classifier summary: 
Output 0: 0.00577696, 0.371774, 0.622449, 
Output 1: 0.0258417, 0.964779, 0.00909616, 0.000282971, 
Output 2: 0.989419, 0.0102263, 0.00033846, 1.6327e-05, 


Running Ophitfinder with InputDigiType = 'recob'
Found hits: 1897!
Begin processing the 694th record. run: 50515286 subRun: 1 event: 37794 at 15-Sep-2025 14:31:50 -03

Running Ophitfinder with InputDigiType = 'recob'
Found hits: 290!
Begin processing the 695th record. run: 50515286 subRun: 1 event: 37795 at 15-Sep-2025 14:32:00 -03
Boundary wire vector sizes: 4137, 3801, 3591
minwire 0: 66
minwire 1: 1725
minwire 2: 0
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Classifier summary: 
Output 0: 0.819805, 0.0350133, 0.145182, 
Output 1: 0.213777, 0.376065, 0.227999, 0.182159, 
Output 2: 0.0275836, 0.0898653, 0.264257, 0.618294, 


Running Ophitfinder with InputDigiType = 'recob'
Found hits: 2887!
Begin processing the 696th record. run: 50515286 subRun: 1 event: 37796 at 15-Sep-2025 14:33:07 -03
PandoraContentApi::GetList(*this, m_inputHitListName, pCaloHitList) return STATUS_CODE_NOT_INITIALIZED
    in function: GetVolumeIdToHitListMap
    in file:     /scratch/workspace/build-larsoft/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/ALMA9/build/larpandoracontent/v04_16_00-buildFW/src/larpandoracontent/LArControlFlow/MasterAlgorithm.cc line#: 271
this->GetVolumeIdToHitListMap(volumeIdToHitListMap) return STATUS_CODE_NOT_INITIALIZED
    in function: Run
    in file:     /scratch/workspace/build-larsoft/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/ALMA9/build/larpandoracontent/v04_16_00-buildFW/src/larpandoracontent/LArControlFlow/MasterAlgorithm.cc line#: 165
iter->second->Run() throw STATUS_CODE_NOT_INITIALIZED
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0003, LArDLMaster, STATUS_CODE_NOT_INITIALIZED

Running Ophitfinder with InputDigiType = 'recob'
Found hits: 18!
Begin processing the 697th record. run: 50515286 subRun: 1 event: 37797 at 15-Sep-2025 14:33:15 -03
Boundary wire vector sizes: 129, 139, 53
minwire 0: 1682
minwire 1: 2028
minwire 2: 1224
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max wires due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Classifier summary: 
Output 0: 0.99148, 0.000711257, 0.00780853, 
Output 1: 0.00989847, 0.945254, 0.0437511, 0.00109667, 
Output 2: 0.988718, 0.0109702, 0.000305552, 6.30861e-06, 


Running Ophitfinder with InputDigiType = 'recob'
Found hits: 1996!
Begin processing the 698th record. run: 50515286 subRun: 1 event: 37798 at 15-Sep-2025 14:33:28 -03

Running Ophitfinder with InputDigiType = 'recob'
Found hits: 89!
Begin processing the 699th record. run: 50515286 subRun: 1 event: 37799 at 15-Sep-2025 14:33:39 -03
Boundary wire vector sizes: 336, 341, 84
minwire 0: 365
minwire 1: 2749
minwire 2: 0
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Classifier summary: 
Output 0: 0.997373, 0.00047492, 0.00215227, 
Output 1: 0.00768722, 0.720571, 0.265448, 0.00629449, 
Output 2: 0.984989, 0.0145314, 0.000472168, 7.34047e-06, 


Running Ophitfinder with InputDigiType = 'recob'
Found hits: 1225!
Begin processing the 700th record. run: 50515286 subRun: 1 event: 37800 at 15-Sep-2025 14:33:51 -03
Boundary wire vector sizes: 65, 61, 56
minwire 0: 1961
minwire 1: 562
minwire 2: 1686
Used alternate method to get min and max wires due to vertex determination failure: 528, 827
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max wires due to vertex determination failure: 433, 732
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Used alternate method to get min and max wires due to vertex determination failure: 518, 817
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 299
Classifier summary: 
Output 0: 0.00360961, 0.00234296, 0.994047, 
Output 1: 0.869538, 0.129491, 0.000861451, 0.000109317, 
Output 2: 0.998689, 0.00127384, 3.37838e-05, 3.53377e-06, 


Running Ophitfinder with InputDigiType = 'recob'
Found hits: 1573!
Art has completed and will exit with status 1.
=== End last 100 lines of lar log file ===
processed files
root://se1.farm.particle.cz:1094//dune/RSE/fardet-hd/0c/a4/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50624901_339_20231207T124441Z_gen_g4_detsim_hitreco__20240510T044316Z_reco2.root
root://se1.farm.particle.cz:1094//dune/RSE/fardet-hd/24/23/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74481119_460_20231201T120833Z_gen_g4_detsim_hitreco__20240507T194144Z_reco2.root
root://se1.farm.particle.cz:1094//dune/RSE/fardet-hd/71/e6/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74568471_590_20231207T160915Z_gen_g4_detsim_hitreco__20240510T031655Z_reco2.root
root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/fardet-hd/01/5e/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50231162_285_20231118T190608Z_gen_g4_detsim_hitreco__20240503T052221Z_reco2.root
root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/fardet-hd/0d/05/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74553259_277_20231207T132251Z_gen_g4_detsim_hitreco__20240510T063201Z_reco2.root
root://se1.farm.particle.cz:1094//dune/RSE/fardet-hd/25/05/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50515286_377_20231201T144647Z_gen_g4_detsim_hitreco__20240507T190510Z_reco2.root
root://xrootd1.esc.qmul.ac.uk:1094//dune/RSE/fardet-hd/bf/3e/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6472771_212_20231208T085416Z_gen_g4_detsim_hitreco__20240510T061915Z_reco2.root
root://se1.farm.particle.cz:1094//dune/RSE/fardet-hd/be/00/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74574126_684_20231208T025557Z_gen_g4_detsim_hitreco__20240510T041414Z_reco2.root
root://se1.farm.particle.cz:1094//dune/RSE/fardet-hd/a0/10/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74505548_172_20231202T141048Z_gen_g4_detsim_hitreco__20240508T055416Z_reco2.root
root://se1.farm.particle.cz:1094//dune/RSE/fardet-hd/f0/e1/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6472771_166_20231208T085219Z_gen_g4_detsim_hitreco__20240510T061116Z_reco2.root
.:
total 33584
-rw-r--r-- 1 nobody nobody 12620552 Sep 15 14:41 flatcaf.root
-rw-r--r-- 1 nobody nobody 10566669 Sep 15 14:41 caf.root
-rw-r--r-- 1 nobody nobody 10566669 Sep 15 14:41 caf_fd_hd_atmo_2501_20250915T143103Z.root
-rw-r--r-- 1 nobody nobody   417117 Sep 15 14:41 caf_20250915T143103Z.log
-rw-r--r-- 1 nobody nobody   164219 Sep 15 14:41 jobscript.log
-rw-r--r-- 1 nobody nobody     1898 Sep 15 14:41 caf_20250915T143103Z.file
-rw-r--r-- 1 nobody nobody     1898 Sep 15 14:41 caf_20250915T143103Z.pfns
-rw-r--r-- 1 nobody nobody     1898 Sep 15 11:31 file.list
-rw-r--r-- 1 nobody nobody     1898 Sep 15 14:41 justin-processed-pfns.txt
-rw-r--r-- 1 nobody nobody     1378 Sep 15 11:31 all-input-dids.txt
-rw-r--r-- 1 nobody nobody     1378 Sep 15 14:41 caf_20250915T143103Z.did
-rw-r--r-- 1 nobody nobody     1378 Sep 15 11:31 did.list
-rw-r--r-- 1 nobody nobody        0 Sep 15 11:31 debugprod.log
justIN time: 2025-09-18 15:58:52 UTC       justIN version: 01.05.00