justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 49522.26@dunegpschedd01.fnal.gov

Jobsub ID49522.26@dunegpschedd01.fnal.gov
Workflow ID2979
Stage ID1
User namelwhite86@fnal.gov
HTCondor Groupgroup_dune
RequestedProcessors1
GPUNo
RSS bytes4194304000 (4000 MiB)
Wall seconds limit3600 (1 hours)
Submitted time2025-09-19 08:41:29
SiteUS_Wisconsin
EntryHCCHTPC_US_Wisconsin_osg01_rhel7
Last heartbeat2025-09-19 10:19:26
From worker nodeHostnamee4022
cpuinfoAMD EPYC 7763 64-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit82800 (23 hours)
GPU
Inner Apptainer?True
Job statefinished
Started2025-09-19 10:10:22
Input filesfardet-hd:nue_dune10kt_1x2x6_1124_280_20230828T133414Z_gen_g4_detsim_hitreco__20240221T081310Z_reco2.root
JobscriptExit code0
Real time8m (523s)
CPU time5m (320s = 61%)
Max RSS bytes1323212800 (1261 MiB)
Outputting started2025-09-19 10:19:06
Output fileshttps://fndcadoor.fnal.gov:2880/dune/scratch/users/lwhite86/fnal/02979/1/001/trackShowerCountingValidation_nue_dune10kt_1x2x6_1124_280_20230828T133414Z_gen_g4_detsim_hitreco__20240221T081310Z_reco2.root
Finished2025-09-19 10:19:26
Saved logsjustin-logs:49522.26-dunegpschedd01.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

le:     /exp/dune/app/users/lwhite86/DUNE-FD/pandoraEventClassification/srcs/larpandoracontent/larpandoradlcontent/LArEventClassification/CNNTrackShowerCountingValidationAlgorithm.cc line#: 78
iter->second->Run() throw STATUS_CODE_NOT_INITIALIZED
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0005, LArCNNTrackShowerCountingValidation, STATUS_CODE_NOT_INITIALIZED
Begin processing the 57th record. run: 1124 subRun: 1 event: 28057 at 19-Sep-2025 05:16:32 CDT
Begin processing the 58th record. run: 1124 subRun: 1 event: 28058 at 19-Sep-2025 05:16:35 CDT
Begin processing the 59th record. run: 1124 subRun: 1 event: 28059 at 19-Sep-2025 05:16:38 CDT
Begin processing the 60th record. run: 1124 subRun: 1 event: 28060 at 19-Sep-2025 05:16:41 CDT
Begin processing the 61st record. run: 1124 subRun: 1 event: 28061 at 19-Sep-2025 05:16:45 CDT
Begin processing the 62nd record. run: 1124 subRun: 1 event: 28062 at 19-Sep-2025 05:16:48 CDT
Begin processing the 63rd record. run: 1124 subRun: 1 event: 28063 at 19-Sep-2025 05:16:53 CDT
Skipping event as it does not have enough hits or associated primary particles to make a training sample
iter->second->Run() throw STATUS_CODE_FAILURE
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0005, LArCNNTrackShowerCountingValidation, STATUS_CODE_FAILURE
Begin processing the 64th record. run: 1124 subRun: 1 event: 28064 at 19-Sep-2025 05:16:55 CDT
Begin processing the 65th record. run: 1124 subRun: 1 event: 28065 at 19-Sep-2025 05:16:59 CDT
Begin processing the 66th record. run: 1124 subRun: 1 event: 28066 at 19-Sep-2025 05:17:03 CDT
Begin processing the 67th record. run: 1124 subRun: 1 event: 28067 at 19-Sep-2025 05:17:06 CDT
Begin processing the 68th record. run: 1124 subRun: 1 event: 28068 at 19-Sep-2025 05:17:12 CDT
Begin processing the 69th record. run: 1124 subRun: 1 event: 28069 at 19-Sep-2025 05:17:15 CDT
Begin processing the 70th record. run: 1124 subRun: 1 event: 28070 at 19-Sep-2025 05:17:19 CDT
Begin processing the 71st record. run: 1124 subRun: 1 event: 28071 at 19-Sep-2025 05:17:22 CDT
Begin processing the 72nd record. run: 1124 subRun: 1 event: 28072 at 19-Sep-2025 05:17:25 CDT
Begin processing the 73rd record. run: 1124 subRun: 1 event: 28073 at 19-Sep-2025 05:17:34 CDT
Begin processing the 74th record. run: 1124 subRun: 1 event: 28074 at 19-Sep-2025 05:17:37 CDT
Begin processing the 75th record. run: 1124 subRun: 1 event: 28075 at 19-Sep-2025 05:17:41 CDT
Begin processing the 76th record. run: 1124 subRun: 1 event: 28076 at 19-Sep-2025 05:17:44 CDT
Begin processing the 77th record. run: 1124 subRun: 1 event: 28077 at 19-Sep-2025 05:17:47 CDT
Begin processing the 78th record. run: 1124 subRun: 1 event: 28078 at 19-Sep-2025 05:17:50 CDT
Begin processing the 79th record. run: 1124 subRun: 1 event: 28079 at 19-Sep-2025 05:17:54 CDT
Begin processing the 80th record. run: 1124 subRun: 1 event: 28080 at 19-Sep-2025 05:17:57 CDT
Begin processing the 81st record. run: 1124 subRun: 1 event: 28081 at 19-Sep-2025 05:18:01 CDT
Begin processing the 82nd record. run: 1124 subRun: 1 event: 28082 at 19-Sep-2025 05:18:05 CDT
Begin processing the 83rd record. run: 1124 subRun: 1 event: 28083 at 19-Sep-2025 05:18:08 CDT
Begin processing the 84th record. run: 1124 subRun: 1 event: 28084 at 19-Sep-2025 05:18:12 CDT
Begin processing the 85th record. run: 1124 subRun: 1 event: 28085 at 19-Sep-2025 05:18:15 CDT
Begin processing the 86th record. run: 1124 subRun: 1 event: 28086 at 19-Sep-2025 05:18:18 CDT
Begin processing the 87th record. run: 1124 subRun: 1 event: 28087 at 19-Sep-2025 05:18:21 CDT
Begin processing the 88th record. run: 1124 subRun: 1 event: 28088 at 19-Sep-2025 05:18:24 CDT
Begin processing the 89th record. run: 1124 subRun: 1 event: 28089 at 19-Sep-2025 05:18:30 CDT
Skipping event as it does not have enough hits or associated primary particles to make a training sample
iter->second->Run() throw STATUS_CODE_FAILURE
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0005, LArCNNTrackShowerCountingValidation, STATUS_CODE_FAILURE
Begin processing the 90th record. run: 1124 subRun: 1 event: 28090 at 19-Sep-2025 05:18:32 CDT
Begin processing the 91st record. run: 1124 subRun: 1 event: 28091 at 19-Sep-2025 05:18:35 CDT
Begin processing the 92nd record. run: 1124 subRun: 1 event: 28092 at 19-Sep-2025 05:18:38 CDT
Begin processing the 93rd record. run: 1124 subRun: 1 event: 28093 at 19-Sep-2025 05:18:41 CDT
Begin processing the 94th record. run: 1124 subRun: 1 event: 28094 at 19-Sep-2025 05:18:44 CDT
Begin processing the 95th record. run: 1124 subRun: 1 event: 28095 at 19-Sep-2025 05:18:47 CDT
Begin processing the 96th record. run: 1124 subRun: 1 event: 28096 at 19-Sep-2025 05:18:51 CDT
Begin processing the 97th record. run: 1124 subRun: 1 event: 28097 at 19-Sep-2025 05:18:54 CDT
Skipping event as it does not have enough hits or associated primary particles to make a training sample
iter->second->Run() throw STATUS_CODE_FAILURE
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0005, LArCNNTrackShowerCountingValidation, STATUS_CODE_FAILURE
Begin processing the 98th record. run: 1124 subRun: 1 event: 28098 at 19-Sep-2025 05:18:57 CDT
Begin processing the 99th record. run: 1124 subRun: 1 event: 28099 at 19-Sep-2025 05:19:00 CDT
Begin processing the 100th record. run: 1124 subRun: 1 event: 28100 at 19-Sep-2025 05:19:03 CDT
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, unknown exception
PandoraContentApi::GetList(*this, m_inputPfoListName, pPfoList) return STATUS_CODE_NOT_INITIALIZED
    in function: Run
    in file:     /exp/dune/app/users/lwhite86/DUNE-FD/pandoraEventClassification/srcs/larpandoracontent/larpandoradlcontent/LArEventClassification/CNNTrackShowerCountingValidationAlgorithm.cc line#: 78
iter->second->Run() throw STATUS_CODE_NOT_INITIALIZED
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0005, LArCNNTrackShowerCountingValidation, STATUS_CODE_NOT_INITIALIZED
19-Sep-2025 05:19:05 CDT  Closed output file "nue_dune10kt_1x2x6_1124_280_20230828T133414Z_gen_g4_detsim_hitreco__20240221T081310Z_reco2_reco2.root"
19-Sep-2025 05:19:06 CDT  Closed input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-hd/ad/9f/nue_dune10kt_1x2x6_1124_280_20230828T133414Z_gen_g4_detsim_hitreco__20240221T081310Z_reco2.root"

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                      1.22464       2.96405       7.69546       2.82065      0.821046        100    
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                         0.0274572     0.0581113     0.346354      0.0527514     0.0354716       100    
reco:pandora2:StandardPandora                   1.16758       2.89846       7.6132        2.74859      0.817956        100    
[art]:TriggerResults:TriggerResultInserter     1.058e-05    1.90486e-05   8.7213e-05     1.59e-05     1.04079e-05      100    
end_path:out1:RootOutput                       2.605e-06    4.3874e-06    2.9566e-05    3.5115e-06    3.02035e-06      100    
end_path:out1:RootOutput(write)               0.000918153   0.00726174     0.0282048    0.00597583    0.00546798       100    
================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 2307.13 MB
  Peak resident set size usage (VmHWM): 1323.21 MB
====================================================================================================
Art has completed and will exit with status 0.
lar exit code 0
total 1053472
-rw-r--r-- 1 slot1_36 slot1_36        212 Sep 19 05:10 all-input-dids.txt
-rw-r--r-- 1 slot1_36 slot1_36          0 Sep 19 05:10 debugprod.log
-rw-r--r-- 1 slot1_36 slot1_36      47221 Sep 19 05:19 jobscript.log
-rw-r--r-- 1 slot1_36 slot1_36        182 Sep 19 05:19 justin-processed-pfns.txt
drwxr-xr-x 4 slot1_36 slot1_36       4096 Sep 19 05:10 larpandoracontent
-rw-r--r-- 1 slot1_36 slot1_36 1078665893 Sep 19 05:19 nue_dune10kt_1x2x6_1124_280_20230828T133414Z_gen_g4_detsim_hitreco__20240221T081310Z_reco2_reco2.root
-rw-r--r-- 1 slot1_36 slot1_36        519 Sep 19 05:19 reco2_hist.root
-rw-r--r-- 1 slot1_36 slot1_36      11980 Sep 19 05:19 trackShowerCountingValidation_nue_dune10kt_1x2x6_1124_280_20230828T133414Z_gen_g4_detsim_hitreco__20240221T081310Z_reco2.root
justIN time: 2025-09-19 13:07:27 UTC       justIN version: 01.05.00