
Jobsub ID 13536.88@dunegpschedd02.fnal.gov

Jobsub ID: 13536.88@dunegpschedd02.fnal.gov
Workflow ID: 168
Stage ID: 1
User name: lwhite86@fnal.gov
HTCondor Group: group_dune
Requested:
    Processors: 1
    GPU: No
    RSS bytes: 4194304000 (4000 MiB)
    Wall seconds limit: 3600 (1 hours)
Submitted time: 2025-08-01 00:51:26
Site: US_FNAL-FermiGrid
Entry: FNAL_GPGrid_ce04_mcore_op_duneonly
Last heartbeat: 2025-08-01 01:01:58
From worker node:
    Hostname: dunegli-6423675-0-fnpc19123.fnal.gov
    cpuinfo: AMD EPYC 7502 32-Core Processor
    OS release: Scientific Linux release 7.9 (Nitrogen)
    Processors: 1
    RSS bytes: 4194304000 (4000 MiB)
    Wall seconds limit: 172800 (48 hours)
    GPU:
    Inner Apptainer?: True
Job state: finished
Started: 2025-08-01 00:54:39
Input files: fardet-hd:anue_dune10kt_1x2x6_1072_428_20230824T120338Z_gen_g4_detsim_hitreco__20240222T020458Z_reco2.root
Jobscript:
    Exit code: 0
    Real time: 6m (415s)
    CPU time: 6m (397s = 95%)
    Max RSS bytes: 1271332864 (1212 MiB)
Outputting started: 2025-08-01 01:01:35
Output files: https://fndcadoor.fnal.gov:2880/dune/scratch/users/lwhite86/fnal/00168/1/001/trainingFile_anue_dune10kt_1x2x6_1072_428_20230824T120338Z_gen_g4_detsim_hitreco__20240222T020458Z_reco2.root
Finished: 2025-08-01 01:01:58
Saved logs: justin-logs:13536.88-dunegpschedd02.fnal.gov.logs.tgz
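
The derived figures in the job summary (MiB values, hour limits, CPU percentage) can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of justIN: the helper names are hypothetical, and it assumes that MiB means bytes divided by 1024*1024 rounded down and that the CPU percentage is CPU seconds over real seconds truncated to a whole number.

```python
# Cross-check of the derived values shown in the job summary.
# Helper names are hypothetical; the rounding rules are assumptions.

MIB = 1024 * 1024  # bytes per mebibyte


def bytes_to_mib(n_bytes: int) -> int:
    """Convert a byte count to whole MiB, rounding down (assumed rule)."""
    return n_bytes // MIB


def cpu_efficiency_percent(cpu_seconds: int, real_seconds: int) -> int:
    """CPU time as a whole percentage of real time, truncated (assumed rule)."""
    return int(100 * cpu_seconds / real_seconds)


def seconds_to_hours(seconds: int) -> float:
    """Convert a wall-clock limit in seconds to hours."""
    return seconds / 3600


if __name__ == "__main__":
    print(bytes_to_mib(4194304000))          # requested RSS limit in MiB
    print(bytes_to_mib(1271332864))          # max RSS in MiB
    print(cpu_efficiency_percent(397, 415))  # CPU efficiency in percent
    print(seconds_to_hours(172800))          # worker node wall limit in hours
    print(round(1271332864 / 1e6, 2))        # max RSS in base-10 MB
```

The last line converts the max RSS to base-10 MB, the unit used by the MemoryTracker summary in the jobscript log, so the two memory figures can be compared directly.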

Jobscript log (last 10,000 characters)

-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, STATUS_CODE_FAILURE
Begin processing the 58th record. run: 1072 subRun: 1 event: 42858 at 01-Aug-2025 00:58:15 UTC
Begin processing the 59th record. run: 1072 subRun: 1 event: 42859 at 01-Aug-2025 00:58:17 UTC
Begin processing the 60th record. run: 1072 subRun: 1 event: 42860 at 01-Aug-2025 00:58:18 UTC
Begin processing the 61st record. run: 1072 subRun: 1 event: 42861 at 01-Aug-2025 00:58:20 UTC
Begin processing the 62nd record. run: 1072 subRun: 1 event: 42862 at 01-Aug-2025 00:58:22 UTC
Begin processing the 63rd record. run: 1072 subRun: 1 event: 42863 at 01-Aug-2025 00:58:24 UTC
Begin processing the 64th record. run: 1072 subRun: 1 event: 42864 at 01-Aug-2025 00:58:27 UTC
Begin processing the 65th record. run: 1072 subRun: 1 event: 42865 at 01-Aug-2025 00:58:29 UTC
Begin processing the 66th record. run: 1072 subRun: 1 event: 42866 at 01-Aug-2025 00:58:31 UTC
Begin processing the 67th record. run: 1072 subRun: 1 event: 42867 at 01-Aug-2025 00:58:32 UTC
Begin processing the 68th record. run: 1072 subRun: 1 event: 42868 at 01-Aug-2025 00:58:34 UTC
Skipping event as it does not have enough hits or associated primary particles to make a training sample
iter->second->Run() throw STATUS_CODE_FAILURE
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, STATUS_CODE_FAILURE
Begin processing the 69th record. run: 1072 subRun: 1 event: 42869 at 01-Aug-2025 00:58:35 UTC
Begin processing the 70th record. run: 1072 subRun: 1 event: 42870 at 01-Aug-2025 00:58:36 UTC
Begin processing the 71st record. run: 1072 subRun: 1 event: 42871 at 01-Aug-2025 00:58:38 UTC
Begin processing the 72nd record. run: 1072 subRun: 1 event: 42872 at 01-Aug-2025 00:58:39 UTC
Begin processing the 73rd record. run: 1072 subRun: 1 event: 42873 at 01-Aug-2025 00:58:49 UTC
Skipping event as it does not have enough hits or associated primary particles to make a training sample
iter->second->Run() throw STATUS_CODE_FAILURE
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, STATUS_CODE_FAILURE
Begin processing the 74th record. run: 1072 subRun: 1 event: 42874 at 01-Aug-2025 00:58:50 UTC
Begin processing the 75th record. run: 1072 subRun: 1 event: 42875 at 01-Aug-2025 00:58:52 UTC
Skipping event as it does not have enough hits or associated primary particles to make a training sample
iter->second->Run() throw STATUS_CODE_FAILURE
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, STATUS_CODE_FAILURE
Begin processing the 76th record. run: 1072 subRun: 1 event: 42876 at 01-Aug-2025 00:58:53 UTC
Begin processing the 77th record. run: 1072 subRun: 1 event: 42877 at 01-Aug-2025 00:58:54 UTC
Begin processing the 78th record. run: 1072 subRun: 1 event: 42878 at 01-Aug-2025 00:58:55 UTC
Begin processing the 79th record. run: 1072 subRun: 1 event: 42879 at 01-Aug-2025 00:58:56 UTC
Begin processing the 80th record. run: 1072 subRun: 1 event: 42880 at 01-Aug-2025 00:58:58 UTC
Begin processing the 81st record. run: 1072 subRun: 1 event: 42881 at 01-Aug-2025 00:58:59 UTC
Begin processing the 82nd record. run: 1072 subRun: 1 event: 42882 at 01-Aug-2025 00:59:01 UTC
Begin processing the 83rd record. run: 1072 subRun: 1 event: 42883 at 01-Aug-2025 00:59:02 UTC
Begin processing the 84th record. run: 1072 subRun: 1 event: 42884 at 01-Aug-2025 00:59:04 UTC
Begin processing the 85th record. run: 1072 subRun: 1 event: 42885 at 01-Aug-2025 00:59:05 UTC
Skipping event as it does not have enough hits or associated primary particles to make a training sample
iter->second->Run() throw STATUS_CODE_FAILURE
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, STATUS_CODE_FAILURE
Begin processing the 86th record. run: 1072 subRun: 1 event: 42886 at 01-Aug-2025 00:59:06 UTC
Begin processing the 87th record. run: 1072 subRun: 1 event: 42887 at 01-Aug-2025 00:59:09 UTC
Begin processing the 88th record. run: 1072 subRun: 1 event: 42888 at 01-Aug-2025 00:59:29 UTC
Begin processing the 89th record. run: 1072 subRun: 1 event: 42889 at 01-Aug-2025 00:59:30 UTC
Begin processing the 90th record. run: 1072 subRun: 1 event: 42890 at 01-Aug-2025 00:59:31 UTC
Begin processing the 91st record. run: 1072 subRun: 1 event: 42891 at 01-Aug-2025 00:59:33 UTC
Begin processing the 92nd record. run: 1072 subRun: 1 event: 42892 at 01-Aug-2025 00:59:34 UTC
Begin processing the 93rd record. run: 1072 subRun: 1 event: 42893 at 01-Aug-2025 00:59:36 UTC
Begin processing the 94th record. run: 1072 subRun: 1 event: 42894 at 01-Aug-2025 01:01:22 UTC
Skipping event as it does not have enough hits or associated primary particles to make a training sample
iter->second->Run() throw STATUS_CODE_FAILURE
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, STATUS_CODE_FAILURE
Begin processing the 95th record. run: 1072 subRun: 1 event: 42895 at 01-Aug-2025 01:01:23 UTC
Begin processing the 96th record. run: 1072 subRun: 1 event: 42896 at 01-Aug-2025 01:01:24 UTC
Skipping event as it does not have enough hits or associated primary particles to make a training sample
iter->second->Run() throw STATUS_CODE_FAILURE
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, STATUS_CODE_FAILURE
Begin processing the 97th record. run: 1072 subRun: 1 event: 42897 at 01-Aug-2025 01:01:26 UTC
Begin processing the 98th record. run: 1072 subRun: 1 event: 42898 at 01-Aug-2025 01:01:28 UTC
Begin processing the 99th record. run: 1072 subRun: 1 event: 42899 at 01-Aug-2025 01:01:29 UTC
Begin processing the 100th record. run: 1072 subRun: 1 event: 42900 at 01-Aug-2025 01:01:32 UTC
01-Aug-2025 01:01:34 UTC  Closed output file "anue_dune10kt_1x2x6_1072_428_20230824T120338Z_gen_g4_detsim_hitreco__20240222T020458Z_reco2_reco2.root"
01-Aug-2025 01:01:34 UTC  Closed input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-hd/0f/1c/anue_dune10kt_1x2x6_1072_428_20230824T120338Z_gen_g4_detsim_hitreco__20240222T020458Z_reco2.root"

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                     0.0330764      3.55102       105.595       1.34657       10.8836        100    
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                        0.00161567     0.0176409     0.0577481     0.0171939     0.0084661       100    
reco:pandora2:StandardPandora                  0.0279923      3.53265       105.575       1.32706       10.8835        100    
[art]:TriggerResults:TriggerResultInserter     1.013e-05    2.22746e-05   0.000209841    1.764e-05    2.29988e-05      100    
end_path:out1:RootOutput                       2.22e-06     3.60721e-06    2.676e-05     2.88e-06     2.67422e-06      100    
end_path:out1:RootOutput(write)               0.000175241   0.000530963   0.00314287    0.000286211   0.000590485      100    
================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 2262.9 MB
  Peak resident set size usage (VmHWM): 1271.33 MB
====================================================================================================
Art has completed and will exit with status 0.
lar exit code 0
total 2136
-rw-r--r-- 1 dunegli fnalgrid     214 Aug  1 00:54 all-input-dids.txt
-rw-r--r-- 1 dunegli fnalgrid  350700 Aug  1 01:01 anue_dune10kt_1x2x6_1072_428_20230824T120338Z_gen_g4_detsim_hitreco__20240222T020458Z_reco2_reco2.root
-rw-r--r-- 1 dunegli fnalgrid       0 Aug  1 00:55 debugprod.log
-rw-r--r-- 1 dunegli fnalgrid   46444 Aug  1 01:01 jobscript.log
-rw-r--r-- 1 dunegli fnalgrid     183 Aug  1 01:01 justin-processed-pfns.txt
drwxr-xr-x 4 dunegli fnalgrid      60 Aug  1 00:54 larpandoracontent
-rw-r--r-- 1 dunegli fnalgrid     519 Aug  1 01:01 reco2_hist.root
-rw-r--r-- 1 dunegli fnalgrid 1773266 Aug  1 01:01 trainingFile_anue_dune10kt_1x2x6_1072_428_20230824T120338Z_gen_g4_detsim_hitreco__20240222T020458Z_reco2.root
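
The jobscript log above interleaves "Begin processing" lines with "Skipping event" messages, so a short script can list which events were skipped. This is a minimal sketch, not a tool provided by justIN or LArSoft: the function name is hypothetical, the sample text is a few lines copied from the log above, and it assumes each "Skipping event" message refers to the most recently begun record.

```python
import re

# Matches lines such as:
# "Begin processing the 68th record. run: 1072 subRun: 1 event: 42868 at ..."
RECORD_RE = re.compile(
    r"^Begin processing the \d+(?:st|nd|rd|th) record\. "
    r"run: (\d+) subRun: (\d+) event: (\d+)"
)
SKIP_PREFIX = "Skipping event as it does not have enough hits"


def find_skipped_events(log_text: str):
    """Return (all_events, skipped_events) as lists of event numbers.

    Assumption: a skip message belongs to the last record that began
    before it. Skip messages seen before any record line are ignored.
    """
    all_events = []
    skipped = []
    current = None
    for line in log_text.splitlines():
        match = RECORD_RE.match(line)
        if match:
            current = int(match.group(3))
            all_events.append(current)
        elif line.startswith(SKIP_PREFIX) and current is not None:
            skipped.append(current)
    return all_events, skipped


# A short excerpt taken from the log above.
SAMPLE = """\
Begin processing the 68th record. run: 1072 subRun: 1 event: 42868 at 01-Aug-2025 00:58:34 UTC
Skipping event as it does not have enough hits or associated primary particles to make a training sample
iter->second->Run() throw STATUS_CODE_FAILURE
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, STATUS_CODE_FAILURE
Begin processing the 69th record. run: 1072 subRun: 1 event: 42869 at 01-Aug-2025 00:58:35 UTC
"""

if __name__ == "__main__":
    events, skipped_events = find_skipped_events(SAMPLE)
    print("events seen:", events)
    print("events skipped:", skipped_events)
```

Because the page shows only the last 10,000 characters of the log, counts taken from it cover the tail of the job only; the full jobscript.log in the saved logs archive would be needed for totals.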
justIN time: 2025-08-04 17:44:06 UTC       justIN version: 01.04.00