Jobsub ID 20031.33@dunegpschedd01.fnal.gov

Jobsub ID: 20031.33@dunegpschedd01.fnal.gov
Workflow ID: 168
Stage ID: 1
User name: lwhite86@fnal.gov
HTCondor Group: group_dune
Requested:
  Processors: 1
  GPU: No
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 3600 (1 hours)
Submitted time: 2025-08-01 00:47:26
Site: US_FNAL-FermiGrid
Entry: FNAL_GPGrid_ce03_mcore_op_duneonly
Last heartbeat: 2025-08-01 01:02:51
From worker node:
  Hostname: dunegli-6250846-0-fnpc9054.fnal.gov
  cpuinfo: Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz
  OS release: Scientific Linux release 7.9 (Nitrogen)
  Processors: 1
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 172800 (48 hours)
  GPU:
  Inner Apptainer?: True
Job state: finished
Started: 2025-08-01 00:55:42
Input files: fardet-hd:anue_dune10kt_1x2x6_1095_830_20230825T091829Z_gen_g4_detsim_hitreco__20240221T091323Z_reco2.root
Jobscript:
  Exit code: 0
  Real time: 7m (421s)
  CPU time: 6m (407s = 96%)
  Max RSS bytes: 1281748992 (1222 MiB)
Outputting started: 2025-08-01 01:02:43
Output files: https://fndcadoor.fnal.gov:2880/dune/scratch/users/lwhite86/fnal/00168/1/001/trainingFile_anue_dune10kt_1x2x6_1095_830_20230825T091829Z_gen_g4_detsim_hitreco__20240221T091323Z_reco2.root
Finished: 2025-08-01 01:02:51
Saved logs: justin-logs:20031.33-dunegpschedd01.fnal.gov.logs.tgz
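
The derived figures in the summary above can be cross-checked with simple arithmetic on the raw values. The following is a minimal sketch, not justIN's own code: the helper names are hypothetical, and it assumes the page truncates rather than rounds (407/421 is about 96.7%, yet 96% is shown). For this job, the interval between "Started" and "Outputting started" also comes out at the reported real time of 421 s; that is an observation about this job, not a documented definition.

```python
from datetime import datetime

MIB = 1024 * 1024                  # bytes per MiB (base 2)
FMT = "%Y-%m-%d %H:%M:%S"          # timestamp layout used in the job summary


def to_mib(n_bytes: int) -> int:
    """Bytes to whole MiB, truncating (assumed display behaviour)."""
    return n_bytes // MIB


def to_decimal_mb(n_bytes: int) -> float:
    """Bytes to base-10 MB, the unit the MemoryTracker summary says it uses."""
    return round(n_bytes / 1e6, 2)


def cpu_efficiency_percent(cpu_seconds: int, real_seconds: int) -> int:
    """CPU time as a whole percentage of real time, truncating (assumed)."""
    return (100 * cpu_seconds) // real_seconds


def seconds_between(start: str, end: str) -> int:
    """Whole seconds elapsed between two summary timestamps."""
    delta = datetime.strptime(end, FMT) - datetime.strptime(start, FMT)
    return int(delta.total_seconds())


requested_mib = to_mib(4194304000)              # requested RSS limit
max_rss_mib = to_mib(1281748992)                # jobscript Max RSS
max_rss_mb = to_decimal_mb(1281748992)          # compare with VmHWM in the log
efficiency = cpu_efficiency_percent(407, 421)   # CPU time / real time
run_seconds = seconds_between("2025-08-01 00:55:42", "2025-08-01 01:02:43")
queue_seconds = seconds_between("2025-08-01 00:47:26", "2025-08-01 00:55:42")

print(requested_mib, max_rss_mib, max_rss_mb, efficiency, run_seconds, queue_seconds)
```

The same Max RSS value therefore appears as 1222 MiB in the summary and as 1281.75 MB in the MemoryTracker output; the two differ only in the unit base.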

Jobscript log (last 10,000 characters)

Begin processing the 45th record. run: 1095 subRun: 1 event: 83045 at 01-Aug-2025 00:59:18 UTC
Begin processing the 46th record. run: 1095 subRun: 1 event: 83046 at 01-Aug-2025 00:59:30 UTC
Begin processing the 47th record. run: 1095 subRun: 1 event: 83047 at 01-Aug-2025 00:59:35 UTC
Begin processing the 48th record. run: 1095 subRun: 1 event: 83048 at 01-Aug-2025 00:59:46 UTC
Begin processing the 49th record. run: 1095 subRun: 1 event: 83049 at 01-Aug-2025 00:59:49 UTC
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, STATUS_CODE_NOT_INITIALIZED
Begin processing the 50th record. run: 1095 subRun: 1 event: 83050 at 01-Aug-2025 00:59:51 UTC
Begin processing the 51st record. run: 1095 subRun: 1 event: 83051 at 01-Aug-2025 00:59:54 UTC
Begin processing the 52nd record. run: 1095 subRun: 1 event: 83052 at 01-Aug-2025 00:59:57 UTC
Begin processing the 53rd record. run: 1095 subRun: 1 event: 83053 at 01-Aug-2025 01:00:01 UTC
Begin processing the 54th record. run: 1095 subRun: 1 event: 83054 at 01-Aug-2025 01:00:04 UTC
Begin processing the 55th record. run: 1095 subRun: 1 event: 83055 at 01-Aug-2025 01:00:05 UTC
Begin processing the 56th record. run: 1095 subRun: 1 event: 83056 at 01-Aug-2025 01:00:20 UTC
Begin processing the 57th record. run: 1095 subRun: 1 event: 83057 at 01-Aug-2025 01:00:22 UTC
Begin processing the 58th record. run: 1095 subRun: 1 event: 83058 at 01-Aug-2025 01:00:24 UTC
Begin processing the 59th record. run: 1095 subRun: 1 event: 83059 at 01-Aug-2025 01:00:26 UTC
Begin processing the 60th record. run: 1095 subRun: 1 event: 83060 at 01-Aug-2025 01:00:28 UTC
Begin processing the 61st record. run: 1095 subRun: 1 event: 83061 at 01-Aug-2025 01:00:30 UTC
Begin processing the 62nd record. run: 1095 subRun: 1 event: 83062 at 01-Aug-2025 01:00:32 UTC
Begin processing the 63rd record. run: 1095 subRun: 1 event: 83063 at 01-Aug-2025 01:00:34 UTC
Begin processing the 64th record. run: 1095 subRun: 1 event: 83064 at 01-Aug-2025 01:00:39 UTC
Begin processing the 65th record. run: 1095 subRun: 1 event: 83065 at 01-Aug-2025 01:00:41 UTC
Begin processing the 66th record. run: 1095 subRun: 1 event: 83066 at 01-Aug-2025 01:00:43 UTC
Begin processing the 67th record. run: 1095 subRun: 1 event: 83067 at 01-Aug-2025 01:00:49 UTC
Begin processing the 68th record. run: 1095 subRun: 1 event: 83068 at 01-Aug-2025 01:00:51 UTC
Begin processing the 69th record. run: 1095 subRun: 1 event: 83069 at 01-Aug-2025 01:00:54 UTC
Begin processing the 70th record. run: 1095 subRun: 1 event: 83070 at 01-Aug-2025 01:01:17 UTC
Begin processing the 71st record. run: 1095 subRun: 1 event: 83071 at 01-Aug-2025 01:01:18 UTC
Begin processing the 72nd record. run: 1095 subRun: 1 event: 83072 at 01-Aug-2025 01:01:20 UTC
Begin processing the 73rd record. run: 1095 subRun: 1 event: 83073 at 01-Aug-2025 01:01:23 UTC
Begin processing the 74th record. run: 1095 subRun: 1 event: 83074 at 01-Aug-2025 01:01:25 UTC
Begin processing the 75th record. run: 1095 subRun: 1 event: 83075 at 01-Aug-2025 01:01:27 UTC
Begin processing the 76th record. run: 1095 subRun: 1 event: 83076 at 01-Aug-2025 01:01:29 UTC
Begin processing the 77th record. run: 1095 subRun: 1 event: 83077 at 01-Aug-2025 01:01:31 UTC
Begin processing the 78th record. run: 1095 subRun: 1 event: 83078 at 01-Aug-2025 01:01:34 UTC
Skipping event as it does not have enough hits or associated primary particles to make a training sample
iter->second->Run() throw STATUS_CODE_FAILURE
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, STATUS_CODE_FAILURE
Begin processing the 79th record. run: 1095 subRun: 1 event: 83079 at 01-Aug-2025 01:01:35 UTC
Begin processing the 80th record. run: 1095 subRun: 1 event: 83080 at 01-Aug-2025 01:01:37 UTC
Begin processing the 81st record. run: 1095 subRun: 1 event: 83081 at 01-Aug-2025 01:01:40 UTC
Begin processing the 82nd record. run: 1095 subRun: 1 event: 83082 at 01-Aug-2025 01:01:41 UTC
Begin processing the 83rd record. run: 1095 subRun: 1 event: 83083 at 01-Aug-2025 01:01:43 UTC
Begin processing the 84th record. run: 1095 subRun: 1 event: 83084 at 01-Aug-2025 01:01:52 UTC
Begin processing the 85th record. run: 1095 subRun: 1 event: 83085 at 01-Aug-2025 01:01:57 UTC
Begin processing the 86th record. run: 1095 subRun: 1 event: 83086 at 01-Aug-2025 01:01:58 UTC
Begin processing the 87th record. run: 1095 subRun: 1 event: 83087 at 01-Aug-2025 01:02:00 UTC
Begin processing the 88th record. run: 1095 subRun: 1 event: 83088 at 01-Aug-2025 01:02:01 UTC
Begin processing the 89th record. run: 1095 subRun: 1 event: 83089 at 01-Aug-2025 01:02:03 UTC
Skipping event as it does not have enough hits or associated primary particles to make a training sample
iter->second->Run() throw STATUS_CODE_FAILURE
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, STATUS_CODE_FAILURE
Begin processing the 90th record. run: 1095 subRun: 1 event: 83090 at 01-Aug-2025 01:02:05 UTC
Skipping event as it does not have enough hits or associated primary particles to make a training sample
iter->second->Run() throw STATUS_CODE_FAILURE
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, STATUS_CODE_FAILURE
Begin processing the 91st record. run: 1095 subRun: 1 event: 83091 at 01-Aug-2025 01:02:06 UTC
Begin processing the 92nd record. run: 1095 subRun: 1 event: 83092 at 01-Aug-2025 01:02:09 UTC
Begin processing the 93rd record. run: 1095 subRun: 1 event: 83093 at 01-Aug-2025 01:02:10 UTC
Begin processing the 94th record. run: 1095 subRun: 1 event: 83094 at 01-Aug-2025 01:02:13 UTC
Begin processing the 95th record. run: 1095 subRun: 1 event: 83095 at 01-Aug-2025 01:02:15 UTC
Begin processing the 96th record. run: 1095 subRun: 1 event: 83096 at 01-Aug-2025 01:02:18 UTC
Begin processing the 97th record. run: 1095 subRun: 1 event: 83097 at 01-Aug-2025 01:02:20 UTC
Begin processing the 98th record. run: 1095 subRun: 1 event: 83098 at 01-Aug-2025 01:02:32 UTC
Begin processing the 99th record. run: 1095 subRun: 1 event: 83099 at 01-Aug-2025 01:02:34 UTC
Begin processing the 100th record. run: 1095 subRun: 1 event: 83100 at 01-Aug-2025 01:02:37 UTC
CNNTrackShowerCountingAlgorithm::GetCrop1D: reconstructed vertex outside cropped region! 362.355, 106.334, 362.334
CNNTrackShowerCountingAlgorithm::GetCrop1D: reconstructed vertex outside cropped region! 362.355, 106.281, 362.281
01-Aug-2025 01:02:42 UTC  Closed output file "anue_dune10kt_1x2x6_1095_830_20230825T091829Z_gen_g4_detsim_hitreco__20240221T091323Z_reco2_reco2.root"
01-Aug-2025 01:02:42 UTC  Closed input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-hd/c3/6d/anue_dune10kt_1x2x6_1095_830_20230825T091829Z_gen_g4_detsim_hitreco__20240221T091323Z_reco2.root"

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                      1.2904        3.52836       44.6458       1.96974        5.413         100    
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                        0.00180298     0.0193546     0.130333      0.0174512     0.0160646       100    
reco:pandora2:StandardPandora                   1.27562       3.5061        44.6399       1.95604       5.41419        100    
[art]:TriggerResults:TriggerResultInserter    1.4241e-05    2.52536e-05   0.000130389   2.2559e-05    1.44328e-05      100    
end_path:out1:RootOutput                       2.215e-06    3.80286e-06    1.987e-05    3.3565e-06    1.9097e-06       100    
end_path:out1:RootOutput(write)               0.000142709   0.00270108      0.20579     0.000288944    0.0204256       100    
================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 2266.32 MB
  Peak resident set size usage (VmHWM): 1281.75 MB
====================================================================================================
Art has completed and will exit with status 0.
lar exit code 0
total 2176
-rw-r--r-- 1 dunegli fnalgrid     214 Aug  1 00:55 all-input-dids.txt
-rw-r--r-- 1 dunegli fnalgrid  362974 Aug  1 01:02 anue_dune10kt_1x2x6_1095_830_20230825T091829Z_gen_g4_detsim_hitreco__20240221T091323Z_reco2_reco2.root
-rw-r--r-- 1 dunegli fnalgrid       0 Aug  1 00:56 debugprod.log
-rw-r--r-- 1 dunegli fnalgrid   43907 Aug  1 01:02 jobscript.log
-rw-r--r-- 1 dunegli fnalgrid     183 Aug  1 01:02 justin-processed-pfns.txt
drwxr-xr-x 4 dunegli fnalgrid      60 Aug  1 00:55 larpandoracontent
-rw-r--r-- 1 dunegli fnalgrid     519 Aug  1 01:02 reco2_hist.root
-rw-r--r-- 1 dunegli fnalgrid 1802811 Aug  1 01:02 trainingFile_anue_dune10kt_1x2x6_1095_830_20230825T091829Z_gen_g4_detsim_hitreco__20240221T091323Z_reco2.root
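
The job exits with status 0 even though several Pandora algorithm failures appear in the log, so it can be useful to tally them per event. The following is a minimal sketch under stated assumptions: the function and variable names are hypothetical, the regular expressions are written against the line formats visible in this log only, and each failure is attributed to the most recent "Begin processing" record, which assumes messages are emitted in order.

```python
import re
from collections import Counter

# Patterns inferred from the log lines shown above (an assumption, not a spec).
RECORD_RE = re.compile(
    r"Begin processing the (\d+)(?:st|nd|rd|th) record\. "
    r"run: (\d+) subRun: (\d+) event: (\d+)"
)
FAILURE_RE = re.compile(r"Failure in algorithm (\w+), (\w+), (STATUS_CODE_\w+)")


def tally_failures(lines):
    """Return (event, algorithm name, status code) for each failure line."""
    current_event = None          # event number of the latest record seen
    failures = []
    for line in lines:
        record = RECORD_RE.search(line)
        if record:
            current_event = int(record.group(4))
            continue
        failure = FAILURE_RE.search(line)
        if failure:
            failures.append((current_event, failure.group(2), failure.group(3)))
    return failures


# A few lines copied from the jobscript log above.
sample = [
    "Begin processing the 49th record. run: 1095 subRun: 1 event: 83049 at 01-Aug-2025 00:59:49 UTC",
    "Failure in algorithm Alg0004, LArCNNTrackShowerCounting, STATUS_CODE_NOT_INITIALIZED",
    "Begin processing the 50th record. run: 1095 subRun: 1 event: 83050 at 01-Aug-2025 00:59:51 UTC",
    "Begin processing the 78th record. run: 1095 subRun: 1 event: 83078 at 01-Aug-2025 01:01:34 UTC",
    "Skipping event as it does not have enough hits or associated primary particles to make a training sample",
    "iter->second->Run() throw STATUS_CODE_FAILURE",
    "Failure in algorithm Alg0004, LArCNNTrackShowerCounting, STATUS_CODE_FAILURE",
    "Begin processing the 79th record. run: 1095 subRun: 1 event: 83079 at 01-Aug-2025 01:01:35 UTC",
]

failures = tally_failures(sample)
status_counts = Counter(status for _, _, status in failures)

for event, algorithm, status in failures:
    print(event, algorithm, status)
```

To run it over a full log, pass an open file handle for `jobscript.log` from the saved logs archive to `tally_failures` in place of `sample`.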
justIN time: 2025-09-02 19:11:32 UTC       justIN version: 01.04.01