Jobsub ID 13546.10@dunegpschedd02.fnal.gov

Jobsub ID: 13546.10@dunegpschedd02.fnal.gov
Workflow ID: 83
Stage ID: 1
User name: ichong@fnal.gov
HTCondor Group: group_dune
Requested
  Processors: 1
  GPU: No
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 80000 (22 hours)
Submitted time: 2025-08-01 02:23:30
Site: NL_SURFsara
Entry: DUNE_SurfSARA_arc03
Last heartbeat: 2025-08-01 02:30:47
From worker node
  Hostname: wn-da-08.gina.surf.nl
  cpuinfo: AMD EPYC 7702P 64-Core Processor
  OS release: Scientific Linux release 7.9 (Nitrogen)
  Processors: 1
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 129600 (36 hours)
  GPU:
  Inner Apptainer?: True
Job state: finished
Started: 2025-08-01 02:24:01
Input files: fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50568178_576_20231202T171512Z_gen_g4_detsim_hitreco__20240507T205408Z_reco2.root
Jobscript
  Exit code: 0
  Real time: 6m (364s)
  CPU time: 5m (334s = 91%)
  Max RSS bytes: 2008653824 (1915 MiB)
Outputting started: 2025-08-01 02:30:06
Output files:
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_7_training1_CaloHitListU_graph.data
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_7_training1_CaloHitListV_graph.data
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_7_training1_CaloHitListW_graph.data
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_7_training2_CaloHitListU_graph.data
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_7_training2_CaloHitListV_graph.data
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_7_training2_CaloHitListW_graph.data
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_7_analysiseid.root
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_7_ana_tree_hd.root
Finished: 2025-08-01 02:30:47
Saved logs: justin-logs:13546.10-dunegpschedd02.fnal.gov.logs.tgz
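
The derived figures in the job summary (MiB values, hour limits, the CPU percentage) can be cross-checked against the raw byte and second counts. The following is a minimal sketch, not justIN's own code: the helper names are hypothetical, and it assumes the page rounds down to whole units, which is consistent with the values shown. It also relates the Max RSS figure to the MemoryTracker summary in the log, which states that it uses base-10 MB rather than MiB.

```python
MIB = 1024 ** 2  # bytes per mebibyte (base 2), the unit used in the job summary
MB = 1000 ** 2   # bytes per megabyte (base 10), the unit used by MemoryTracker


def bytes_to_mib(n_bytes):
    """Whole MiB, rounded down (assumed to match the page's display)."""
    return n_bytes // MIB


def bytes_to_mb(n_bytes):
    """Base-10 megabytes as a float."""
    return n_bytes / MB


def seconds_to_hours(seconds):
    """Whole hours, rounded down (assumed to match the page's display)."""
    return seconds // 3600


def cpu_efficiency_percent(cpu_seconds, real_seconds):
    """CPU time as a whole-number percentage of real time, rounded down."""
    return (100 * cpu_seconds) // real_seconds


print(bytes_to_mib(4194304000))              # requested RSS in MiB
print(bytes_to_mib(2008653824))              # Max RSS in MiB
print(round(bytes_to_mb(2008653824), 2))     # Max RSS in base-10 MB
print(seconds_to_hours(80000), seconds_to_hours(129600))
print(cpu_efficiency_percent(334, 364))      # CPU time / real time
```

Under these assumptions, the Max RSS of 2008653824 bytes corresponds to both the 1915 MiB shown in the summary and the 2008.65 MB reported as VmHWM in the lar log, so the two numbers describe the same peak in different units.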

Jobscript log (last 10,000 characters)

amed training2_CaloHitListU_graph.data -> graph_output_2025-08-01T_022408Z_7_training2_CaloHitListU_graph.data
Renamed training2_CaloHitListV_graph.data -> graph_output_2025-08-01T_022408Z_7_training2_CaloHitListV_graph.data
Renamed training2_CaloHitListW_graph.data -> graph_output_2025-08-01T_022408Z_7_training2_CaloHitListW_graph.data
Renamed analysiseid.root -> graph_output_2025-08-01T_022408Z_7_analysiseid.root
Renamed ana_tree_hd.root -> graph_output_2025-08-01T_022408Z_7_ana_tree_hd.root
=== Start last 100 lines of lar log file ===
Finding XUV coincidences...
C:0 T:13 8 XUs and 9 XVs -> 6 XUVs
C:0 T:17 59 XUs and 53 XVs -> 33 XUVs
C:0 T:21 24 XUs and 21 XVs -> 11 XUVs
50 XUVs total
26 collection wire objects
50 potential space points
Neighbour search...
212 tests to find 162 neighbours
Iterating with no regularization...
Begin: 181399
0 51083.1
1 47828.2
2 47791.5
Now with regularization...
Begin: -6180.9
0 -6217.96
1 -6218.67
---MC-PARTICLE-MONITORING-----------------------------------------------------------------------
------------------------------------------------------------------------------------------------
Operating in training mode.
this->CompleteMCHierarchy(mcToHitsMap, hierarchy) return STATUS_CODE_NOT_FOUND
    in function: PrepareTrainingSample
    in file:     /exp/dune/app/users/ichong/larsoft_graph_V1_2025/srcs/larpandoracontent/larpandoradlcontent/LArVertex/DlVertexingAlgorithm.cc line#: 81
iter->second->Run() throw STATUS_CODE_NOT_FOUND
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0002, LArDLVertexing, STATUS_CODE_NOT_FOUND
Operating in inference mode.
Operating in training mode.
this->CompleteMCHierarchy(mcToHitsMap, hierarchy) return STATUS_CODE_NOT_FOUND
    in function: PrepareTrainingSample
    in file:     /exp/dune/app/users/ichong/larsoft_graph_V1_2025/srcs/larpandoracontent/larpandoradlcontent/LArVertex/DlVertexingAlgorithm.cc line#: 81
iter->second->Run() throw STATUS_CODE_NOT_FOUND
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArDLVertexing, STATUS_CODE_NOT_FOUND
Boundary wire vector sizes: 44, 46, 34
minwire 0: 1634
minwire 1: 201
minwire 2: 1700
Used alternate method to get min and max wires due to vertex determination failure: 163, 662
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max wires due to vertex determination failure: 2379, 2878
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max wires due to vertex determination failure: 511, 1010
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Classifier summary: 
Output 0: 0.0105632, 
Output 1: 0.00343189, 0.00264195, 0.00221068, 0.991715, 
Output 2: 0.995093, 0.000673642, 7.00087e-05, 0.00416335, 
Output 3: 0.974423, 0.0245672, 0.000511665, 0.000497805, 
Output 4: 0.995099, 0.00429244, 8.43373e-05, 0.000523846, 
Output 5: 0.998062, 0.00179778, 1.07314e-05, 0.000129421, 
Output 6: 0.0946684, 0.611462, 0.129745, 0.164125, 


Running Ophitfinder with InputDigiType = 'recob'
Found hits: 68!
01-Aug-2025 04:29:16 CEST  Closed output file "atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50568178_576_20231202T171512Z_gen_g4_detsim_hitreco__20240507T205408Z_reco2_graph_2025-08-01T_022408Z.root"
01-Aug-2025 04:29:16 CEST  Closed input file "root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/fardet-hd/5a/e3/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50568178_576_20231202T171512Z_gen_g4_detsim_hitreco__20240507T205408Z_reco2.root"

========================================================================================================================================
TimeTracker printout (sec)                                Min           Avg           Max         Median          RMS         nEvts   
========================================================================================================================================
Full event                                             0.0537497      2.32272       11.2344       2.33977       1.35047        100    
----------------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                                 0.0012754    0.00256021    0.00403005    0.00257564    0.000623661      100    
reco:gaushit:GausHitFinder                             0.0139128     0.0542627     0.574891      0.0232995     0.0851716       100    
reco:spsolve:SpacePointSolver                         9.9569e-05     0.0320984     0.593131     0.00313306     0.0905293       100    
reco:hitfd:DisambigFromSpacePoints                    0.000191895    0.0129848     0.467067     0.00108518     0.0524566       100    
reco:rns:RandomNumberSaver                            1.7953e-05    2.78902e-05   0.00028007    2.36205e-05   2.62966e-05      100    
reco:pandora:StandardPandora                          0.00991981      1.12117       7.77573      0.941879       0.82048        100    
reco:pandoraTrack:LArPandoraTrackCreation             0.000108996   0.000163692   0.00186411    0.000132416   0.000174717      100    
reco:pandoraShower:LArPandoraModularShowerCreation    0.000149134   0.000199455   0.00273811    0.00016862    0.000255751      100    
reco:pandoracalo:Calorimetry                          9.1454e-05    0.000129016   0.00114741    0.000111407   0.000106181      100    
reco:pandorapid:Chi2ParticleID                        3.3113e-05    4.38506e-05   0.000612001   3.6194e-05    5.73992e-05      100    
reco:cvnmap:CVNMapper                                 2.1641e-05     0.0210568     0.0524887     0.0220104     0.015315        100    
reco:cvneva:CVNEvaluator                              1.4578e-05      0.62668       2.34147      0.844731      0.427089        100    
reco:energyrecnumu:EnergyReco                         0.00188559    0.00380005     0.010291     0.00278605    0.00201006       100    
reco:energyrecnue:EnergyReco                           0.0017599    0.00262855    0.00913915    0.00230926    0.00112468       100    
reco:energyrecnc:EnergyReco                           0.00187796    0.00255683    0.00905744    0.00220978     0.0011241       100    
reco:energyrecnumurange:EnergyReco                    0.00183355     0.0025644     0.0089362    0.00226693    0.00109912       100    
reco:energyrecnumumcs:EnergyReco                      0.00172126    0.00252134    0.00894073    0.00217556    0.00111639       100    
reco:opdec:Deconvolution                              0.00540535     0.137399       0.25466      0.134266      0.0563497       100    
reco:ophitspe:OpHitFinderDeco                         0.00579315     0.0107341     0.0289874     0.0100941    0.00281543       100    
reco:opflash:OpFlashFinder                            0.000137872   0.000336245   0.00337455    0.000273289   0.00032386       100    
reco:opslicer:OpSlicer                                3.4616e-05    0.00870569     0.0490241    0.00427369     0.0112971       100    
[art]:TriggerResults:TriggerResultInserter             9.347e-06    1.32925e-05   5.5786e-05    1.1972e-05    4.99085e-06      100    
end_path:out1:RootOutput                               3.588e-06    4.79267e-06   1.7784e-05     4.483e-06    1.50618e-06      100    
end_path:out1:RootOutput(write)                       0.00371212      0.27915      0.577968       0.34698      0.159109        100    
========================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 11601 MB
  Peak resident set size usage (VmHWM): 2008.65 MB
====================================================================================================
Art has completed and will exit with status 0.
=== End last 100 lines of lar log file ===
=== Generated output files ===
13546.10_dunegpschedd02.fnal.gov.logs.tgz
atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50568178_576_20231202T171512Z_gen_g4_detsim_hitreco__20240507T205408Z_reco2_graph_2025-08-01T_022408Z.log
atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50568178_576_20231202T171512Z_gen_g4_detsim_hitreco__20240507T205408Z_reco2_graph_2025-08-01T_022408Z.root
debugprod.log
graph_output_2025-08-01T_022408Z_7_ana_tree_hd.root
graph_output_2025-08-01T_022408Z_7_analysiseid.root
graph_output_2025-08-01T_022408Z_7_training1_CaloHitListU_graph.data
graph_output_2025-08-01T_022408Z_7_training1_CaloHitListV_graph.data
graph_output_2025-08-01T_022408Z_7_training1_CaloHitListW_graph.data
graph_output_2025-08-01T_022408Z_7_training2_CaloHitListU_graph.data
graph_output_2025-08-01T_022408Z_7_training2_CaloHitListV_graph.data
graph_output_2025-08-01T_022408Z_7_training2_CaloHitListW_graph.data
jobscript.log
justin-processed-pfns.txt
reco_hist.root
secondary_atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50568178_576_20231202T171512Z_gen_g4_detsim_hitreco__20240507T205408Z_reco2_graph_2025-08-01T_022408Z.log
third_atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50568178_576_20231202T171512Z_gen_g4_detsim_hitreco__20240507T205408Z_reco2_graph_2025-08-01T_022408Z.log
training1_CaloHitListU.csv
training1_CaloHitListV.csv
training1_CaloHitListW.csv
training2_CaloHitListU.csv
training2_CaloHitListV.csv
training2_CaloHitListW.csv
justIN time: 2025-08-04 17:39:50 UTC       justIN version: 01.04.00