
Jobsub ID 13546.2@dunegpschedd02.fnal.gov

Jobsub ID: 13546.2@dunegpschedd02.fnal.gov
Workflow ID: 83
Stage ID: 1
User name: ichong@fnal.gov
HTCondor Group: group_dune
Requested:
    Processors: 1
    GPU: No
    RSS bytes: 4194304000 (4000 MiB)
    Wall seconds limit: 80000 (22 hours)
Submitted time: 2025-08-01 02:23:30
Site: NL_SURFsara
Entry: DUNE_SurfSARA_arc03
Last heartbeat: 2025-08-01 02:35:37
From worker node:
    Hostname: wn-da-15.gina.surf.nl
    cpuinfo: AMD EPYC 7702P 64-Core Processor
    OS release: Scientific Linux release 7.9 (Nitrogen)
    Processors: 1
    RSS bytes: 4194304000 (4000 MiB)
    Wall seconds limit: 129600 (36 hours)
    GPU:
    Inner Apptainer?: True
Job state: finished
Started: 2025-08-01 02:23:56
Input files: fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74254738_191_20231121T152659Z_gen_g4_detsim_hitreco__20240503T070553Z_reco2.root
Jobscript:
    Exit code: 0
    Real time: 10m (656s)
    CPU time: 10m (615s = 93%)
    Max RSS bytes: 2025381888 (1931 MiB)
Outputting started: 2025-08-01 02:34:54
Output files:
    https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_8_training1_CaloHitListU_graph.data
    https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_8_training1_CaloHitListV_graph.data
    https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_8_training1_CaloHitListW_graph.data
    https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_8_training2_CaloHitListU_graph.data
    https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_8_training2_CaloHitListV_graph.data
    https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_8_training2_CaloHitListW_graph.data
    https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_8_analysiseid.root
    https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00083/1/024/graph_output_2025-08-01T_022408Z_8_ana_tree_hd.root
Finished: 2025-08-01 02:35:37
Saved logs: justin-logs:13546.2-dunegpschedd02.fnal.gov.logs.tgz
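The derived figures in the job summary above (the MiB conversions and the 93% CPU efficiency) follow from simple arithmetic on the raw values. The sketch below reproduces them. The helper names `to_mib` and `cpu_efficiency_percent` are hypothetical and are not part of justIN, and the assumption that the page truncates rather than rounds is inferred only from the numbers shown.

```python
# Hypothetical helpers for checking the summary figures; not justIN code.
MIB = 1024 * 1024  # bytes per MiB (base 2)


def to_mib(n_bytes: int) -> float:
    """Convert a byte count to MiB."""
    return n_bytes / MIB


def cpu_efficiency_percent(cpu_seconds: float, real_seconds: float) -> float:
    """CPU time as a percentage of wall-clock (real) time."""
    return 100.0 * cpu_seconds / real_seconds


# Raw values copied from the job summary.
requested_rss_bytes = 4194304000
max_rss_bytes = 2025381888
real_seconds = 656
cpu_seconds = 615

# Assumption: the page appears to truncate to whole units, so int() is used.
print("Requested RSS:", int(to_mib(requested_rss_bytes)), "MiB")
print("Max RSS:", int(to_mib(max_rss_bytes)), "MiB")
print("CPU efficiency:", int(cpu_efficiency_percent(cpu_seconds, real_seconds)), "%")

# The MemoryTracker summary in the lar log uses base-10 MB instead of MiB.
print("Max RSS (base-10):", round(max_rss_bytes / 1e6, 2), "MB")
```

The same peak memory therefore appears as 1931 MiB in the summary and as 2025.38 MB in the MemoryTracker summary; the two differ only in the unit.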

Jobscript log (last 10,000 characters)

-> graph_output_2025-08-01T_022408Z_8_training1_CaloHitListU_graph.data
Renamed training1_CaloHitListV_graph.data -> graph_output_2025-08-01T_022408Z_8_training1_CaloHitListV_graph.data
Renamed training1_CaloHitListW_graph.data -> graph_output_2025-08-01T_022408Z_8_training1_CaloHitListW_graph.data
Renamed training2_CaloHitListU_graph.data -> graph_output_2025-08-01T_022408Z_8_training2_CaloHitListU_graph.data
Renamed training2_CaloHitListV_graph.data -> graph_output_2025-08-01T_022408Z_8_training2_CaloHitListV_graph.data
Renamed training2_CaloHitListW_graph.data -> graph_output_2025-08-01T_022408Z_8_training2_CaloHitListW_graph.data
Renamed analysiseid.root -> graph_output_2025-08-01T_022408Z_8_analysiseid.root
Renamed ana_tree_hd.root -> graph_output_2025-08-01T_022408Z_8_ana_tree_hd.root
=== Start last 100 lines of lar log file ===
Used alternate method to get min and max wires due to vertex determination failure: 0, 499
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Classifier summary: 
Output 0: 0.0174195, 
Output 1: 0.0203204, 0.677887, 0.134433, 0.167359, 
Output 2: 0.92245, 0.00745875, 0.00163459, 0.0684563, 
Output 3: 0.0193992, 0.958721, 0.0218633, 1.67279e-05, 
Output 4: 0.987379, 0.0124743, 8.17768e-05, 6.49831e-05, 
Output 5: 0.946633, 0.0502285, 0.000795352, 0.00234275, 
Output 6: 0.435732, 0.52439, 0.0326778, 0.00720001, 


Running Ophitfinder with InputDigiType = 'recob'
Found hits: 86!
Begin processing the 200th record. run: 74254738 subRun: 1 event: 38400 at 01-Aug-2025 04:33:43 CEST
0 X, 0 U, 0 V bad channels
Finding XUV coincidences...
C:0 T:2 17 XUs and 20 XVs -> 13 XUVs
13 XUVs total
5 collection wire objects
13 potential space points
Neighbour search...
85 tests to find 72 neighbours
Iterating with no regularization...
Begin: 37214
0 17499
1 5408.52
2 3269.66
3 2905.11
4 2841.17
5 2829.95
6 2827.94
Now with regularization...
Begin: -22764.3
0 -22807.1
1 -22832.4
2 -22837.1
---MC-PARTICLE-MONITORING-----------------------------------------------------------------------
------------------------------------------------------------------------------------------------
Operating in training mode.
this->CompleteMCHierarchy(mcToHitsMap, hierarchy) return STATUS_CODE_NOT_FOUND
    in function: PrepareTrainingSample
    in file:     /exp/dune/app/users/ichong/larsoft_graph_V1_2025/srcs/larpandoracontent/larpandoradlcontent/LArVertex/DlVertexingAlgorithm.cc line#: 81
iter->second->Run() throw STATUS_CODE_NOT_FOUND
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0002, LArDLVertexing, STATUS_CODE_NOT_FOUND
Operating in inference mode.
Operating in training mode.
this->CompleteMCHierarchy(mcToHitsMap, hierarchy) return STATUS_CODE_NOT_FOUND
    in function: PrepareTrainingSample
    in file:     /exp/dune/app/users/ichong/larsoft_graph_V1_2025/srcs/larpandoracontent/larpandoradlcontent/LArVertex/DlVertexingAlgorithm.cc line#: 81
iter->second->Run() throw STATUS_CODE_NOT_FOUND
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArDLVertexing, STATUS_CODE_NOT_FOUND

Running Ophitfinder with InputDigiType = 'recob'
Found hits: 11!
01-Aug-2025 04:33:45 CEST  Closed output file "atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74254738_191_20231121T152659Z_gen_g4_detsim_hitreco__20240503T070553Z_reco2_graph_2025-08-01T_022408Z.root"
01-Aug-2025 04:33:45 CEST  Closed input file "root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/fardet-hd/ab/62/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74254738_191_20231121T152659Z_gen_g4_detsim_hitreco__20240503T070553Z_reco2.root"

========================================================================================================================================
TimeTracker printout (sec)                                Min           Avg           Max         Median          RMS         nEvts   
========================================================================================================================================
Full event                                             0.0778134      2.36099       15.5935       2.50082       1.33618        200    
----------------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                                0.00131963    0.00259532     0.0133898     0.0023523    0.00132712       200    
reco:gaushit:GausHitFinder                             0.0141929     0.0476876     0.726891      0.0245816     0.0733416       200    
reco:spsolve:SpacePointSolver                         0.000100912    0.0410856      1.2941      0.00301635      0.16422        200    
reco:hitfd:DisambigFromSpacePoints                    0.00019992    0.00871125     0.636588     0.000985305    0.0475355       200    
reco:rns:RandomNumberSaver                            1.9567e-05    2.71273e-05   0.000316092   2.4125e-05    2.11897e-05      200    
reco:pandora:StandardPandora                           0.0102028      1.12454       11.1258       1.00334       0.79015        200    
reco:pandoraTrack:LArPandoraTrackCreation             0.000118927   0.000164561   0.00237553    0.000146765   0.000159263      200    
reco:pandoraShower:LArPandoraModularShowerCreation    0.00016264    0.000208508   0.00326835    0.000191053   0.000217415      200    
reco:pandoracalo:Calorimetry                          9.6703e-05    0.000125821   0.00144733    0.00011533    9.49765e-05      200    
reco:pandorapid:Chi2ParticleID                        3.5257e-05    4.67208e-05   0.000760418    4.218e-05    5.08208e-05      200    
reco:cvnmap:CVNMapper                                 1.8947e-05     0.0168241     0.0462857     0.0220951     0.0110965       200    
reco:cvneva:CVNEvaluator                                1.6e-05      0.641512       3.0209        0.87817      0.436903        200    
reco:energyrecnumu:EnergyReco                         0.00188639    0.00431254     0.016663     0.00324811    0.00232037       200    
reco:energyrecnue:EnergyReco                          0.00187546    0.00260491     0.0123335    0.00227812    0.00117717       200    
reco:energyrecnc:EnergyReco                           0.00183736    0.00247531     0.012426     0.00218044    0.00102206       200    
reco:energyrecnumurange:EnergyReco                    0.00185478    0.00248533     0.0125314    0.00218786    0.00102851       200    
reco:energyrecnumumcs:EnergyReco                      0.00178244    0.00246431     0.0125113    0.00216233    0.00101863       200    
reco:opdec:Deconvolution                               0.0105239     0.148686      0.286825      0.148733      0.0581934       200    
reco:ophitspe:OpHitFinderDeco                         0.00593687     0.0112886     0.0490194     0.0106951     0.004206        200    
reco:opflash:OpFlashFinder                            0.000117464   0.000329163   0.00287312    0.000298092   0.000205879      200    
reco:opslicer:OpSlicer                                3.3734e-05     0.0103256     0.0561116    0.00535567     0.0125685       200    
[art]:TriggerResults:TriggerResultInserter             9.739e-06    1.35785e-05   6.8802e-05    1.26295e-05   4.81353e-06      200    
end_path:out1:RootOutput                               3.497e-06    4.87818e-06   2.0229e-05    4.5885e-06    1.84197e-06      200    
end_path:out1:RootOutput(write)                       0.00364862     0.291547      0.701518      0.367599      0.166688        200    
========================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 11615.6 MB
  Peak resident set size usage (VmHWM): 2025.38 MB
====================================================================================================
Art has completed and will exit with status 0.
=== End last 100 lines of lar log file ===
=== Generated output files ===
13546.2_dunegpschedd02.fnal.gov.logs.tgz
atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74254738_191_20231121T152659Z_gen_g4_detsim_hitreco__20240503T070553Z_reco2_graph_2025-08-01T_022408Z.log
atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74254738_191_20231121T152659Z_gen_g4_detsim_hitreco__20240503T070553Z_reco2_graph_2025-08-01T_022408Z.root
debugprod.log
graph_output_2025-08-01T_022408Z_8_ana_tree_hd.root
graph_output_2025-08-01T_022408Z_8_analysiseid.root
graph_output_2025-08-01T_022408Z_8_training1_CaloHitListU_graph.data
graph_output_2025-08-01T_022408Z_8_training1_CaloHitListV_graph.data
graph_output_2025-08-01T_022408Z_8_training1_CaloHitListW_graph.data
graph_output_2025-08-01T_022408Z_8_training2_CaloHitListU_graph.data
graph_output_2025-08-01T_022408Z_8_training2_CaloHitListV_graph.data
graph_output_2025-08-01T_022408Z_8_training2_CaloHitListW_graph.data
jobscript.log
justin-processed-pfns.txt
reco_hist.root
secondary_atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74254738_191_20231121T152659Z_gen_g4_detsim_hitreco__20240503T070553Z_reco2_graph_2025-08-01T_022408Z.log
third_atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74254738_191_20231121T152659Z_gen_g4_detsim_hitreco__20240503T070553Z_reco2_graph_2025-08-01T_022408Z.log
training1_CaloHitListU.csv
training1_CaloHitListV.csv
training1_CaloHitListW.csv
training2_CaloHitListU.csv
training2_CaloHitListV.csv
training2_CaloHitListW.csv
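
The TimeTracker printout in the lar log above can be read programmatically to see which modules dominate the per-event time. The following is a minimal sketch, assuming each row is a label followed by six whitespace-separated numeric columns (Min, Avg, Max, Median, RMS, nEvts) as in the printout. `TimingRow` and `parse_row` are hypothetical names, not part of art or justIN, and the sample rows are copied from the log.

```python
from typing import NamedTuple


class TimingRow(NamedTuple):
    label: str
    min_s: float
    avg_s: float
    max_s: float
    median_s: float
    rms_s: float
    n_events: int


def parse_row(line: str) -> TimingRow:
    """Split one TimeTracker row into its label and six numeric columns."""
    # Split from the right so labels containing spaces ("Full event") survive.
    label, *numbers = line.rsplit(None, 6)
    mn, avg, mx, med, rms = (float(x) for x in numbers[:5])
    return TimingRow(label.strip(), mn, avg, mx, med, rms, int(numbers[5]))


# Three rows copied from the TimeTracker printout in the log.
SAMPLE = """
Full event                                             0.0778134      2.36099       15.5935       2.50082       1.33618        200
reco:pandora:StandardPandora                           0.0102028      1.12454       11.1258       1.00334       0.79015        200
reco:cvneva:CVNEvaluator                                1.6e-05      0.641512       3.0209        0.87817      0.436903        200
"""

rows = [parse_row(line) for line in SAMPLE.splitlines() if line.strip()]
full_event = rows[0]

for row in rows[1:]:
    share = 100.0 * row.avg_s / full_event.avg_s
    print(f"{row.label}: {share:.1f}% of the average event time")
```

On these rows, StandardPandora accounts for a little under half of the average event time, with CVNEvaluator as the next largest contributor.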
justIN time: 2025-08-04 17:41:45 UTC       justIN version: 01.04.00