justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 14065.15@dunegpschedd02.fnal.gov

Jobsub ID14065.15@dunegpschedd02.fnal.gov
Workflow ID253
Stage ID1
User nameichong@fnal.gov
HTCondor Groupgroup_dune
RequestedProcessors1
GPUNo
RSS bytes4194304000 (4000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2025-08-02 22:37:47
SiteNL_SURFsara
EntryDUNE_SurfSARA_arc02
Last heartbeat2025-08-02 22:48:26
From worker nodeHostnamewn-la-13.gina.surf.nl
cpuinfoAMD EPYC 9754 128-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit129600 (36 hours)
GPU
Inner Apptainer?True
Job statefinished
Started2025-08-02 22:39:03
Input filesfardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6403947_493_20231202T120446Z_gen_g4_detsim_hitreco__20240507T212214Z_reco2.root
JobscriptExit code0
Real time8m (492s)
CPU time7m (469s = 95%)
Max RSS bytes2085957632 (1989 MiB)
Outputting started2025-08-02 22:47:16
Output fileshttps://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223914Z_1_training1_CaloHitListU_graph.data
https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223914Z_1_training1_CaloHitListV_graph.data
https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223914Z_1_training1_CaloHitListW_graph.data
https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223914Z_1_training2_CaloHitListU_graph.data
https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223914Z_1_training2_CaloHitListV_graph.data
https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223914Z_1_training2_CaloHitListW_graph.data
https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223914Z_1_analysiseid.root
https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223914Z_1_ana_tree_hd.root
Finished2025-08-02 22:48:26
Saved logsjustin-logs:14065.15-dunegpschedd02.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

tree_hd.root -> graph_output_2025-08-02T_223914Z_1_ana_tree_hd.root
=== Start last 100 lines of lar log file ===
this->CompleteMCHierarchy(mcToHitsMap, hierarchy) return STATUS_CODE_NOT_FOUND
    in function: PrepareTrainingSample
    in file:     /exp/dune/app/users/ichong/larsoft_graph_V1_2025/srcs/larpandoracontent/larpandoradlcontent/LArVertex/DlVertexingAlgorithm.cc line#: 81
iter->second->Run() throw STATUS_CODE_NOT_FOUND
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0002, LArDLVertexing, STATUS_CODE_NOT_FOUND
Operating in inference mode.
Operating in training mode.
this->CompleteMCHierarchy(mcToHitsMap, hierarchy) return STATUS_CODE_NOT_FOUND
    in function: PrepareTrainingSample
    in file:     /exp/dune/app/users/ichong/larsoft_graph_V1_2025/srcs/larpandoracontent/larpandoradlcontent/LArVertex/DlVertexingAlgorithm.cc line#: 81
iter->second->Run() throw STATUS_CODE_NOT_FOUND
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArDLVertexing, STATUS_CODE_NOT_FOUND

Running Ophitfinder with InputDigiType = 'recob'
Found hits: 28!
Begin processing the 100th record. run: 6403947 subRun: 1 event: 49400 at 03-Aug-2025 00:46:02 CEST
0 X, 0 U, 0 V bad channels
Finding XUV coincidences...
C:0 T:2 10 XUs and 10 XVs -> 8 XUVs
C:0 T:6 8 XUs and 7 XVs -> 5 XUVs
13 XUVs total
7 collection wire objects
13 potential space points
Neighbour search...
59 tests to find 46 neighbours
Iterating with no regularization...
Begin: 81129.2
0 54811.8
1 53921.8
2 53921.7
Now with regularization...
Begin: 29206.7
0 29204.5
---MC-PARTICLE-MONITORING-----------------------------------------------------------------------
------------------------------------------------------------------------------------------------
Operating in training mode.
this->CompleteMCHierarchy(mcToHitsMap, hierarchy) return STATUS_CODE_NOT_FOUND
    in function: PrepareTrainingSample
    in file:     /exp/dune/app/users/ichong/larsoft_graph_V1_2025/srcs/larpandoracontent/larpandoradlcontent/LArVertex/DlVertexingAlgorithm.cc line#: 81
iter->second->Run() throw STATUS_CODE_NOT_FOUND
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0002, LArDLVertexing, STATUS_CODE_NOT_FOUND
Operating in inference mode.
Operating in training mode.
this->CompleteMCHierarchy(mcToHitsMap, hierarchy) return STATUS_CODE_NOT_FOUND
    in function: PrepareTrainingSample
    in file:     /exp/dune/app/users/ichong/larsoft_graph_V1_2025/srcs/larpandoracontent/larpandoradlcontent/LArVertex/DlVertexingAlgorithm.cc line#: 81
iter->second->Run() throw STATUS_CODE_NOT_FOUND
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArDLVertexing, STATUS_CODE_NOT_FOUND

Running Ophitfinder with InputDigiType = 'recob'
Found hits: 55!
03-Aug-2025 00:46:05 CEST  Closed output file "atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6403947_493_20231202T120446Z_gen_g4_detsim_hitreco__20240507T212214Z_reco2_graph_2025-08-02T_223914Z.root"
03-Aug-2025 00:46:05 CEST  Closed input file "root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/fardet-hd/b2/0b/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6403947_493_20231202T120446Z_gen_g4_detsim_hitreco__20240507T212214Z_reco2.root"

========================================================================================================================================
TimeTracker printout (sec)                                Min           Avg           Max         Median          RMS         nEvts   
========================================================================================================================================
Full event                                             0.0791304      3.06341       13.9553       3.08548       1.8896         100    
----------------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                                0.00252066    0.00412602    0.00617312     0.003976     0.000968175      100    
reco:gaushit:GausHitFinder                             0.0189737     0.0616754     0.643069      0.027951      0.0922711       100    
reco:spsolve:SpacePointSolver                         0.00012749     0.0881638      3.76337     0.00264408     0.431123        100    
reco:hitfd:DisambigFromSpacePoints                    0.000230234    0.0127302     0.437905     0.000873607    0.0514151       100    
reco:rns:RandomNumberSaver                            2.3675e-05    3.59135e-05   0.000311946   2.94935e-05   2.93208e-05      100    
reco:pandora:StandardPandora                           0.0143614      1.50115       9.47091       1.27969       1.03351        100    
reco:pandoraTrack:LArPandoraTrackCreation             0.000151847   0.000218352   0.00276853    0.000182608   0.000259137      100    
reco:pandoraShower:LArPandoraModularShowerCreation    0.000192758   0.000260412   0.00371265    0.000222553   0.000347692      100    
reco:pandoracalo:Calorimetry                          0.000122473   0.00017197    0.00166142    0.000154727   0.000151394      100    
reco:pandorapid:Chi2ParticleID                        4.2343e-05    5.66552e-05   0.000779535    4.662e-05    7.32028e-05      100    
reco:cvnmap:CVNMapper                                 3.0294e-05     0.0165803     0.0658999     0.013378      0.0148436       100    
reco:cvneva:CVNEvaluator                               2.075e-05     0.790895       3.48564       1.09066      0.569683        100    
reco:energyrecnumu:EnergyReco                         0.00217344    0.00505089     0.0137594    0.00465351    0.00251222       100    
reco:energyrecnue:EnergyReco                          0.00215917    0.00285502    0.00940512    0.00250392    0.00112615       100    
reco:energyrecnc:EnergyReco                            0.0020965    0.00276186    0.00931684    0.00238378     0.0010881       100    
reco:energyrecnumurange:EnergyReco                    0.00205875    0.00275455    0.00947876    0.00237618    0.00111525       100    
reco:energyrecnumumcs:EnergyReco                      0.00204191    0.00280749     0.0093821    0.00239754    0.00119118       100    
reco:opdec:Deconvolution                              0.00854691     0.166583      0.334496       0.15717      0.0711128       100    
reco:ophitspe:OpHitFinderDeco                          0.0161813     0.0216197     0.0271433     0.0214474    0.00246158       100    
reco:opflash:OpFlashFinder                            0.000123315   0.000405304   0.00312865    0.000355817   0.000310028      100    
reco:opslicer:OpSlicer                                3.8387e-05     0.0128551     0.0735937    0.00479569     0.017581        100    
[art]:TriggerResults:TriggerResultInserter            1.2749e-05    1.69859e-05   6.9304e-05    1.53375e-05   7.13677e-06      100    
end_path:out1:RootOutput                               4.317e-06    5.94455e-06   2.6129e-05    5.4085e-06    2.87135e-06      100    
end_path:out1:RootOutput(write)                        0.0041462      0.36852      0.760261      0.459698       0.21256        100    
========================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 19793.6 MB
  Peak resident set size usage (VmHWM): 2085.96 MB
====================================================================================================
Art has completed and will exit with status 0.
=== End last 100 lines of lar log file ===
=== Generated output files ===
14065.15_dunegpschedd02.fnal.gov.logs.tgz
atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6403947_493_20231202T120446Z_gen_g4_detsim_hitreco__20240507T212214Z_reco2_graph_2025-08-02T_223914Z.log
atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6403947_493_20231202T120446Z_gen_g4_detsim_hitreco__20240507T212214Z_reco2_graph_2025-08-02T_223914Z.root
debugprod.log
graph_output_2025-08-02T_223914Z_1_ana_tree_hd.root
graph_output_2025-08-02T_223914Z_1_analysiseid.root
graph_output_2025-08-02T_223914Z_1_training1_CaloHitListU_graph.data
graph_output_2025-08-02T_223914Z_1_training1_CaloHitListV_graph.data
graph_output_2025-08-02T_223914Z_1_training1_CaloHitListW_graph.data
graph_output_2025-08-02T_223914Z_1_training2_CaloHitListU_graph.data
graph_output_2025-08-02T_223914Z_1_training2_CaloHitListV_graph.data
graph_output_2025-08-02T_223914Z_1_training2_CaloHitListW_graph.data
jobscript.log
justin-processed-pfns.txt
reco_hist.root
secondary_atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6403947_493_20231202T120446Z_gen_g4_detsim_hitreco__20240507T212214Z_reco2_graph_2025-08-02T_223914Z.log
third_atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6403947_493_20231202T120446Z_gen_g4_detsim_hitreco__20240507T212214Z_reco2_graph_2025-08-02T_223914Z.log
training1_CaloHitListU.csv
training1_CaloHitListV.csv
training1_CaloHitListW.csv
training2_CaloHitListU.csv
training2_CaloHitListV.csv
training2_CaloHitListW.csv
justIN time: 2025-08-04 16:08:28 UTC       justIN version: 01.04.00