
Jobsub ID 14065.77@dunegpschedd02.fnal.gov

Jobsub ID: 14065.77@dunegpschedd02.fnal.gov
Workflow ID: 253
Stage ID: 1
User name: ichong@fnal.gov
HTCondor Group: group_dune
Requested:
  Processors: 1
  GPU: No
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 80000 (22 hours)
Submitted time: 2025-08-02 22:37:47
Site: NL_SURFsara
Entry: DUNE_SurfSARA_arc02
Last heartbeat: 2025-08-02 22:46:34
From worker node:
  Hostname: wn-db-04.gina.surf.nl
  cpuinfo: AMD EPYC 7702P 64-Core Processor
  OS release: Scientific Linux release 7.9 (Nitrogen)
  Processors: 1
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 129600 (36 hours)
  GPU:
  Inner Apptainer?: True
Job state: finished
Started: 2025-08-02 22:38:46
Input files: fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6405486_528_20231202T114028Z_gen_g4_detsim_hitreco__20240507T220006Z_reco2.root
Jobscript:
  Exit code: 0
  Real time: 6m (409s)
  CPU time: 6m (373s = 91%)
  Max RSS bytes: 2024591360 (1930 MiB)
Outputting started: 2025-08-02 22:45:36
Output files:
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223854Z_9_training1_CaloHitListU_graph.data
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223854Z_9_training1_CaloHitListV_graph.data
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223854Z_9_training1_CaloHitListW_graph.data
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223854Z_9_training2_CaloHitListU_graph.data
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223854Z_9_training2_CaloHitListV_graph.data
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223854Z_9_training2_CaloHitListW_graph.data
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223854Z_9_analysiseid.root
  https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00253/1/001/graph_output_2025-08-02T_223854Z_9_ana_tree_hd.root
Finished: 2025-08-02 22:46:34
Saved logs: justin-logs:14065.77-dunegpschedd02.fnal.gov.logs.tgz
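
The derived figures in the job summary (MiB values and the CPU-time percentage) can be cross-checked from the raw numbers. The following is a minimal sketch, not part of justIN: the helper names are hypothetical, and the rounding rules (truncating to whole MiB, rounding the percentage to the nearest integer) are assumptions chosen because they reproduce the values shown on this page.

```python
# Raw values copied from the job summary on this page.
REQUESTED_RSS_BYTES = 4194304000
MAX_RSS_BYTES = 2024591360
REAL_SECONDS = 409
CPU_SECONDS = 373

MIB = 1024 * 1024  # one mebibyte in bytes


def to_mib(n_bytes: int) -> int:
    """Convert bytes to whole MiB, truncating (assumed display rule)."""
    return n_bytes // MIB


def cpu_efficiency_percent(cpu_s: float, real_s: float) -> int:
    """CPU time as a percentage of wall-clock time, rounded to an integer."""
    return round(100 * cpu_s / real_s)


def memory_fraction_percent(used_bytes: int, requested_bytes: int) -> int:
    """Peak RSS as a percentage of the requested RSS (hypothetical helper)."""
    return round(100 * used_bytes / requested_bytes)


if __name__ == "__main__":
    print("Requested RSS (MiB):", to_mib(REQUESTED_RSS_BYTES))
    print("Max RSS (MiB):", to_mib(MAX_RSS_BYTES))
    print("CPU efficiency (%):", cpu_efficiency_percent(CPU_SECONDS, REAL_SECONDS))
    print("Peak RSS / requested (%):",
          memory_fraction_percent(MAX_RSS_BYTES, REQUESTED_RSS_BYTES))
```

Under these assumptions the job peaked at roughly half of its requested memory, which is consistent with the 2024.59 MB (base-10) peak resident set size reported by the MemoryTracker summary in the log.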

Jobscript log (last 10,000 characters)

l exit with status 0.
=== End last 100 lines of third lar log file ===
Renamed training1_CaloHitListU_graph.data -> graph_output_2025-08-02T_223854Z_9_training1_CaloHitListU_graph.data
Renamed training1_CaloHitListV_graph.data -> graph_output_2025-08-02T_223854Z_9_training1_CaloHitListV_graph.data
Renamed training1_CaloHitListW_graph.data -> graph_output_2025-08-02T_223854Z_9_training1_CaloHitListW_graph.data
Renamed training2_CaloHitListU_graph.data -> graph_output_2025-08-02T_223854Z_9_training2_CaloHitListU_graph.data
Renamed training2_CaloHitListV_graph.data -> graph_output_2025-08-02T_223854Z_9_training2_CaloHitListV_graph.data
Renamed training2_CaloHitListW_graph.data -> graph_output_2025-08-02T_223854Z_9_training2_CaloHitListW_graph.data
Renamed analysiseid.root -> graph_output_2025-08-02T_223854Z_9_analysiseid.root
Renamed ana_tree_hd.root -> graph_output_2025-08-02T_223854Z_9_ana_tree_hd.root
=== Start last 100 lines of lar log file ===
Output 2: 0.993181, 0.000556104, 0.000913162, 0.00535024, 
Output 3: 0.922217, 0.0759981, 0.00121029, 0.000574709, 
Output 4: 0.996897, 0.00291721, 1.90545e-05, 0.000166714, 
Output 5: 0.992498, 0.00639436, 3.38903e-05, 0.00107418, 
Output 6: 0.691769, 0.277745, 0.0168946, 0.0135917, 


Running Ophitfinder with InputDigiType = 'recob'
Found hits: 87!
Begin processing the 100th record. run: 6405486 subRun: 1 event: 52900 at 03-Aug-2025 00:44:33 CEST
0 X, 0 U, 0 V bad channels
Finding XUV coincidences...
C:0 T:2 67 XUs and 61 XVs -> 41 XUVs
41 XUVs total
22 collection wire objects
41 potential space points
Neighbour search...
765 tests to find 698 neighbours
Iterating with no regularization...
Begin: 259092
0 106764
1 89987.3
2 87863.2
3 87141.4
4 86802.3
5 86646.4
6 86573.6
Now with regularization...
Begin: -77221.3
0 -77463.3
1 -77595.7
2 -77663.3
---MC-PARTICLE-MONITORING-----------------------------------------------------------------------

BeamNeutrinos: 

--Primary 0, MCPDG 13, Energy 2.10355, Dist. 6.95973, nMCHits 15 (6, 4, 5)
MCPDG 13, Energy 2.10355, Dist. 6.95973, nMCHits 15 (6, 4, 5)
------------------------------------------------------------------------------------------------
Operating in training mode.
this->CompleteMCHierarchy(mcToHitsMap, hierarchy) return STATUS_CODE_NOT_FOUND
    in function: PrepareTrainingSample
    in file:     /exp/dune/app/users/ichong/larsoft_graph_V1_2025/srcs/larpandoracontent/larpandoradlcontent/LArVertex/DlVertexingAlgorithm.cc line#: 81
iter->second->Run() throw STATUS_CODE_NOT_FOUND
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0002, LArDLVertexing, STATUS_CODE_NOT_FOUND
Operating in inference mode.
Operating in training mode.
this->CompleteMCHierarchy(mcToHitsMap, hierarchy) return STATUS_CODE_NOT_FOUND
    in function: PrepareTrainingSample
    in file:     /exp/dune/app/users/ichong/larsoft_graph_V1_2025/srcs/larpandoracontent/larpandoradlcontent/LArVertex/DlVertexingAlgorithm.cc line#: 81
iter->second->Run() throw STATUS_CODE_NOT_FOUND
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArDLVertexing, STATUS_CODE_NOT_FOUND

Running Ophitfinder with InputDigiType = 'recob'
Found hits: 30!
03-Aug-2025 00:44:35 CEST  Closed output file "atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6405486_528_20231202T114028Z_gen_g4_detsim_hitreco__20240507T220006Z_reco2_graph_2025-08-02T_223854Z.root"
03-Aug-2025 00:44:35 CEST  Closed input file "root://otter12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/fardet-hd/a6/0c/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6405486_528_20231202T114028Z_gen_g4_detsim_hitreco__20240507T220006Z_reco2.root"

========================================================================================================================================
TimeTracker printout (sec)                                Min           Avg           Max         Median          RMS         nEvts   
========================================================================================================================================
Full event                                             0.943785       2.51277       7.22943       2.50619       1.10428        100    
----------------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                                0.00142006    0.00255466    0.00807344     0.0025221    0.000825271      100    
reco:gaushit:GausHitFinder                             0.0164223     0.0597932     0.339435      0.0311629     0.0678183       100    
reco:spsolve:SpacePointSolver                         8.6255e-05     0.0986831      4.44284     0.00498139     0.492214        100    
reco:hitfd:DisambigFromSpacePoints                    0.000188709   0.00759842     0.099832     0.00123323     0.0170511       100    
reco:rns:RandomNumberSaver                             1.625e-05    2.42412e-05   0.000252539   2.04135e-05   2.33455e-05      100    
reco:pandora:StandardPandora                           0.833868       1.05058       2.72939      0.898952      0.376225        100    
reco:pandoraTrack:LArPandoraTrackCreation             0.000118396   0.000156033   0.00188696    0.000133093   0.000174826      100    
reco:pandoraShower:LArPandoraModularShowerCreation    0.000160195   0.000206729   0.00283193    0.000173159   0.000265615      100    
reco:pandoracalo:Calorimetry                          9.7797e-05    0.000122535    0.0011607    0.000106077   0.000105433      100    
reco:pandorapid:Chi2ParticleID                        3.4516e-05    4.35245e-05   0.000626993    3.685e-05    5.87199e-05      100    
reco:cvnmap:CVNMapper                                 2.0579e-05     0.014063      0.0279942     0.0170047    0.00791983       100    
reco:cvneva:CVNEvaluator                               1.555e-05     0.738741       2.49144      0.926638      0.432942        100    
reco:energyrecnumu:EnergyReco                         0.00210982    0.00606254     0.0197065    0.00541694    0.00441977       100    
reco:energyrecnue:EnergyReco                          0.00197331    0.00287568    0.00946616     0.0024707    0.00110258       100    
reco:energyrecnc:EnergyReco                           0.00190994    0.00277702    0.00864842    0.00239241    0.00106011       100    
reco:energyrecnumurange:EnergyReco                    0.00192782    0.00274737    0.00747189    0.00235653    0.000990436      100    
reco:energyrecnumumcs:EnergyReco                      0.00190881     0.003152      0.0164912     0.0023174     0.0025042       100    
reco:opdec:Deconvolution                               0.0316455      0.16679      0.384624      0.162616      0.0669895       100    
reco:ophitspe:OpHitFinderDeco                         0.00954224     0.0147779     0.0402478     0.0144106    0.00320918       100    
reco:opflash:OpFlashFinder                            0.000113776   0.00031078    0.00251695    0.000257255   0.000244583      100    
reco:opslicer:OpSlicer                                3.3544e-05     0.0119997     0.053836     0.00562604     0.0146526       100    
[art]:TriggerResults:TriggerResultInserter             9.378e-06    1.24854e-05   3.9916e-05    1.1643e-05    3.97356e-06      100    
end_path:out1:RootOutput                               3.046e-06    4.27501e-06   1.9728e-05     3.828e-06    2.38685e-06      100    
end_path:out1:RootOutput(write)                        0.0146372     0.327788      0.545582      0.391297      0.162094        100    
========================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 11619.1 MB
  Peak resident set size usage (VmHWM): 2024.59 MB
====================================================================================================
Art has completed and will exit with status 0.
=== End last 100 lines of lar log file ===
=== Generated output files ===
14065.77_dunegpschedd02.fnal.gov.logs.tgz
atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6405486_528_20231202T114028Z_gen_g4_detsim_hitreco__20240507T220006Z_reco2_graph_2025-08-02T_223854Z.log
atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6405486_528_20231202T114028Z_gen_g4_detsim_hitreco__20240507T220006Z_reco2_graph_2025-08-02T_223854Z.root
debugprod.log
graph_output_2025-08-02T_223854Z_9_ana_tree_hd.root
graph_output_2025-08-02T_223854Z_9_analysiseid.root
graph_output_2025-08-02T_223854Z_9_training1_CaloHitListU_graph.data
graph_output_2025-08-02T_223854Z_9_training1_CaloHitListV_graph.data
graph_output_2025-08-02T_223854Z_9_training1_CaloHitListW_graph.data
graph_output_2025-08-02T_223854Z_9_training2_CaloHitListU_graph.data
graph_output_2025-08-02T_223854Z_9_training2_CaloHitListV_graph.data
graph_output_2025-08-02T_223854Z_9_training2_CaloHitListW_graph.data
jobscript.log
justin-processed-pfns.txt
reco_hist.root
secondary_atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6405486_528_20231202T114028Z_gen_g4_detsim_hitreco__20240507T220006Z_reco2_graph_2025-08-02T_223854Z.log
third_atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6405486_528_20231202T114028Z_gen_g4_detsim_hitreco__20240507T220006Z_reco2_graph_2025-08-02T_223854Z.log
training1_CaloHitListU.csv
training1_CaloHitListV.csv
training1_CaloHitListW.csv
training2_CaloHitListU.csv
training2_CaloHitListV.csv
training2_CaloHitListW.csv
justIN time: 2025-08-04 14:17:51 UTC       justIN version: 01.04.00