justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 20681.0@dunegpschedd01.fnal.gov

Jobsub ID20681.0@dunegpschedd01.fnal.gov
Workflow ID270
Stage ID1
User nameichong@fnal.gov
HTCondor Groupgroup_dune
RequestedProcessors1
GPUNo
RSS bytes4194304000 (4000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2025-08-03 17:54:49
SiteNL_NIKHEF
EntryVIRGO_NL_NIKHEF_juk
Last heartbeat2025-08-03 17:59:31
From worker nodeHostnamewn-pep-013.farm.nikhef.nl
cpuinfoIntel(R) Xeon(R) Gold 6148 CPU @ 2.40GHz
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit129600 (36 hours)
GPU
Inner Apptainer?True
Job statejobscript_error
Started2025-08-03 17:55:42
Input filesfardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74506302_149_20231202T180349Z_gen_g4_detsim_hitreco__20240508T062140Z_reco2.root
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Max RSS bytes0 (0 MiB)
Outputting started 
Output files
Finished2025-08-03 17:59:31
Saved logsjustin-logs:20681.0-dunegpschedd01.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

ke PFParticle with ID 1
Begin processing the 100th record. run: 74506302 subRun: 1 event: 15000 at 03-Aug-2025 19:59:13 CEST
Analysing.

Warning: there was no track found for track-like PFParticle with ID 3
03-Aug-2025 19:59:14 CEST  Closed input file "root://se1.farm.particle.cz:1094//dune/RSE/fardet-hd/be/09/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74506302_149_20231202T180349Z_gen_g4_detsim_hitreco__20240508T062140Z_reco2.root"

========================================================================================================================
TimeTracker printout (sec)                Min           Avg           Max         Median          RMS         nEvts   
========================================================================================================================
Full event                             0.456131      0.813676       1.40434      0.818625      0.147009        100    
------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                 0.014378      0.0217252     0.0367977     0.0149765    0.00736889       100    
end_path:analysistree:AnalysisTree      0.42864      0.791828       1.38981      0.794332      0.147664        100    
========================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 7462.41 MB
  Peak resident set size usage (VmHWM): 1109.87 MB
====================================================================================================
Art has completed and will exit with status 0.
=== End last 100 lines of third lar log file ===
=== Start last 100 lines of lar log file ===
StandardRawDigitPrepService::ctor:       DoDeconvolution: 1
StandardRawDigitPrepService::ctor:  DoPedestalAdjustment: 0
StandardRawDigitPrepService::ctor:                 DoROI: 1
StandardRawDigitPrepService::ctor:               DoWires: 1
StandardRawDigitPrepService::ctor:                DoDump: 0
StandardRawDigitPrepService::ctor:  DoIntermediateStates: 0
StandardRawDigitPrepService::ctor: No display tools.
Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_U (Potential memory leak).
Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_V (Potential memory leak).
Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_Y (Potential memory leak).
Deconvolution::BuildExtraFilter sigma is 63.7044
*************************************************************************************************************************************
Unique Ptrs that are added to the event
*************************************************************************************************************************************
* Data Product Name: InitialTrack             * Instance Name:  * Type: std::vector<recob::Track, std::allocator<recob::Track> >*   *
* Data Product Name: ShowerPCA                * Instance Name:  * Type: std::vector<recob::PCAxis, std::allocator<recob::PCAxis> >* *
* Data Product Name: shower                   * Instance Name:  * Type: std::vector<recob::Shower, std::allocator<recob::Shower> >* *
* Association Name:  PFParticlePCAxisAssn     * Instance Name:  * Type: art::Assns<recob::PFParticle, recob::PCAxis, void>*         *
* Association Name:  ShowerPCAxisAssn         * Instance Name:  * Type: art::Assns<recob::Shower, recob::PCAxis, void>*             *
* Association Name:  ShowerTrackAssn          * Instance Name:  * Type: art::Assns<recob::Shower, recob::Track, void>*              *
* Association Name:  ShowerTrackHitAssn       * Instance Name:  * Type: art::Assns<recob::Track, recob::Hit, void>*                 *
* Association Name:  clusterAssociationsbase  * Instance Name:  * Type: art::Assns<recob::Shower, recob::Cluster, void>*            *
* Association Name:  hitAssociationsbase      * Instance Name:  * Type: art::Assns<recob::Shower, recob::Hit, void>*                *
* Association Name:  pfShowerAssociationsbase * Instance Name:  * Type: art::Assns<recob::Shower, recob::PFParticle, void>*         *
* Association Name:  spShowerAssociationsbase * Instance Name:  * Type: art::Assns<recob::Shower, recob::SpacePoint, void>*         *
*************************************************************************************************************************************
03-Aug-2025 19:56:22 CEST  Initiating request to open input file "root://se1.farm.particle.cz:1094//dune/RSE/fardet-hd/be/09/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74506302_149_20231202T180349Z_gen_g4_detsim_hitreco__20240508T062140Z_reco2.root"
03-Aug-2025 19:56:24 CEST  Opened input file "root://se1.farm.particle.cz:1094//dune/RSE/fardet-hd/be/09/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74506302_149_20231202T180349Z_gen_g4_detsim_hitreco__20240508T062140Z_reco2.root"
Begin processing the 1st record. run: 74506302 subRun: 1 event: 14901 at 03-Aug-2025 19:56:26 CEST
0 X, 0 U, 0 V bad channels
Finding XUV coincidences...
C:0 T:2 184 XUs and 95 XVs -> 50 XUVs
50 XUVs total
27 collection wire objects
50 potential space points
Neighbour search...
1062 tests to find 870 neighbours
Iterating with no regularization...
Begin: 3.34795e+07
0 3.33072e+07
1 3.32903e+07
Now with regularization...
Begin: 3.23527e+07
0 3.23491e+07
---MC-PARTICLE-MONITORING-----------------------------------------------------------------------

BeamNeutrinos: 

--Primary 0, MCPDG 13, Energy 0.182582, Dist. 20.9382, nMCHits 115 (34, 51, 30)
MCPDG 13, Energy 0.182582, Dist. 20.9382, nMCHits 49 (15, 25, 9)
\_ MCPDG 11, Energy 0.041382, Dist. 7.97612, nMCHits 66 (19, 26, 21)

--Primary 1, MCPDG 2212, Energy 1.08775, Dist. 13.8814, nMCHits 48 (4, 27, 17)
MCPDG 2212, Energy 1.08775, Dist. 13.8814, nMCHits 48 (4, 27, 17)
------------------------------------------------------------------------------------------------
Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Atmos_1_U_v04_03_00.pt'
Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Atmos_1_V_v04_03_00.pt'
Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Atmos_1_W_v04_03_00.pt'
Operating in training mode.
The eid is 0
Graph saved to training1_CaloHitListW_graph.data
Size of file training1_CaloHitListW_graph.data is 1228 bytes.
The eid is 0
Graph saved to training1_CaloHitListU_graph.data
Size of file training1_CaloHitListU_graph.data is 1068 bytes.
The eid is 0
Graph saved to training1_CaloHitListV_graph.data
Size of file training1_CaloHitListV_graph.data is 1996 bytes.
Operating in inference mode.
Operating in training mode.
The eid is -1
Graph saved to training2_CaloHitListW_graph.data
Size of file training2_CaloHitListW_graph.data is 1228 bytes.
The eid is -1
Graph saved to training2_CaloHitListU_graph.data
Size of file training2_CaloHitListU_graph.data is 1068 bytes.
The eid is -1
Graph saved to training2_CaloHitListV_graph.data
Size of file training2_CaloHitListV_graph.data is 1996 bytes.
Boundary wire vector sizes: 43, 81, 50
minwire 0: 295
minwire 1: 2681
minwire 2: 4
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max wires due to vertex determination failure: 2379, 2878
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
03-Aug-2025 19:56:29 CEST  Opened output file with pattern "atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74506302_149_20231202T180349Z_gen_g4_detsim_hitreco__20240508T062140Z_reco2_graph_2025-08-03T_175546Z.root"
03-Aug-2025 19:56:42 CEST  Closed input file "root://se1.farm.particle.cz:1094//dune/RSE/fardet-hd/be/09/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74506302_149_20231202T180349Z_gen_g4_detsim_hitreco__20240508T062140Z_reco2.root"
Malformed TimeTracker database.  The TimeEvent table is empty, but
the TimeModule table is not.  This can happen if an exception has
been thrown from a module while processing the first event.  Any
saved database file is suspect and should not be used.

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 8588.53 MB
  Peak resident set size usage (VmHWM): 1686.2 MB
====================================================================================================
=== End last 100 lines of lar log file ===
=== Generated output files ===
20681.0_dunegpschedd01.fnal.gov.logs.tgz
RootOutput-3106-ccf8-c0ef-5460.root
TFileService-9ac9-6198-3067-f210.root
ana_tree_hd.root
analysiseid.root
atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74506302_149_20231202T180349Z_gen_g4_detsim_hitreco__20240508T062140Z_reco2_graph_2025-08-03T_175546Z.log
debugprod.log
jobscript.log
secondary_atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74506302_149_20231202T180349Z_gen_g4_detsim_hitreco__20240508T062140Z_reco2_graph_2025-08-03T_175546Z.log
third_atmnu_max_weighted_randompolicy_dune10kt_1x2x6_74506302_149_20231202T180349Z_gen_g4_detsim_hitreco__20240508T062140Z_reco2_graph_2025-08-03T_175546Z.log
training1_CaloHitListU.csv
training1_CaloHitListU_graph.data
training1_CaloHitListV.csv
training1_CaloHitListV_graph.data
training1_CaloHitListW.csv
training1_CaloHitListW_graph.data
training2_CaloHitListU.csv
training2_CaloHitListU_graph.data
training2_CaloHitListV.csv
training2_CaloHitListV_graph.data
training2_CaloHitListW.csv
training2_CaloHitListW_graph.data
justIN time: 2025-08-04 16:20:31 UTC       justIN version: 01.04.00