Jobsub ID 244298.1@dunegpschedd01.fnal.gov

Jobsub ID: 244298.1@dunegpschedd01.fnal.gov
Workflow ID: 9811
Stage ID: 1
User name: ykermaid@fnal.gov
Requested:
  Processors: 1
  GPU: No
  RSS bytes: 6815744000 (6500 MiB)
  Wall seconds limit: 80000 (22 hours)
Submitted time: 2025-11-06 12:36:35
Site: US_UCSD
Entry: CMSHTPC_T2_US_UCSD_gw7
Last heartbeat: 2025-11-06 12:44:11
From worker node:
  Hostname: mh-7763-5.t2.ucsd.edu
  cpuinfo: AMD EPYC 7763 64-Core Processor
  OS release: Scientific Linux release 7.9 (Nitrogen)
  Processors: 1
  RSS bytes: 7864320000 (7500 MiB)
  Wall seconds limit: 171000 (47 hours)
  GPU:
  Inner Apptainer?: True
Job state: jobscript_error
Started: 2025-11-06 12:37:16
Input files: vd-protodune-det-reco:np02vd_raw_run040346_0031_df-s04-d0_dw_0_20251101T120632_reco_stage1_20251101T134004_keepup.root
Jobscript:
  Exit code: 1
  Real time: 0m (0s)
  CPU time: 0m (0s = 0%)
  Max RSS bytes: 0 (0 MiB)
Outputting started:
Output files:
Finished: 2025-11-06 12:44:11
Saved logs: justin-logs:244298.1-dunegpschedd01.fnal.gov.logs.tgz
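
The parenthesised values in the table above follow from the raw numbers by simple unit conversion. The sketch below reproduces them, assuming 1 MiB = 1024 * 1024 bytes and assuming the page truncates hours to whole numbers (this matches both rows but is not stated on the page). The helper names are hypothetical and are not part of justIN.

```python
MIB = 1024 * 1024  # bytes per MiB


def bytes_to_mib(n_bytes):
    """Convert a byte count to whole MiB."""
    return n_bytes // MIB


def seconds_to_whole_hours(seconds):
    """Convert seconds to hours, dropping any fractional hour (assumed display rule)."""
    return seconds // 3600


# Values copied from the "Requested" and "From worker node" rows.
requested = (bytes_to_mib(6815744000), seconds_to_whole_hours(80000))
worker = (bytes_to_mib(7864320000), seconds_to_whole_hours(171000))
print(requested, worker)
```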

Jobscript log (last 10,000 characters)

SETUP_LARDATAALG=lardataalg v10_01_04 -f Linux64bit+3.10-2.17 -z /cvmfs/larsoft.opensciencegrid.org/products -q e26:prof
LARCOREALG_INC=/cvmfs/larsoft.opensciencegrid.org/products/larcorealg/v10_00_03/include
G4PARTICLEHPDATA=/cvmfs/larsoft.opensciencegrid.org/products/g4tendl/v1_3_2/G4TENDL1.3.2
SETUP_LARANA=larana v10_01_01 -f Linux64bit+3.10-2.17 -z /cvmfs/larsoft.opensciencegrid.org/products -q e26:prof
SETUP_LARSOFT=larsoft v10_12_00 -f Linux64bit+3.10-2.17 -z /cvmfs/larsoft.opensciencegrid.org/products -q e26:prof
SETUP_CETLIB=cetlib v3_18_02 -f Linux64bit+3.10-2.17 -z /cvmfs/larsoft.opensciencegrid.org/products -q e26:prof
SETUP_FHICLCPP=fhiclcpp v4_18_04 -f Linux64bit+3.10-2.17 -z /cvmfs/larsoft.opensciencegrid.org/products -q e26:prof
OPENBLAS_INC=/cvmfs/larsoft.opensciencegrid.org/products/openblas/v0_3_23/Linux64bit+3.10-2.17-e26/include
PYTHON_INCLUDE=/cvmfs/larsoft.opensciencegrid.org/products/python/v3_9_15/Linux64bit+3.10-2.17/include/python3.9
_=/usr/bin/env
Justin specific env vars
JUSTIN_SCOPE=usertests
JUSTIN_WALL_SECONDS=80000
JUSTIN_ALLOCATOR=https://justin-allocator-fnal.dune.hep.ac.uk/api/allocator/
JUSTIN_SITE_NAME=US_UCSD
JUSTIN_SAM_WEB_URI=https://justin.dune.hep.ac.uk/api/samweb/244298.1@dunegpschedd01.fnal.gov/R-FNHGqcIBDcryqhU597CwahjF8njdR9otoblWw28WikCCtIrIFgVngQbwXBkjH0o0Cfk8HITsixYoseYDO7Fg
JUSTIN_PROCESSORS=1
JUSTIN_MQL=files where namespace=vd-protodune-det-reco and 040346 in core.runs and core.data_tier=full-reconstructed limit 10
JUSTIN_JOBSCRIPT_SECRET=R-FNHGqcIBDcryqhU597CwahjF8njdR9otoblWw28WikCCtIrIFgVngQbwXBkjH0o0Cfk8HITsixYoseYDO7Fg
JUSTIN_PATH=/home
JUSTIN_TIMESTAMP=1762432636
JUSTIN_JOBSUB_ID=244298.1@dunegpschedd01.fnal.gov
JUSTIN_WORKFLOW_ID=9811
JUSTIN_STAGE_ID=1
JUSTIN_RSS_MIB=6500
Will use justin-get-file
pfn: root://fndcadoor.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/vd-protodune-det-reco/8f/65/np02vd_raw_run040346_0031_df-s04-d0_dw_0_20251101T120632_reco_stage1_20251101T134004_keepup.root
did: vd-protodune-det-reco:np02vd_raw_run040346_0031_df-s04-d0_dw_0_20251101T120632_reco_stage1_20251101T134004_keepup.root
Output file: np02vd_raw_run040346_0031_df-s04-d0_dw_0_20251101T120632_reco_stage1_20251101T134004_keepup_20251106T123721Z_ana.root
Running singlehit
Error in ntuple production
%MSG-i MF_INIT_OK:  Early 06-Nov-2025 04:37:27 PST JobSetup
Messagelogger initialization complete.
%MSG
Info in <TGeoManager::Import>: Reading geometry from file: /cvmfs/dune.opensciencegrid.org/products/dune/dunecore/v10_12_01d00/gdml/protodunevd_v5_ggd.gdml
Info in <TGeoManager::TGeoManager>: Geometry GDMLImport, Geometry imported from GDML created
Error: Unsupported GDML Tag Used :gdml_simple_extension. Please Check Geometry/Schema.
Info in <TGeoManager::SetTopVolume>: Top volume is volWorld. Master volume is volWorld
Info in <TGeoNavigator::BuildCache>: --- Maximum geometry depth set to 100
Info in <TGeoManager::CheckGeometry>: Fixing runtime shapes...
Info in <TGeoManager::CheckGeometry>: ...Nothing to fix
Info in <TGeoManager::CloseGeometry>: Counting nodes...
Info in <TGeoManager::Voxelize>: Voxelizing...
Info in <TGeoManager::CloseGeometry>: Building cache...
Info in <TGeoManager::CountLevels>: max level = 5, max placements = 292
Info in <TGeoManager::CloseGeometry>: 16484 nodes/ 2469 volume UID's in Geometry imported from GDML
Info in <TGeoManager::CloseGeometry>: ----------------modeler ready----------------
%MSG-i AuxDetGeometryCore:  Early 06-Nov-2025 04:37:34 PST JobSetup
New detector geometry loaded from
	/cvmfs/dune.opensciencegrid.org/products/dune/dunecore/v10_12_01d00/gdml/protodunevd_v5_ggd.gdml
%MSG
%MSG-i GeometryCore:  Early 06-Nov-2025 04:37:34 PST JobSetup
Sorting volumes...
%MSG
%MSG-i GeometryCore:  Early 06-Nov-2025 04:37:34 PST JobSetup
New detector geometry loaded from
	/cvmfs/dune.opensciencegrid.org/products/dune/dunecore/v10_12_01d00/gdml/protodunevd_v5_ggd.gdml
%MSG
%MSG-i CRPWireReadoutGeom:  Early 06-Nov-2025 04:37:34 PST JobSetup
Initializing CRPWireReadoutGeom channel mapping algorithm.
%MSG
%MSG-i CRPWireReadoutGeom:  Early 06-Nov-2025 04:37:34 PST JobSetup
Build readout planes for 1 16 3
%MSG
%MSG-i CRPWireReadoutGeom:  Early 06-Nov-2025 04:37:34 PST JobSetup
Counted 12288 channels.
%MSG
GeoApaChannelGroupService::ctor: Group 0 (apa0) has 3072 channels from 3/3 readout planes.
GeoApaChannelGroupService::ctor: Group 1 (apa1) has 3072 channels from 3/3 readout planes.
GeoApaChannelGroupService::ctor: Group 2 (apa2) has 3072 channels from 3/3 readout planes.
GeoApaChannelGroupService::ctor: Group 3 (apa3) has 3072 channels from 3/3 readout planes.
%MSG-i SimpleChannelStatusService:  Early 06-Nov-2025 04:37:34 PST JobSetup
Loaded from configuration:
  - 0 bad channels
  - 0 noisy channels
  - largest channel ID: 12287, largest present: 12287
%MSG
%MSG-i setupProvider<DetectorPropertiesStandard>:  Early  06-Nov-2025 04:37:34 PST JobSetup
Asked to ignore 1 keys: 'InheritNumberTimeSamples'
%MSG
PDSP Channel Map: Building RCE TPC wiremap from file protoDUNETPCChannelMap_RCE_v4.txt
PDSP Channel Map: Building FELIX TPC wiremap from file protoDUNETPCChannelMap_RCE_v4.txt
PDSP Channel Map: Building SSP channel map from file protoDUNESSPChannelMap_v1.txt
%MSG-i SignalShapingServiceDUNE:  Early 06-Nov-2025 04:37:34 PST JobSetup
Getting Filter from .fcl file
%MSG
%MSG-i SignalShapingServiceDUNE:  Early 06-Nov-2025 04:37:34 PST JobSetup
 using the field response provided from a .root file 
%MSG
Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_U (Potential memory leak).
Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_V (Potential memory leak).
Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_Y (Potential memory leak).
06-Nov-2025 04:37:35 PST  Initiating request to open input file "root://fndcadoor.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/vd-protodune-det-reco/8f/65/np02vd_raw_run040346_0031_df-s04-d0_dw_0_20251101T120632_reco_stage1_20251101T134004_keepup.root"
06-Nov-2025 04:37:39 PST  Opened input file "root://fndcadoor.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/vd-protodune-det-reco/8f/65/np02vd_raw_run040346_0031_df-s04-d0_dw_0_20251101T120632_reco_stage1_20251101T134004_keepup.root"
db: runtime1761998789
Begin processing the 1st record. run: 40346 subRun: 1 event: 2864 at 06-Nov-2025 04:37:41 PST
Begin processing the 2nd record. run: 40346 subRun: 1 event: 2868 at 06-Nov-2025 04:38:35 PST
Begin processing the 3rd record. run: 40346 subRun: 1 event: 2872 at 06-Nov-2025 04:40:31 PST
Begin processing the 4th record. run: 40346 subRun: 1 event: 2876 at 06-Nov-2025 04:41:30 PST
Begin processing the 5th record. run: 40346 subRun: 1 event: 2880 at 06-Nov-2025 04:42:30 PST
Begin processing the 6th record. run: 40346 subRun: 1 event: 2884 at 06-Nov-2025 04:43:51 PST
06-Nov-2025 04:43:51 PST  Closed input file "root://fndcadoor.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/vd-protodune-det-reco/8f/65/np02vd_raw_run040346_0031_df-s04-d0_dw_0_20251101T120632_reco_stage1_20251101T134004_keepup.root"

====================================================================================================================
TimeTracker printout (sec)            Min           Avg           Max         Median          RMS         nEvts   
====================================================================================================================
Full event                         0.0482044      61.5859       115.773       59.3305       34.5419         6     
--------------------------------------------------------------------------------------------------------------------
source:RootInput(read)            0.000261259    0.0163849     0.0483969    0.000557039    0.0225683        6     
end_path:ana:SingleHit              54.171        73.8832       115.772       60.236        22.8757         5     
====================================================================================================================
%MSG-i NuRandomService:  SingleHit:ana@EndJob 06-Nov-2025 04:43:51 PST  ModuleEndJob

Summary of seeds computed by the NuRandomService
Random policy: 'random'
  master seed: 115565251
  seed within: [ 1 ; 900000000 ]

%MSG

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 1756.91 MB
  Peak resident set size usage (VmHWM): 752.517 MB
  Details saved in: 'mem.db'
====================================================================================================

TrigReport ---------- Event summary -------------
TrigReport Events total = 6 passed = 6 failed = 0

TrigReport ---------- Modules in End-path ----------
TrigReport        Run    Success      Error Name
TrigReport          6          5          1 ana

TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 357.644962 Real = 370.239934

MemReport  ---------- Memory summary [base-10 MB] ------
MemReport  VmPeak = 1756.91 VmHWM = 752.517

%MSG-s ArtException:  PostEndJob 06-Nov-2025 04:43:51 PST ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- EventProcessorFailure BEGIN
    EndPathExecutor: an exception occurred during current event processing
    ---- ScheduleExecutionFailure BEGIN
      Path: ProcessingStopped.
      ---- ProductNotFound BEGIN
        Found zero products matching all selection criteria
          C++ type: std::vector<recob::Hit>
          Module label: 'hitpdune'
          Product instance name: ''
          Process name: (empty)
        The above exception was thrown while processing module SingleHit/ana run: 40346 subRun: 1 event: 2884
      ---- ProductNotFound END
      Exception going through path end_path
    ---- ScheduleExecutionFailure END
  ---- EventProcessorFailure END
---- EventProcessorFailure END
%MSG
Art has completed and will exit with status 1.
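
The exception block above says that module SingleHit/ana asked for a std::vector<recob::Hit> product with module label 'hitpdune' and found none in run 40346, subRun 1, event 2884. When scanning many such logs, the same fields can be pulled out mechanically. The following is a minimal sketch using only Python's standard `re` module. The function name `parse_product_not_found` and the dictionary keys are hypothetical, not part of art or justIN, and the patterns assume the message layout shown in this log.

```python
import re

# Excerpt copied from the art exception block in the log above.
LOG_EXCERPT = """\
      ---- ProductNotFound BEGIN
        Found zero products matching all selection criteria
          C++ type: std::vector<recob::Hit>
          Module label: 'hitpdune'
          Product instance name: ''
          Process name: (empty)
        The above exception was thrown while processing module SingleHit/ana run: 40346 subRun: 1 event: 2884
      ---- ProductNotFound END
"""

# One pattern per field; the layout is assumed from this log only.
PATTERNS = {
    "cpp_type": r"C\+\+ type:\s*(.+)",
    "module_label": r"Module label:\s*'([^']*)'",
    "instance_name": r"Product instance name:\s*'([^']*)'",
    "thrown_in": r"while processing module\s+(\S+)",
    "run": r"run:\s*(\d+)",
    "subrun": r"subRun:\s*(\d+)",
    "event": r"event:\s*(\d+)",
}


def parse_product_not_found(text):
    """Return a dict describing the first ProductNotFound block, or None if absent."""
    block = re.search(
        r"---- ProductNotFound BEGIN\s*(.*?)---- ProductNotFound END",
        text,
        re.DOTALL,
    )
    if block is None:
        return None
    body = block.group(1)
    fields = {}
    for key, pattern in PATTERNS.items():
        match = re.search(pattern, body)
        fields[key] = match.group(1).strip() if match else None
    return fields


info = parse_product_not_found(LOG_EXCERPT)
print(info)
```

The missing label and C++ type are the values to compare against the products actually stored in the input file.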
justIN time: 2026-02-10 15:05:46 UTC       justIN version: 01.06.00