
Jobsub ID 297956.21@dunegpschedd02.fnal.gov

Jobsub ID: 297956.21@dunegpschedd02.fnal.gov
Workflow ID: 12718
Stage ID: 2
User name: ykermaid@fnal.gov
Requested:
  Processors: 1
  GPU: No
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 86400 (24 hours)
Submitted time: 2026-02-04 00:12:27
Site: US_FNAL-FermiGrid
Entry: FNAL_GPGrid_ce04_mcore_op_duneonly
Last heartbeat: 2026-02-04 00:18:39
From worker node:
  Hostname: dunegli-8534387-0-fnpc23011.fnal.gov
  cpuinfo: AMD EPYC 7543 32-Core Processor
  OS release: Scientific Linux release 7.9 (Nitrogen)
  Processors: 1
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 172800 (48 hours)
  GPU:
  Inner Apptainer?: True
Job state: jobscript_error
Started: 2026-02-04 00:18:09
Input files: usertests:np02vd_raw_run042422_0116_df-s02-d1_dw_0_20260203T161728_reco_stage0_20260203T234122_offline.root
Jobscript:
  Exit code: 1
  Real time: 0m (0s)
  CPU time: 0m (0s = 0%)
  Max RSS bytes: 0 (0 MiB)
Outputting started:
Output files:
Finished: 2026-02-04 00:18:39
Saved logs: justin-logs:297956.21-dunegpschedd02.fnal.gov.logs.tgz

Jobscript log (last 10,000 characters)

e Name:  * Type: art::Assns<recob::Shower, recob::Track, void>*              *
* Association Name:  ShowerTrackHitAssn       * Instance Name:  * Type: art::Assns<recob::Track, recob::Hit, void>*                 *
* Association Name:  clusterAssociationsbase  * Instance Name:  * Type: art::Assns<recob::Shower, recob::Cluster, void>*            *
* Association Name:  hitAssociationsbase      * Instance Name:  * Type: art::Assns<recob::Shower, recob::Hit, void>*                *
* Association Name:  pfShowerAssociationsbase * Instance Name:  * Type: art::Assns<recob::Shower, recob::PFParticle, void>*         *
* Association Name:  spShowerAssociationsbase * Instance Name:  * Type: art::Assns<recob::Shower, recob::SpacePoint, void>*         *
*************************************************************************************************************************************
      [INFO]  <ProcessDriver::configure::L174> Start looping process list to instantiate processes
    [NORMAL]  <ProcessDriver::configure> Instantiating Process ID=0 Type: SuperaBBoxInteraction w/ Name: SuperaBBoxInteraction
    [NORMAL]  <ProcessDriver::configure> Instantiating Process ID=1 Type: SuperaSpacePoint w/ Name: SuperaSpacePoint
     [DEBUG]  <ProcessDriver::process_names> ProcessDriver.cxx::L72 Called
     [DEBUG]  <ProcessDriver::override_output_file> ProcessDriver.cxx::L45 Called
04-Feb-2026 00:18:25 UTC  Initiating request to open input file "root://fndcadoor.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/ea/ff/np02vd_raw_run042422_0116_df-s02-d1_dw_0_20260203T161728_reco_stage0_20260203T234122_offline.root"
04-Feb-2026 00:18:26 UTC  Opened input file "root://fndcadoor.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/ea/ff/np02vd_raw_run042422_0116_df-s02-d1_dw_0_20260203T161728_reco_stage0_20260203T234122_offline.root"
     [DEBUG]  <ProcessDriver::initialize> ProcessDriver.cxx::L216 Called
      [INFO]  <ProcessDriver::initialize::L224> Initializing IO 
     [DEBUG]  <IOManager::initialize> IOManager.cxx::L131 start
      [INFO]  <IOManager::initialize::L136> Opening an output file: larcv_stage1_20260204_001825_475175.root
      [INFO]  <ProcessDriver::initialize::L250> Initializing: SuperaBBoxInteraction
      [INFO]  <ProcessDriver::initialize::L250> Initializing: SuperaSpacePoint
      [INFO]  <ProcessDriver::initialize::L258> Preparing access index vector
     [DEBUG]  <ProcessDriver::process_names> ProcessDriver.cxx::L72 Called
     [DEBUG]  <ProcessDriver::process_ptr> ProcessDriver.cxx::L84 Called
     [DEBUG]  <ProcessDriver::process_names> ProcessDriver.cxx::L72 Called
     [DEBUG]  <ProcessDriver::process_ptr> ProcessDriver.cxx::L84 Called
     [DEBUG]  <ProcessDriver::process_names> ProcessDriver.cxx::L72 Called
db: runtime1770135447
Begin processing the 1st record. run: 42422 subRun: 1 event: 18805 at 04-Feb-2026 00:18:26 UTC
PandoraContentApi::GetList(*this, m_inputHitListName, pCaloHitList) return STATUS_CODE_NOT_INITIALIZED
    in function: GetVolumeIdToHitListMap
    in file:     /scratch/workspace/build-larsoft/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/ALMA9/build/larpandoracontent/v04_18_01-buildFW/src/larpandoracontent/LArControlFlow/MasterAlgorithm.cc line#: 271
this->GetVolumeIdToHitListMap(volumeIdToHitListMap) return STATUS_CODE_NOT_INITIALIZED
    in function: Run
    in file:     /scratch/workspace/build-larsoft/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/ALMA9/build/larpandoracontent/v04_18_01-buildFW/src/larpandoracontent/LArControlFlow/MasterAlgorithm.cc line#: 165
iter->second->Run() throw STATUS_CODE_NOT_INITIALIZED
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larsoft/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/ALMA9/build/pandora/v04_17_05/src/pandora-v04-17-05/PandoraSDK-v04-01-00/src/Api/PandoraContentApiImpl.cc line#: 263
Failure in algorithm Alg0003, LArMaster, STATUS_CODE_NOT_INITIALIZED
     [DEBUG]  <ProcessDriver::process_ptr> ProcessDriver.cxx::L84 Called
     [DEBUG]  <ProcessDriver::process_ptr> ProcessDriver.cxx::L84 Called
Attempted to load data: cluster3d
      [INFO]  <IOManager::set_id::L569> Request to set event id: 0042422_00001_018805
     [DEBUG]  <ProcessDriver::process_entry> ProcessDriver.cxx::L314 Called
      [INFO]  <SuperaBBoxInteraction::process::L173> 3D Meta:X range: -514.08 => 514.08 ... 1344 bins
Y range: -342.72 => 342.72 ... 896 bins
Z range: -21.71 => 321.01 ... 448 bins

     [DEBUG]  <IOManager::get_data> IOManager.cxx::L499 start
     [DEBUG]  <IOManager::producer_id> IOManager.cxx::L479 start
     [DEBUG]  <IOManager::register_producer> IOManager.cxx::L178 start
      [INFO]  <IOManager::register_producer::L184> Requested to register a producer: reco (TTree sparse3d_reco_tree)
      [INFO]  <IOManager::register_producer::L201> It is a new producer registration (key=0)
      [INFO]  <IOManager::register_producer::L225> kWRITE/kBOTH mode: creating an output TTree
     [DEBUG]  <IOManager::register_producer> IOManager.cxx::L226 Branch name: sparse3d_reco_branch data pointer: 0xd55b380(0/1000)
     [DEBUG]  <IOManager::register_producer> IOManager.cxx::L230 Created TTree @ 0xd95c300 ... TBranch @ 0x5786f20
    [NORMAL]  <IOManager::get_data> Created TTree sparse3d_reco_tree (id=0) w/ 0 entries...
     [DEBUG]  <IOManager::get_data> IOManager.cxx::L519 start
     [DEBUG]  <IOManager::get_data> IOManager.cxx::L499 start
     [DEBUG]  <IOManager::producer_id> IOManager.cxx::L479 start
     [DEBUG]  <IOManager::get_data> IOManager.cxx::L519 start
      [INFO]  <SuperaSpacePoint::process::L83> Voxel3DMeta: X range: -514.08 => 514.08 ... 1344 bins
Y range: -342.72 => 342.72 ... 896 bins
Z range: -21.71 => 321.01 ... 448 bins
04-Feb-2026 00:18:27 UTC  Opened output file with pattern "%ifb_reco_stage1_%tc_offline.root"
     [DEBUG]  <ProcessDriver::finalize> ProcessDriver.cxx::L424 called
      [INFO]  <ProcessDriver::finalize::L427> Finalizing: SuperaBBoxInteraction
      [INFO]  <ProcessDriver::finalize::L427> Finalizing: SuperaSpacePoint
      [INFO]  <ProcessDriver::finalize::L434> Compiling time profile...
      [INFO]  <ProcessDriver::finalize::L457> Finalizing IO...
     [DEBUG]  <IOManager::finalize> IOManager.cxx::L611 start
    [NORMAL]  <IOManager::finalize> Writing sparse3d_reco_tree with 0 entries
    [NORMAL]  <IOManager::finalize> Closing output file
      [INFO]  <IOManager::finalize::L639> Deleting data pointers
     [DEBUG]  <IOManager::reset> IOManager.cxx::L647 start
      [INFO]  <ProcessDriver::finalize::L459> Resetting...
     [DEBUG]  <ProcessDriver::reset> ProcessDriver.cxx::L25 Called
     [DEBUG]  <IOManager::reset> IOManager.cxx::L647 start
04-Feb-2026 00:18:30 UTC  Closed input file "root://fndcadoor.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/ea/ff/np02vd_raw_run042422_0116_df-s02-d1_dw_0_20260203T161728_reco_stage0_20260203T234122_offline.root"
Malformed TimeTracker database.  The TimeEvent table is empty, but
the TimeModule table is not.  This can happen if an exception has
been thrown from a module while processing the first event.  Any
saved database file is suspect and should not be used.

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 2035.18 MB
  Peak resident set size usage (VmHWM): 925.237 MB
  Details saved in: 'mem.db'
====================================================================================================
%MSG-s ArtException:  PostEndJob 04-Feb-2026 00:18:30 UTC ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- EventProcessorFailure BEGIN
    EndPathExecutor: an exception occurred during current event processing
    ---- ScheduleExecutionFailure BEGIN
      Path: ProcessingStopped.
      ---- ProductNotFound BEGIN
        Found zero products matching all selection criteria
          C++ type: std::vector<recob::SpacePoint>
          Module label: 'cluster3d'
          Product instance name: ''
          Process name: (empty)
        The above exception was thrown while processing module LArSoftSuperaDriver/supera run: 42422 subRun: 1 event: 18805
      ---- ProductNotFound END
      Exception going through path end_path
    ---- ScheduleExecutionFailure END
  ---- EventProcessorFailure END
---- EventProcessorFailure END
---- FatalRootError BEGIN
  Fatal Root Error: TTree::SetEntries
  Tree branches have different numbers of entries, eg EventAuxiliary has 0 entries while recob::SpacePoints_cluster3d_Vertex_pdvdofflinestage0. has 28 entries.
  ROOT severity: 2000
---- FatalRootError END
%MSG
Art has completed and will exit with status 1.
Error in reco1
justIN time: 2026-02-04 01:40:04 UTC       justIN version: 01.06.00