justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 302096.112@dunegpschedd01.fnal.gov

Jobsub ID302096.112@dunegpschedd01.fnal.gov
Workflow ID12511
Stage ID1
User nameimawby@fnal.gov
RequestedProcessors1
GPUNo
RSS bytes1048576000 (1000 MiB)
Wall seconds limit7200 (2 hours)
Submitted time2026-01-29 10:37:51
SiteUK_Edinburgh
EntryDUNE_UK_SGridECDF_ce1_multicore
Last heartbeat2026-01-29 10:49:41
From worker nodeHostnamenode2b06.ecdf.ed.ac.uk
cpuinfoIntel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes1048576000 (1000 MiB)
Wall seconds limit171000 (47 hours)
GPU
Inner Apptainer?True
Job statefinished
Started2026-01-29 10:39:36
Input filesfardet-hd:nu_dune10kt_1x2x6_1098_917_20230826T014306Z_gen_g4_detsim_hitreco__20240229T181033Z_reco2.root
JobscriptExit code0
Real time9m (586s)
CPU time7m (436s = 74%)
Max RSS bytes1343561728 (1281 MiB)
Outputting started2026-01-29 10:49:23
Output fileshttps://fndcadoor.fnal.gov:2880/dune/scratch/users/imawby/kalmanAlg_CHECK_nu_0/fnal/12511/1/001/ClusterValidation_WithAlg_nu_dune10kt_1x2x6_1098_917_20230826T014306Z_gen_g4_detsim_hitreco__20240229T181033Z_reco2.root
Finished2026-01-29 10:49:41
Saved logsjustin-logs:302096.112-dunegpschedd01.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

+0000][Debug  ][ExDbgMsg          ][ 1261] [msg: 0x19ea31c0] Assigned MsgHandler: 0x17e03900.
[2026-01-29 10:49:21.724254 +0000][Debug  ][ExDbgMsg          ][ 1261] [handler: 0x17e03900] Removed MsgHandler: 0x17e03900 from the in-queue.
[2026-01-29 10:49:21.724283 +0000][Debug  ][ExDbgMsg          ][ 1261] [heplns140.pp.rl.ac.uk:50533] Calling MsgHandler: 0x17e03900 (message: kXR_read (handle: 0x00000000, offset: 164968, size: 132) ) with status: [SUCCESS] .
[2026-01-29 10:49:21.724299 +0000][Debug  ][ExDbgMsg          ][ 1261] [heplns140.pp.rl.ac.uk:50533] Destroying MsgHandler: 0x17e03900.
[2026-01-29 10:49:21.724357 +0000][Debug  ][File              ][ 1261] [0xe79ed10@root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/fardet-hd/96/83/nu_dune10kt_1x2x6_1098_917_20230826T014306Z_gen_g4_detsim_hitreco__20240229T181033Z_reco2.root?xrdcl.requuid=3d493864-c429-46d0-a7eb-38eafea49061] Sending a read command for handle 0x0 to heplns140.pp.rl.ac.uk:50533
[2026-01-29 10:49:21.724390 +0000][Debug  ][ExDbgMsg          ][ 1261] [heplns140.pp.rl.ac.uk:50533] MsgHandler created: 0x17e03900 (message: kXR_read (handle: 0x00000000, offset: 165100, size: 138) ).
[2026-01-29 10:49:21.724464 +0000][Debug  ][ExDbgMsg          ][ 1261] [heplns140.pp.rl.ac.uk:50533] Moving MsgHandler: 0x17e03900 (message: kXR_read (handle: 0x00000000, offset: 165100, size: 138) ) from out-queu to in-queue.
[2026-01-29 10:49:21.738999 +0000][Debug  ][ExDbgMsg          ][ 1261] [msg: 0x2699c9b0] Assigned MsgHandler: 0x17e03900.
[2026-01-29 10:49:21.739148 +0000][Debug  ][ExDbgMsg          ][ 1261] [handler: 0x17e03900] Removed MsgHandler: 0x17e03900 from the in-queue.
[2026-01-29 10:49:21.739350 +0000][Debug  ][ExDbgMsg          ][ 1261] [heplns140.pp.rl.ac.uk:50533] Calling MsgHandler: 0x17e03900 (message: kXR_read (handle: 0x00000000, offset: 165100, size: 138) ) with status: [SUCCESS] .
[2026-01-29 10:49:21.740002 +0000][Debug  ][ExDbgMsg          ][ 1261] [heplns140.pp.rl.ac.uk:50533] Destroying MsgHandler: 0x17e03900.
[2026-01-29 10:49:21.740202 +0000][Debug  ][File              ][ 1261] [0xe79ed10@root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/fardet-hd/96/83/nu_dune10kt_1x2x6_1098_917_20230826T014306Z_gen_g4_detsim_hitreco__20240229T181033Z_reco2.root?xrdcl.requuid=3d493864-c429-46d0-a7eb-38eafea49061] Sending a read command for handle 0x0 to heplns140.pp.rl.ac.uk:50533
[2026-01-29 10:49:21.740263 +0000][Debug  ][ExDbgMsg          ][ 1261] [heplns140.pp.rl.ac.uk:50533] MsgHandler created: 0x17e03900 (message: kXR_read (handle: 0x00000000, offset: 165826, size: 1979) ).
[2026-01-29 10:49:21.740481 +0000][Debug  ][ExDbgMsg          ][ 1261] [heplns140.pp.rl.ac.uk:50533] Moving MsgHandler: 0x17e03900 (message: kXR_read (handle: 0x00000000, offset: 165826, size: 1979) ) from out-queu to in-queue.
[2026-01-29 10:49:21.754641 +0000][Debug  ][ExDbgMsg          ][ 1261] [msg: 0xc4dd2e0] Assigned MsgHandler: 0x17e03900.
[2026-01-29 10:49:21.754673 +0000][Debug  ][ExDbgMsg          ][ 1261] [handler: 0x17e03900] Removed MsgHandler: 0x17e03900 from the in-queue.
[2026-01-29 10:49:21.754708 +0000][Debug  ][ExDbgMsg          ][ 1261] [heplns140.pp.rl.ac.uk:50533] Calling MsgHandler: 0x17e03900 (message: kXR_read (handle: 0x00000000, offset: 165826, size: 1979) ) with status: [SUCCESS] .
[2026-01-29 10:49:21.754728 +0000][Debug  ][ExDbgMsg          ][ 1261] [heplns140.pp.rl.ac.uk:50533] Destroying MsgHandler: 0x17e03900.
> Running Algorithm: Alg0001, LArPreProcessing
PreProcessingAlgorithm: could not replace current calo hit list with list named: CaloHitList2D
> Running Algorithm: Alg0002, LArDLMaster
PandoraContentApi::GetList(*this, m_inputHitListName, pCaloHitList) return STATUS_CODE_NOT_INITIALIZED
    in function: GetVolumeIdToHitListMap
    in file:     /exp/dune/app/users/imawby/dunesw_check/srcs/larpandoracontent/larpandoracontent/LArControlFlow/MasterAlgorithm.cc line#: 271
this->GetVolumeIdToHitListMap(volumeIdToHitListMap) return STATUS_CODE_NOT_INITIALIZED
    in function: Run
    in file:     /exp/dune/app/users/imawby/dunesw_check/srcs/larpandoracontent/larpandoracontent/LArControlFlow/MasterAlgorithm.cc line#: 165
iter->second->Run() throw STATUS_CODE_NOT_INITIALIZED
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larsoft/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/ALMA9/build/pandora/v04_17_05/src/pandora-v04-17-05/PandoraSDK-v04-01-00/src/Api/PandoraContentApiImpl.cc line#: 263
Failure in algorithm Alg0002, LArDLMaster, STATUS_CODE_NOT_INITIALIZED
29-Jan-2026 10:49:22 GMT  Closed output file "nu_dune10kt_1x2x6_1098_917_20230826T014306Z_gen_g4_detsim_hitreco__20240229T181033Z_reco2_reco2.root"
[2026-01-29 10:49:22.128629 +0000][Debug  ][File              ][ 1261] [0xe79ed10@root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/fardet-hd/96/83/nu_dune10kt_1x2x6_1098_917_20230826T014306Z_gen_g4_detsim_hitreco__20240229T181033Z_reco2.root?xrdcl.requuid=3d493864-c429-46d0-a7eb-38eafea49061] Sending a close command for handle 0x0 to heplns140.pp.rl.ac.uk:50533
[2026-01-29 10:49:22.128761 +0000][Debug  ][ExDbgMsg          ][ 1261] [heplns140.pp.rl.ac.uk:50533] MsgHandler created: 0x14e65d10 (message: kXR_close (handle: 0x00000000) ).
[2026-01-29 10:49:22.128832 +0000][Debug  ][ExDbgMsg          ][ 1261] [heplns140.pp.rl.ac.uk:50533] Moving MsgHandler: 0x14e65d10 (message: kXR_close (handle: 0x00000000) ) from out-queu to in-queue.
[2026-01-29 10:49:22.143370 +0000][Debug  ][ExDbgMsg          ][ 1261] [msg: 0x14e6db20] Assigned MsgHandler: 0x14e65d10.
[2026-01-29 10:49:22.143385 +0000][Debug  ][ExDbgMsg          ][ 1261] [handler: 0x14e65d10] Removed MsgHandler: 0x14e65d10 from the in-queue.
[2026-01-29 10:49:22.143414 +0000][Debug  ][ExDbgMsg          ][ 1261] [heplns140.pp.rl.ac.uk:50533] Calling MsgHandler: 0x14e65d10 (message: kXR_close (handle: 0x00000000) ) with status: [SUCCESS] .
[2026-01-29 10:49:22.143438 +0000][Debug  ][File              ][ 1261] [0xe79ed10@root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/fardet-hd/96/83/nu_dune10kt_1x2x6_1098_917_20230826T014306Z_gen_g4_detsim_hitreco__20240229T181033Z_reco2.root?xrdcl.requuid=3d493864-c429-46d0-a7eb-38eafea49061] Close returned from heplns140.pp.rl.ac.uk:50533 with: [SUCCESS] 
[2026-01-29 10:49:22.143459 +0000][Debug  ][ExDbgMsg          ][ 1261] [heplns140.pp.rl.ac.uk:50533] Destroying MsgHandler: 0x14e65d10.
29-Jan-2026 10:49:22 GMT  Closed input file "root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/fardet-hd/96/83/nu_dune10kt_1x2x6_1098_917_20230826T014306Z_gen_g4_detsim_hitreco__20240229T181033Z_reco2.root"

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                     0.124287       4.18701       116.077       2.28438       11.7215        100    
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                         0.0297102     0.0457476     0.0719681     0.0487668    0.00908903       100    
reco:pandora:StandardPandora                   0.0937305      4.13187       115.936       2.23153       11.7116        100    
[art]:TriggerResults:TriggerResultInserter    1.0745e-05    1.91232e-05    7.216e-05    1.7263e-05    7.73948e-06      100    
end_path:out1:RootOutput                       2.135e-06    2.81297e-06   1.1555e-05    2.7185e-06    9.27588e-07      100    
end_path:out1:RootOutput(write)               0.000705223   0.00922966     0.0841087    0.00596839     0.010608        100    
================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 2333.66 MB
  Peak resident set size usage (VmHWM): 1343.56 MB
====================================================================================================
Art has completed and will exit with status 0.
[2026-01-29 10:49:22.585902 +0000][Debug  ][JobMgr            ][ 1261] Stopping the job manager...
[2026-01-29 10:49:22.586891 +0000][Debug  ][JobMgr            ][ 1261] Job manager stopped
[2026-01-29 10:49:22.587344 +0000][Debug  ][TaskMgr           ][ 1261] Stopping the task manager...
[2026-01-29 10:49:22.587651 +0000][Debug  ][TaskMgr           ][ 1261] Task manager stopped
[2026-01-29 10:49:22.587754 +0000][Debug  ][Poller            ][ 1261] Stopping the poller...
[2026-01-29 10:49:22.587948 +0000][Debug  ][AsyncSock         ][ 1261] [heplns140.pp.rl.ac.uk:50533.0] Closing the socket
[2026-01-29 10:49:22.587983 +0000][Debug  ][Poller            ][ 1261] <[::ffff:192.41.105.39]:33084><--><[::ffff:130.246.47.140]:50533> Removing socket from the poller
[2026-01-29 10:49:22.588050 +0000][Debug  ][PostMaster        ][ 1261] [heplns140.pp.rl.ac.uk:50533] Destroying stream
[2026-01-29 10:49:22.588178 +0000][Debug  ][AsyncSock         ][ 1261] [heplns140.pp.rl.ac.uk:50533.0] Closing the socket
[2026-01-29 10:49:22.588213 +0000][Debug  ][AsyncSock         ][ 1261] [mover.pp.rl.ac.uk:1094.0] Closing the socket
[2026-01-29 10:49:22.588236 +0000][Debug  ][Poller            ][ 1261] <[::ffff:192.41.105.39]:38576><--><[::ffff:130.246.47.230]:1094> Removing socket from the poller
[2026-01-29 10:49:22.588427 +0000][Debug  ][PostMaster        ][ 1261] [mover.pp.rl.ac.uk:1094] Destroying stream
[2026-01-29 10:49:22.588464 +0000][Debug  ][AsyncSock         ][ 1261] [mover.pp.rl.ac.uk:1094.0] Closing the socket
=== End last 100 lines of lar log file ===
justIN time: 2026-02-04 06:06:53 UTC       justIN version: 01.06.00