Jobsub ID 47052.4@dunegpschedd01.fnal.gov
Jobsub ID | 47052.4@dunegpschedd01.fnal.gov |
Workflow ID | 2332 |
Stage ID | 1 |
User name | ykermaid@fnal.gov |
HTCondor Group | group_dune.prod.mcsim |
Requested | Processors | 1 |
Requested | GPU | No |
Requested | RSS bytes | 4193255424 (3999 MiB) |
Requested | Wall seconds limit | 18000 (5 hours) |
Submitted time | 2025-09-16 16:42:07 |
Site | NL_NIKHEF |
Entry | VIRGO_NL_NIKHEF_brug |
Last heartbeat | 2025-09-16 17:10:39 |
From worker node | Hostname | wn-choc-033.farm.nikhef.nl |
From worker node | cpuinfo | Intel(R) Xeon(R) CPU E5-2650 v4 @ 2.20GHz |
From worker node | OS release | Scientific Linux release 7.9 (Nitrogen) |
From worker node | Processors | 1 |
From worker node | RSS bytes | 4194304000 (4000 MiB) |
From worker node | Wall seconds limit | 129600 (36 hours) |
From worker node | GPU | |
From worker node | Inner Apptainer? | True |
Job state | jobscript_error |
Started | 2025-09-16 16:42:29 |
Input files | vd-protodune:np02vd_raw_run039343_0008_df-s01-d3_dw_0_20250908T122406.hdf5 |
Jobscript | Exit code | 1 |
Jobscript | Real time | 0m (0s) |
Jobscript | CPU time | 0m (0s = 0%) |
Jobscript | Max RSS bytes | 0 (0 MiB) |
Outputting started | |
Output files | |
Finished | 2025-09-16 17:10:39 |
Saved logs | justin-logs:47052.4-dunegpschedd01.fnal.gov.logs.tgz |
Jobscript log (last 10,000 characters)
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
RawFrameSource: got 12288 raw::RawDigit objects
input nticks=6400 keeping as is
[19:09:07.400] D [ main ] executing 1 apps, thread limit 0:
[19:09:07.400] D [ main ] executing 1 apps, thread limit 0:
[19:09:07.400] D [ main ] executing app: "Pgrapher"
[19:09:07.400] D [ pgraph ] <Pgrapher:> executing graph
[19:09:07.400] D [ pgraph ] executing with 26 nodes
[19:09:07.402] D [ glue ] <FrameFanout:nfsp> call=12: input: frame: ident=4361 time=22 tick=512 with 12288 traces. frame tags:[ "orig" ] 0 tagged trace sets:[ ] cmm:[ ] output 0: frame: ident=4361 time=22 tick=512 with 12288 traces. frame tags:[ "orig0" ] 0 tagged trace sets:[ ] cmm:[ ] output 1: frame: ident=4361 time=22 tick=512 with 12288 traces. frame tags:[ "orig1" ] 0 tagged trace sets:[ ] cmm:[ ] output 2: frame: ident=4361 time=22 tick=512 with 12288 traces. frame tags:[ "orig2" ] 0 tagged trace sets:[ ] cmm:[ ] output 3: frame: ident=4361 time=22 tick=512 with 12288 traces. frame tags:[ "orig3" ] 0 tagged trace sets:[ ] cmm:[ ] output 4: frame: ident=4361 time=22 tick=512 with 12288 traces. frame tags:[ "orig4" ] 0 tagged trace sets:[ ] cmm:[ ] output 5: frame: ident=4361 time=22 tick=512 with 12288 traces. frame tags:[ "orig5" ] 0 tagged trace sets:[ ] cmm:[ ] output 6: frame: ident=4361 time=22 tick=512 with 12288 traces. frame tags:[ "orig6" ] 0 tagged trace sets:[ ] cmm:[ ] output 7: frame: ident=4361 time=22 tick=512 with 12288 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]
[19:09:07.403] W [ glue ] <ChannelSelector:chsel7> Untagged summary not supported, summary will be dropped.
[19:09:07.404] D [ glue ] <ChannelSelector:chsel7> input frame: ident=4361 time=22 tick=512 with 12288 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] output: frame: ident=4361 time=22 tick=512 with 1536 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]
[19:09:07.404] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=12 input frame: frame: ident=4361 time=22 tick=512 with 1536 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]
[19:09:07.404] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=12 init nticks=6400 tbinmin=0 tbinmax=6400
[19:09:07.444] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=12 load plane index: 0, ntraces=1536, input bad regions: 0
[19:09:09.200] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=12 load plane index: 1, ntraces=1536, input bad regions: 0
[19:09:10.930] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=12 load plane index: 2, ntraces=1536, input bad regions: 0
==================================================================================================================================
TimeTracker printout (sec) Min Avg Max Median RMS nEvts
==================================================================================================================================
Full event 6.4129e-05 219.456 690.971 154.624 205.637 7
----------------------------------------------------------------------------------------------------------------------------------
source:HDF5RawInput3(read) 6.0354e-05 8.0428e-05 0.000172859 6.4291e-05 3.79012e-05 7
produce:tpcrawdecoder:PDVDTPCReader 11.4042 11.9595 14.4094 11.5688 1.00349 7
produce:triggerrawdecoder:PDVDTriggerReader4 0.512127 0.531571 0.542358 0.529653 0.00966694 7
produce:pdvddaphne:DAPHNEReaderPDVD 4.02412 4.29705 4.5118 4.32005 0.157008 7
produce:ophit:OpHitFinder 0.0434386 0.0511102 0.0562815 0.05147 0.00370082 7
produce:opflash:OpFlashFinderVerticalDrift 0.0110659 0.0155398 0.0176028 0.0164288 0.00215823 7
produce:wclsdatavd:WireCellToolkit 54.0592 75.1835 114.463 70.3719 19.1558 6
produce:gaushit:GausHitFinder 0.770456 1.58168 2.84449 1.4209 0.638462 6
produce:nhitsfilter:NumberOfHitsFilter 0.000291828 0.000399809 0.000498931 0.000404759 9.31561e-05 6
produce:reco3d:SpacePointSolver 6.64519 13.1193 22.2648 11.5557 4.82276 6
produce:hitpdune:DisambigFromSpacePoints 0.105264 0.230727 0.548772 0.172494 0.146948 6
produce:pandora:StandardPandora 17.208 142.942 563.229 51.7366 191.795 6
produce:pandoraTrack:LArPandoraTrackCreation 0.504778 1.28542 2.83935 0.989921 0.783858 6
produce:pandoraGnocalo:GnocchiCalorimetry 0.0214692 0.0313349 0.038815 0.033263 0.00561922 6
[art]:TriggerResults:TriggerResultInserter 2.4651e-05 3.8288e-05 6.1795e-05 3.5152e-05 1.14002e-05 6
end_path:out1:RootOutput 6.174e-06 9.2175e-06 2.1911e-05 6.714e-06 5.69512e-06 6
end_path:out1:RootOutput(write) 4.16978 4.6976 6.25245 4.305 0.738709 6
==================================================================================================================================
====================================================================================================
MemoryTracker summary (base-10 MB units used)
Peak virtual memory usage (VmPeak) : 8589.51 MB
Peak resident set size usage (VmHWM): 6675.44 MB
Details saved in: 'mem.db'
====================================================================================================
%MSG-s ArtException: PostEndJob 16-Sep-2025 19:10:22 CEST ModuleEndJob
---- EventProcessorFailure BEGIN
EventProcessor: an exception occurred during current event processing
---- ScheduleExecutionFailure BEGIN
Path: ProcessingStopped.
---- BadAlloc BEGIN
A bad_alloc exception was thrown while processing module WireCellToolkit/wclsdatavd run: 39343 subRun: 1 event: 4361
The job has probably exhausted the virtual memory available to the process.
---- BadAlloc END
Exception going through path produce
---- ScheduleExecutionFailure END
---- EventProcessorFailure END
%MSG
Art has completed and will exit with status 1.
Error in reco1
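
Note on the failure: the BadAlloc above can be set against the memory request at the top of the page. The sketch below is only an illustration, not part of justIN or art. It assumes the MemoryTracker summary lines keep the format shown in this log, and the helper names `parse_memory_summary` and `rss_overshoot` are hypothetical. It converts the reported base-10 MB peaks to bytes and divides the peak resident set size by the requested RSS bytes.

```python
import re

# Matches lines such as "Peak resident set size usage (VmHWM): 6675.44 MB".
# Assumption: the summary format is the one printed in this log.
_MEM_LINE = re.compile(r"\((VmPeak|VmHWM)\)\s*:\s*([0-9.]+)\s*MB")


def parse_memory_summary(log_text):
    """Return peak memory in bytes, keyed by 'VmPeak' and 'VmHWM'.

    MemoryTracker states that it uses base-10 MB, so 1 MB = 1,000,000 bytes.
    """
    return {
        name: int(round(float(mb) * 1_000_000))
        for name, mb in _MEM_LINE.findall(log_text)
    }


def rss_overshoot(log_text, requested_rss_bytes):
    """Return peak resident set size divided by the requested RSS bytes."""
    peaks = parse_memory_summary(log_text)
    return peaks["VmHWM"] / requested_rss_bytes


# Values copied from this page.
summary = """
Peak virtual memory usage (VmPeak) : 8589.51 MB
Peak resident set size usage (VmHWM): 6675.44 MB
"""
requested = 4193255424  # "Requested ... RSS bytes" row

ratio = rss_overshoot(summary, requested)
print(f"Peak RSS is {ratio:.2f}x the requested RSS")
```

With the figures on this page the ratio is above 1, meaning the peak resident set size reported by MemoryTracker was larger than the RSS requested for the job.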