Jobsub ID 47050.1@dunegpschedd01.fnal.gov
Jobsub ID | 47050.1@dunegpschedd01.fnal.gov |
Workflow ID | 2286 |
Stage ID | 1 |
User name | ykermaid@fnal.gov |
HTCondor Group | group_dune.prod.mcsim |
Requested | Processors | 1 |
GPU | No |
RSS bytes | 4193255424 (3999 MiB) |
Wall seconds limit | 18000 (5 hours) |
Submitted time | 2025-09-16 16:42:07 |
Site | NL_NIKHEF |
Entry | VIRGO_NL_NIKHEF_brug |
Last heartbeat | 2025-09-16 17:33:23 |
From worker node | Hostname | wn-sate-038.farm.nikhef.nl |
cpuinfo | AMD EPYC 7551P 32-Core Processor |
OS release | Scientific Linux release 7.9 (Nitrogen) |
Processors | 1 |
RSS bytes | 4194304000 (4000 MiB) |
Wall seconds limit | 129600 (36 hours) |
GPU | |
Inner Apptainer? | True |
Job state | jobscript_error |
Started | 2025-09-16 16:42:43 |
Input files | vd-protodune:np02vd_raw_run039324_0632_df-s03-d1_dw_0_20250906T041123.hdf5
|
Jobscript | Exit code | 1 |
Real time | 0m (0s) |
CPU time | 0m (0s = 0%) |
Max RSS bytes | 0 (0 MiB) |
Outputting started | |
Output files | |
Finished | 2025-09-16 17:33:23 |
Saved logs | justin-logs:47050.1-dunegpschedd01.fnal.gov.logs.tgz |
List job events Cached HTCondor job logs |
Jobscript log (last 10,000 characters)
:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
RawFrameSource: got 12288 raw::RawDigit objects
input nticks=6400 keeping as is
[19:31:54.199] D [ main ] executing 1 apps, thread limit 0:
[19:31:54.199] D [ main ] executing 1 apps, thread limit 0:
[19:31:54.199] D [ main ] executing app: "Pgrapher"
[19:31:54.199] D [ pgraph ] <Pgrapher:> executing graph
[19:31:54.199] D [ pgraph ] executing with 26 nodes
[19:31:54.200] D [ glue ] <FrameFanout:nfsp> call=26: input: frame: ident=331747 time=40 tick=512 with 12288 traces. frame tags:[ "orig" ] 0 tagged trace sets:[ ] cmm:[ ] output 0: frame: ident=331747 time=40 tick=512 with 12288 traces. frame tags:[ "orig0" ] 0 tagged trace sets:[ ] cmm:[ ] output 1: frame: ident=331747 time=40 tick=512 with 12288 traces. frame tags:[ "orig1" ] 0 tagged trace sets:[ ] cmm:[ ] output 2: frame: ident=331747 time=40 tick=512 with 12288 traces. frame tags:[ "orig2" ] 0 tagged trace sets:[ ] cmm:[ ] output 3: frame: ident=331747 time=40 tick=512 with 12288 traces. frame tags:[ "orig3" ] 0 tagged trace sets:[ ] cmm:[ ] output 4: frame: ident=331747 time=40 tick=512 with 12288 traces. frame tags:[ "orig4" ] 0 tagged trace sets:[ ] cmm:[ ] output 5: frame: ident=331747 time=40 tick=512 with 12288 traces. frame tags:[ "orig5" ] 0 tagged trace sets:[ ] cmm:[ ] output 6: frame: ident=331747 time=40 tick=512 with 12288 traces. frame tags:[ "orig6" ] 0 tagged trace sets:[ ] cmm:[ ] output 7: frame: ident=331747 time=40 tick=512 with 12288 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]
[19:31:54.201] W [ glue ] <ChannelSelector:chsel7> Untagged summary not supported, summary will be dropped.
[19:31:54.201] D [ glue ] <ChannelSelector:chsel7> input frame: ident=331747 time=40 tick=512 with 12288 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] output: frame: ident=331747 time=40 tick=512 with 1536 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]
[19:31:54.201] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 input frame: frame: ident=331747 time=40 tick=512 with 1536 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]
[19:31:54.201] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 init nticks=6400 tbinmin=0 tbinmax=6400
[19:31:54.232] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 load plane index: 0, ntraces=1536, input bad regions: 0
[19:31:55.779] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 load plane index: 1, ntraces=1536, input bad regions: 0
[19:31:57.394] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 load plane index: 2, ntraces=1536, input bad regions: 0
==================================================================================================================================
TimeTracker printout (sec) Min Avg Max Median RMS nEvts
==================================================================================================================================
Full event 7.1745e-05 201.34 286.063 203.022 69.3229 14
----------------------------------------------------------------------------------------------------------------------------------
source:HDF5RawInput3(read) 5.793e-05 8.33684e-05 0.000268565 6.9511e-05 5.16038e-05 14
produce:tpcrawdecoder:PDVDTPCReader 51.2791 76.1592 117.402 74.7706 15.2289 14
produce:triggerrawdecoder:PDVDTriggerReader4 0.0347255 0.0393805 0.0521838 0.0352583 0.005924 14
produce:pdvddaphne:DAPHNEReaderPDVD 6.08745 8.79937 11.9114 8.86717 1.57973 14
produce:ophit:OpHitFinder 0.0379498 0.0473331 0.0577425 0.0464993 0.00568531 14
produce:opflash:OpFlashFinderVerticalDrift 0.00655343 0.0125901 0.0160468 0.0134612 0.00261089 14
produce:wclsdatavd:WireCellToolkit 43.9532 59.5925 80.5245 56.8386 10.2254 13
produce:gaushit:GausHitFinder 0.708856 1.17049 1.43945 1.14107 0.214306 13
produce:nhitsfilter:NumberOfHitsFilter 0.000198614 0.0003498 0.000541099 0.000326485 9.09468e-05 13
produce:reco3d:SpacePointSolver 5.98713 12.0491 17.4819 11.9023 3.25275 13
produce:hitpdune:DisambigFromSpacePoints 0.0689592 0.175235 0.275774 0.158962 0.0580364 13
produce:pandora:StandardPandora 18.5526 53.2603 120.938 42.643 29.9081 13
produce:pandoraTrack:LArPandoraTrackCreation 0.293014 0.845092 1.33154 0.885628 0.330766 13
produce:pandoraGnocalo:GnocchiCalorimetry 0.0166251 0.0278249 0.0361108 0.028843 0.00524063 13
[art]:TriggerResults:TriggerResultInserter 2.2523e-05 3.77167e-05 0.000107132 2.9386e-05 2.22234e-05 13
end_path:out1:RootOutput 5.62e-06 1.06123e-05 3.4054e-05 9.107e-06 6.88409e-06 13
end_path:out1:RootOutput(write) 3.87739 4.65407 5.98012 4.26636 0.735918 13
==================================================================================================================================
====================================================================================================
MemoryTracker summary (base-10 MB units used)
Peak virtual memory usage (VmPeak) : 8589.84 MB
Peak resident set size usage (VmHWM): 6711.62 MB
Details saved in: 'mem.db'
====================================================================================================
%MSG-s ArtException: PostEndJob 16-Sep-2025 19:33:03 CEST ModuleEndJob
---- EventProcessorFailure BEGIN
EventProcessor: an exception occurred during current event processing
---- ScheduleExecutionFailure BEGIN
Path: ProcessingStopped.
---- BadAlloc BEGIN
A bad_alloc exception was thrown while processing module WireCellToolkit/wclsdatavd run: 39324 subRun: 1 event: 331747
The job has probably exhausted the virtual memory available to the process.
---- BadAlloc END
Exception going through path produce
---- ScheduleExecutionFailure END
---- EventProcessorFailure END
%MSG
Art has completed and will exit with status 1.
Error in reco1