Jobsub ID 46998.1@dunegpschedd01.fnal.gov
Jobsub ID | 46998.1@dunegpschedd01.fnal.gov |
Workflow ID | 2260 |
Stage ID | 1 |
User name | ykermaid@fnal.gov |
HTCondor Group | group_dune.prod.mcsim |
Requested | Processors | 1 |
Requested | GPU | No |
Requested | RSS bytes | 4193255424 (3999 MiB) |
Requested | Wall seconds limit | 18000 (5 hours) |
Submitted time | 2025-09-16 14:37:01 |
Site | NL_NIKHEF |
Entry | VIRGO_NL_NIKHEF_dissel |
Last heartbeat | 2025-09-16 15:47:05 |
From worker node | Hostname | wn-choc-032.farm.nikhef.nl |
From worker node | cpuinfo | Intel(R) Xeon(R) CPU E5-2650 v4 @ 2.20GHz |
From worker node | OS release | Scientific Linux release 7.9 (Nitrogen) |
From worker node | Processors | 1 |
From worker node | RSS bytes | 4194304000 (4000 MiB) |
From worker node | Wall seconds limit | 129600 (36 hours) |
From worker node | GPU | |
From worker node | Inner Apptainer? | True |
Job state | jobscript_error |
Started | 2025-09-16 14:58:47 |
Input files | vd-protodune:np02vd_raw_run039324_0043_df-s03-d0_dw_0_20250905T153550.hdf5 |
Jobscript | Exit code | 1 |
Jobscript | Real time | 0m (0s) |
Jobscript | CPU time | 0m (0s = 0%) |
Jobscript | Max RSS bytes | 0 (0 MiB) |
Outputting started | |
Output files | |
Finished | 2025-09-16 15:47:05 |
Saved logs | justin-logs:46998.1-dunegpschedd01.fnal.gov.logs.tgz |
Jobscript log (last 10,000 characters)
@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder: OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
RawFrameSource: got 12288 raw::RawDigit objects
input nticks=10000 keeping as is
[17:44:36.797] D [ main ] executing 1 apps, thread limit 0:
[17:44:36.797] D [ main ] executing 1 apps, thread limit 0:
[17:44:36.797] D [ main ] executing app: "Pgrapher"
[17:44:36.797] D [ pgraph ] <Pgrapher:> executing graph
[17:44:36.797] D [ pgraph ] executing with 26 nodes
[17:44:36.798] D [ glue ] <FrameFanout:nfsp> call=26: input: frame: ident=22646 time=44 tick=512 with 12288 traces. frame tags:[ "orig" ] 0 tagged trace sets:[ ] cmm:[ ] output 0: frame: ident=22646 time=44 tick=512 with 12288 traces. frame tags:[ "orig0" ] 0 tagged trace sets:[ ] cmm:[ ] output 1: frame: ident=22646 time=44 tick=512 with 12288 traces. frame tags:[ "orig1" ] 0 tagged trace sets:[ ] cmm:[ ] output 2: frame: ident=22646 time=44 tick=512 with 12288 traces. frame tags:[ "orig2" ] 0 tagged trace sets:[ ] cmm:[ ] output 3: frame: ident=22646 time=44 tick=512 with 12288 traces. frame tags:[ "orig3" ] 0 tagged trace sets:[ ] cmm:[ ] output 4: frame: ident=22646 time=44 tick=512 with 12288 traces. frame tags:[ "orig4" ] 0 tagged trace sets:[ ] cmm:[ ] output 5: frame: ident=22646 time=44 tick=512 with 12288 traces. frame tags:[ "orig5" ] 0 tagged trace sets:[ ] cmm:[ ] output 6: frame: ident=22646 time=44 tick=512 with 12288 traces. frame tags:[ "orig6" ] 0 tagged trace sets:[ ] cmm:[ ] output 7: frame: ident=22646 time=44 tick=512 with 12288 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]
[17:44:36.799] W [ glue ] <ChannelSelector:chsel7> Untagged summary not supported, summary will be dropped.
[17:44:36.800] D [ glue ] <ChannelSelector:chsel7> input frame: ident=22646 time=44 tick=512 with 12288 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] output: frame: ident=22646 time=44 tick=512 with 1536 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]
[17:44:36.800] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 input frame: frame: ident=22646 time=44 tick=512 with 1536 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]
[17:44:36.800] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 init nticks=10000 tbinmin=0 tbinmax=10000
[17:44:36.851] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 load plane index: 0, ntraces=1536, input bad regions: 0
[17:44:39.474] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 load plane index: 1, ntraces=1536, input bad regions: 0
[17:44:42.111] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 load plane index: 2, ntraces=1536, input bad regions: 0
==================================================================================================================================
TimeTracker printout (sec) Min Avg Max Median RMS nEvts
==================================================================================================================================
Full event 5.9635e-05 188.613 314.642 176.545 73.7396 14
----------------------------------------------------------------------------------------------------------------------------------
source:HDF5RawInput3(read) 5.2894e-05 7.1245e-05 0.000180617 6.16285e-05 3.26126e-05 14
produce:tpcrawdecoder:PDVDTPCReader 22.7694 37.028 56.7695 35.2907 8.65827 14
produce:triggerrawdecoder:PDVDTriggerReader4 0.0344291 0.0452007 0.15024 0.0344835 0.02956 14
produce:pdvddaphne:DAPHNEReaderPDVD 4.92254 6.44988 9.54609 6.02791 1.38244 14
produce:ophit:OpHitFinder 0.0420568 0.0466914 0.0673377 0.0452142 0.00599701 14
produce:opflash:OpFlashFinderVerticalDrift 0.00801958 0.0121577 0.015205 0.0124288 0.00209493 14
produce:wclsdatavd:WireCellToolkit 54.0519 67.3989 90.0131 64.7645 9.08245 13
produce:gaushit:GausHitFinder 1.14774 1.55569 2.2903 1.4839 0.329136 13
produce:nhitsfilter:NumberOfHitsFilter 0.000223823 0.000352541 0.000620933 0.00030796 0.000111232 13
produce:reco3d:SpacePointSolver 8.6611 14.3543 25.7606 12.9838 4.66163 13
produce:hitpdune:DisambigFromSpacePoints 0.122128 0.263617 0.595635 0.223204 0.122407 13
produce:pandora:StandardPandora 32.4308 72.1773 170.095 52.8517 44.6834 13
produce:pandoraTrack:LArPandoraTrackCreation 0.484829 0.956356 1.92429 0.90039 0.34631 13
produce:pandoraGnocalo:GnocchiCalorimetry 0.0199465 0.029004 0.0502405 0.0279599 0.00753756 13
[art]:TriggerResults:TriggerResultInserter 1.7336e-05 2.64641e-05 8.6178e-05 2.1966e-05 1.74519e-05 13
end_path:out1:RootOutput 3.601e-06 7.40385e-06 2.8204e-05 6.352e-06 6.1315e-06 13
end_path:out1:RootOutput(write) 4.07686 4.47302 6.26763 4.33377 0.552629 13
==================================================================================================================================
====================================================================================================
MemoryTracker summary (base-10 MB units used)
Peak virtual memory usage (VmPeak) : 8589.92 MB
Peak resident set size usage (VmHWM): 6707.36 MB
Details saved in: 'mem.db'
====================================================================================================
%MSG-s ArtException: PostEndJob 16-Sep-2025 17:46:43 CEST ModuleEndJob
---- EventProcessorFailure BEGIN
EventProcessor: an exception occurred during current event processing
---- ScheduleExecutionFailure BEGIN
Path: ProcessingStopped.
---- BadAlloc BEGIN
A bad_alloc exception was thrown while processing module WireCellToolkit/wclsdatavd run: 39324 subRun: 1 event: 22646
The job has probably exhausted the virtual memory available to the process.
---- BadAlloc END
Exception going through path produce
---- ScheduleExecutionFailure END
---- EventProcessorFailure END
%MSG
Art has completed and will exit with status 1.
Error in reco1