justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 46998.1@dunegpschedd01.fnal.gov

Jobsub ID46998.1@dunegpschedd01.fnal.gov
Workflow ID2260
Stage ID1
User nameykermaid@fnal.gov
HTCondor Groupgroup_dune.prod.mcsim
RequestedProcessors1
GPUNo
RSS bytes4193255424 (3999 MiB)
Wall seconds limit18000 (5 hours)
Submitted time2025-09-16 14:37:01
SiteNL_NIKHEF
EntryVIRGO_NL_NIKHEF_dissel
Last heartbeat2025-09-16 15:47:05
From worker nodeHostnamewn-choc-032.farm.nikhef.nl
cpuinfoIntel(R) Xeon(R) CPU E5-2650 v4 @ 2.20GHz
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit129600 (36 hours)
GPU
Inner Apptainer?True
Job statejobscript_error
Started2025-09-16 14:58:47
Input filesvd-protodune:np02vd_raw_run039324_0043_df-s03-d0_dw_0_20250905T153550.hdf5
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Max RSS bytes0 (0 MiB)
Outputting started 
Output files
Finished2025-09-16 15:47:05
Saved logsjustin-logs:46998.1-dunegpschedd01.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 17:44:36 CEST  run: 39324 subRun: 1 event: 22646
Error! unrecognized channel number -1. Ignoring pulse
%MSG
RawFrameSource: got 12288 raw::RawDigit objects
	input nticks=10000 keeping as is
[17:44:36.797] D [  main  ] executing 1 apps, thread limit 0:
[17:44:36.797] D [  main  ] executing 1 apps, thread limit 0:
[17:44:36.797] D [  main  ] executing app: "Pgrapher"
[17:44:36.797] D [ pgraph ] <Pgrapher:> executing graph 
[17:44:36.797] D [ pgraph ] executing with 26 nodes
[17:44:36.798] D [  glue  ] <FrameFanout:nfsp> call=26: input: frame: ident=22646 time=44 tick=512 with 12288 traces.  frame tags:[ "orig" ] 0 tagged trace sets:[ ] cmm:[ ] output 0: frame: ident=22646 time=44 tick=512 with 12288 traces.  frame tags:[ "orig0" ] 0 tagged trace sets:[ ] cmm:[ ] output 1: frame: ident=22646 time=44 tick=512 with 12288 traces.  frame tags:[ "orig1" ] 0 tagged trace sets:[ ] cmm:[ ] output 2: frame: ident=22646 time=44 tick=512 with 12288 traces.  frame tags:[ "orig2" ] 0 tagged trace sets:[ ] cmm:[ ] output 3: frame: ident=22646 time=44 tick=512 with 12288 traces.  frame tags:[ "orig3" ] 0 tagged trace sets:[ ] cmm:[ ] output 4: frame: ident=22646 time=44 tick=512 with 12288 traces.  frame tags:[ "orig4" ] 0 tagged trace sets:[ ] cmm:[ ] output 5: frame: ident=22646 time=44 tick=512 with 12288 traces.  frame tags:[ "orig5" ] 0 tagged trace sets:[ ] cmm:[ ] output 6: frame: ident=22646 time=44 tick=512 with 12288 traces.  frame tags:[ "orig6" ] 0 tagged trace sets:[ ] cmm:[ ] output 7: frame: ident=22646 time=44 tick=512 with 12288 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]  
[17:44:36.799] W [  glue  ] <ChannelSelector:chsel7> Untagged summary not supported, summary will be dropped. 
[17:44:36.800] D [  glue  ] <ChannelSelector:chsel7> input frame: ident=22646 time=44 tick=512 with 12288 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] output: frame: ident=22646 time=44 tick=512 with 1536 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] 
[17:44:36.800] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 input frame: frame: ident=22646 time=44 tick=512 with 1536 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] 
[17:44:36.800] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 init nticks=10000 tbinmin=0 tbinmax=10000 
[17:44:36.851] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 load plane index: 0, ntraces=1536, input bad regions: 0 
[17:44:39.474] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 load plane index: 1, ntraces=1536, input bad regions: 0 
[17:44:42.111] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 load plane index: 2, ntraces=1536, input bad regions: 0 

==================================================================================================================================
TimeTracker printout (sec)                          Min           Avg           Max         Median          RMS         nEvts   
==================================================================================================================================
Full event                                      5.9635e-05      188.613       314.642       176.545       73.7396        14     
----------------------------------------------------------------------------------------------------------------------------------
source:HDF5RawInput3(read)                      5.2894e-05    7.1245e-05    0.000180617   6.16285e-05   3.26126e-05      14     
produce:tpcrawdecoder:PDVDTPCReader               22.7694       37.028        56.7695       35.2907       8.65827        14     
produce:triggerrawdecoder:PDVDTriggerReader4     0.0344291     0.0452007      0.15024      0.0344835      0.02956        14     
produce:pdvddaphne:DAPHNEReaderPDVD               4.92254       6.44988       9.54609       6.02791       1.38244        14     
produce:ophit:OpHitFinder                        0.0420568     0.0466914     0.0673377     0.0452142    0.00599701       14     
produce:opflash:OpFlashFinderVerticalDrift      0.00801958     0.0121577     0.015205      0.0124288    0.00209493       14     
produce:wclsdatavd:WireCellToolkit                54.0519       67.3989       90.0131       64.7645       9.08245        13     
produce:gaushit:GausHitFinder                     1.14774       1.55569       2.2903        1.4839       0.329136        13     
produce:nhitsfilter:NumberOfHitsFilter          0.000223823   0.000352541   0.000620933   0.00030796    0.000111232      13     
produce:reco3d:SpacePointSolver                   8.6611        14.3543       25.7606       12.9838       4.66163        13     
produce:hitpdune:DisambigFromSpacePoints         0.122128      0.263617      0.595635      0.223204      0.122407        13     
produce:pandora:StandardPandora                   32.4308       72.1773       170.095       52.8517       44.6834        13     
produce:pandoraTrack:LArPandoraTrackCreation     0.484829      0.956356       1.92429       0.90039       0.34631        13     
produce:pandoraGnocalo:GnocchiCalorimetry        0.0199465     0.029004      0.0502405     0.0279599    0.00753756       13     
[art]:TriggerResults:TriggerResultInserter      1.7336e-05    2.64641e-05   8.6178e-05    2.1966e-05    1.74519e-05      13     
end_path:out1:RootOutput                         3.601e-06    7.40385e-06   2.8204e-05     6.352e-06    6.1315e-06       13     
end_path:out1:RootOutput(write)                   4.07686       4.47302       6.26763       4.33377      0.552629        13     
==================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 8589.92 MB
  Peak resident set size usage (VmHWM): 6707.36 MB
  Details saved in: 'mem.db'
====================================================================================================
%MSG-s ArtException:  PostEndJob 16-Sep-2025 17:46:43 CEST ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- ScheduleExecutionFailure BEGIN
    Path: ProcessingStopped.
    ---- BadAlloc BEGIN
      A bad_alloc exception was thrown while processing module WireCellToolkit/wclsdatavd run: 39324 subRun: 1 event: 22646
      The job has probably exhausted the virtual memory available to the process.
    ---- BadAlloc END
    Exception going through path produce
  ---- ScheduleExecutionFailure END
---- EventProcessorFailure END
%MSG
Art has completed and will exit with status 1.
Error in reco1
justIN time: 2025-09-18 19:25:43 UTC       justIN version: 01.05.00