
Jobsub ID 47052.4@dunegpschedd01.fnal.gov

Jobsub ID: 47052.4@dunegpschedd01.fnal.gov
Workflow ID: 2332
Stage ID: 1
User name: ykermaid@fnal.gov
HTCondor Group: group_dune.prod.mcsim
Requested:
    Processors: 1
    GPU: No
    RSS bytes: 4193255424 (3999 MiB)
    Wall seconds limit: 18000 (5 hours)
Submitted time: 2025-09-16 16:42:07
Site: NL_NIKHEF
Entry: VIRGO_NL_NIKHEF_brug
Last heartbeat: 2025-09-16 17:10:39
From worker node:
    Hostname: wn-choc-033.farm.nikhef.nl
    cpuinfo: Intel(R) Xeon(R) CPU E5-2650 v4 @ 2.20GHz
    OS release: Scientific Linux release 7.9 (Nitrogen)
    Processors: 1
    RSS bytes: 4194304000 (4000 MiB)
    Wall seconds limit: 129600 (36 hours)
    GPU:
    Inner Apptainer?: True
Job state: jobscript_error
Started: 2025-09-16 16:42:29
Input files: vd-protodune:np02vd_raw_run039343_0008_df-s01-d3_dw_0_20250908T122406.hdf5
Jobscript:
    Exit code: 1
    Real time: 0m (0s)
    CPU time: 0m (0s = 0%)
    Max RSS bytes: 0 (0 MiB)
Outputting started:
Output files:
Finished: 2025-09-16 17:10:39
Saved logs: justin-logs:47052.4-dunegpschedd01.fnal.gov.logs.tgz
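
As a rough aid to reading the memory figures, the requested RSS in the summary above can be compared with the peak resident set size that MemoryTracker reports in the jobscript log (6675.44 MB). The sketch below is only arithmetic on numbers copied from this page. It assumes that "base-10 MB" means 10^6 bytes, and the variable names are illustrative, not part of justIN or art.

```python
MIB = 1024 ** 2   # binary mebibyte, as used in the job summary
MB = 10 ** 6      # assumption: MemoryTracker's "base-10 MB" is 10^6 bytes

requested_rss_bytes = 4193255424   # "RSS bytes" requested by the job
peak_rss_mb = 6675.44              # VmHWM from the MemoryTracker summary
peak_vm_mb = 8589.51               # VmPeak from the MemoryTracker summary

# Convert the MemoryTracker figures to bytes so all values share one unit.
peak_rss_bytes = round(peak_rss_mb * MB)
peak_vm_bytes = round(peak_vm_mb * MB)

requested_mib = requested_rss_bytes / MIB
peak_rss_mib = peak_rss_bytes / MIB
peak_vm_mib = peak_vm_bytes / MIB
ratio = peak_rss_bytes / requested_rss_bytes

print(f"requested RSS: {requested_mib:.0f} MiB")
print(f"peak RSS:      {peak_rss_mib:.0f} MiB")
print(f"peak virtual:  {peak_vm_mib:.0f} MiB")
print(f"peak RSS / requested RSS: {ratio:.2f}")
```

The ratio alone does not establish why the job failed. The log itself only states that the process has probably exhausted the virtual memory available to it.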

Jobscript log (last 10,000 characters)

SG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:09:07 CEST  run: 39343 subRun: 1 event: 4361
Error! unrecognized channel number -1. Ignoring pulse
%MSG
RawFrameSource: got 12288 raw::RawDigit objects
	input nticks=6400 keeping as is
[19:09:07.400] D [  main  ] executing 1 apps, thread limit 0:
[19:09:07.400] D [  main  ] executing 1 apps, thread limit 0:
[19:09:07.400] D [  main  ] executing app: "Pgrapher"
[19:09:07.400] D [ pgraph ] <Pgrapher:> executing graph 
[19:09:07.400] D [ pgraph ] executing with 26 nodes
[19:09:07.402] D [  glue  ] <FrameFanout:nfsp> call=12: input: frame: ident=4361 time=22 tick=512 with 12288 traces.  frame tags:[ "orig" ] 0 tagged trace sets:[ ] cmm:[ ] output 0: frame: ident=4361 time=22 tick=512 with 12288 traces.  frame tags:[ "orig0" ] 0 tagged trace sets:[ ] cmm:[ ] output 1: frame: ident=4361 time=22 tick=512 with 12288 traces.  frame tags:[ "orig1" ] 0 tagged trace sets:[ ] cmm:[ ] output 2: frame: ident=4361 time=22 tick=512 with 12288 traces.  frame tags:[ "orig2" ] 0 tagged trace sets:[ ] cmm:[ ] output 3: frame: ident=4361 time=22 tick=512 with 12288 traces.  frame tags:[ "orig3" ] 0 tagged trace sets:[ ] cmm:[ ] output 4: frame: ident=4361 time=22 tick=512 with 12288 traces.  frame tags:[ "orig4" ] 0 tagged trace sets:[ ] cmm:[ ] output 5: frame: ident=4361 time=22 tick=512 with 12288 traces.  frame tags:[ "orig5" ] 0 tagged trace sets:[ ] cmm:[ ] output 6: frame: ident=4361 time=22 tick=512 with 12288 traces.  frame tags:[ "orig6" ] 0 tagged trace sets:[ ] cmm:[ ] output 7: frame: ident=4361 time=22 tick=512 with 12288 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]  
[19:09:07.403] W [  glue  ] <ChannelSelector:chsel7> Untagged summary not supported, summary will be dropped. 
[19:09:07.404] D [  glue  ] <ChannelSelector:chsel7> input frame: ident=4361 time=22 tick=512 with 12288 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] output: frame: ident=4361 time=22 tick=512 with 1536 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] 
[19:09:07.404] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=12 input frame: frame: ident=4361 time=22 tick=512 with 1536 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] 
[19:09:07.404] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=12 init nticks=6400 tbinmin=0 tbinmax=6400 
[19:09:07.444] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=12 load plane index: 0, ntraces=1536, input bad regions: 0 
[19:09:09.200] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=12 load plane index: 1, ntraces=1536, input bad regions: 0 
[19:09:10.930] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=12 load plane index: 2, ntraces=1536, input bad regions: 0 

==================================================================================================================================
TimeTracker printout (sec)                          Min           Avg           Max         Median          RMS         nEvts   
==================================================================================================================================
Full event                                      6.4129e-05      219.456       690.971       154.624       205.637         7     
----------------------------------------------------------------------------------------------------------------------------------
source:HDF5RawInput3(read)                      6.0354e-05    8.0428e-05    0.000172859   6.4291e-05    3.79012e-05       7     
produce:tpcrawdecoder:PDVDTPCReader               11.4042       11.9595       14.4094       11.5688       1.00349         7     
produce:triggerrawdecoder:PDVDTriggerReader4     0.512127      0.531571      0.542358      0.529653     0.00966694        7     
produce:pdvddaphne:DAPHNEReaderPDVD               4.02412       4.29705       4.5118        4.32005      0.157008         7     
produce:ophit:OpHitFinder                        0.0434386     0.0511102     0.0562815      0.05147     0.00370082        7     
produce:opflash:OpFlashFinderVerticalDrift       0.0110659     0.0155398     0.0176028     0.0164288    0.00215823        7     
produce:wclsdatavd:WireCellToolkit                54.0592       75.1835       114.463       70.3719       19.1558         6     
produce:gaushit:GausHitFinder                    0.770456       1.58168       2.84449       1.4209       0.638462         6     
produce:nhitsfilter:NumberOfHitsFilter          0.000291828   0.000399809   0.000498931   0.000404759   9.31561e-05       6     
produce:reco3d:SpacePointSolver                   6.64519       13.1193       22.2648       11.5557       4.82276         6     
produce:hitpdune:DisambigFromSpacePoints         0.105264      0.230727      0.548772      0.172494      0.146948         6     
produce:pandora:StandardPandora                   17.208        142.942       563.229       51.7366       191.795         6     
produce:pandoraTrack:LArPandoraTrackCreation     0.504778       1.28542       2.83935      0.989921      0.783858         6     
produce:pandoraGnocalo:GnocchiCalorimetry        0.0214692     0.0313349     0.038815      0.033263     0.00561922        6     
[art]:TriggerResults:TriggerResultInserter      2.4651e-05    3.8288e-05    6.1795e-05    3.5152e-05    1.14002e-05       6     
end_path:out1:RootOutput                         6.174e-06    9.2175e-06    2.1911e-05     6.714e-06    5.69512e-06       6     
end_path:out1:RootOutput(write)                   4.16978       4.6976        6.25245        4.305       0.738709         6     
==================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 8589.51 MB
  Peak resident set size usage (VmHWM): 6675.44 MB
  Details saved in: 'mem.db'
====================================================================================================
%MSG-s ArtException:  PostEndJob 16-Sep-2025 19:10:22 CEST ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- ScheduleExecutionFailure BEGIN
    Path: ProcessingStopped.
    ---- BadAlloc BEGIN
      A bad_alloc exception was thrown while processing module WireCellToolkit/wclsdatavd run: 39343 subRun: 1 event: 4361
      The job has probably exhausted the virtual memory available to the process.
    ---- BadAlloc END
    Exception going through path produce
  ---- ScheduleExecutionFailure END
---- EventProcessorFailure END
%MSG
Art has completed and will exit with status 1.
Error in reco1
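
To see which modules dominate the per-event time, the TimeTracker rows in the log above can be ranked by their average. This is a minimal sketch, assuming each row is a label followed by six whitespace-separated numeric columns (Min, Avg, Max, Median, RMS, nEvts). The four rows are copied from the printout, and `parse_row` is a hypothetical helper, not part of art.

```python
ROWS = """\
produce:tpcrawdecoder:PDVDTPCReader   11.4042  11.9595  14.4094  11.5688  1.00349  7
produce:wclsdatavd:WireCellToolkit    54.0592  75.1835  114.463  70.3719  19.1558  6
produce:reco3d:SpacePointSolver       6.64519  13.1193  22.2648  11.5557  4.82276  6
produce:pandora:StandardPandora       17.208   142.942  563.229  51.7366  191.795  6
"""

def parse_row(line):
    """Parse one TimeTracker row into a dict (hypothetical helper)."""
    # Split from the right so labels containing spaces ("Full event") stay intact.
    label, *cols = line.rsplit(None, 6)
    mn, avg, mx, med, rms = (float(c) for c in cols[:5])
    return {
        "label": label,
        "min": mn,
        "avg": avg,
        "max": mx,
        "median": med,
        "rms": rms,
        "n_events": int(cols[5]),
    }

# Rank modules by average seconds per event, largest first.
ranked = sorted(
    (parse_row(line) for line in ROWS.splitlines() if line.strip()),
    key=lambda row: row["avg"],
    reverse=True,
)

for row in ranked:
    print(f'{row["label"]:<40} avg={row["avg"]:.1f}s max={row["max"]:.1f}s n={row["n_events"]}')
```

Splitting from the right keeps the parser independent of the label format, so the same helper also handles the "Full event" row.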
justIN time: 2025-09-18 19:28:31 UTC       justIN version: 01.05.00