justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 47050.1@dunegpschedd01.fnal.gov

Jobsub ID47050.1@dunegpschedd01.fnal.gov
Workflow ID2286
Stage ID1
User nameykermaid@fnal.gov
HTCondor Groupgroup_dune.prod.mcsim
RequestedProcessors1
GPUNo
RSS bytes4193255424 (3999 MiB)
Wall seconds limit18000 (5 hours)
Submitted time2025-09-16 16:42:07
SiteNL_NIKHEF
EntryVIRGO_NL_NIKHEF_brug
Last heartbeat2025-09-16 17:33:23
From worker nodeHostnamewn-sate-038.farm.nikhef.nl
cpuinfoAMD EPYC 7551P 32-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit129600 (36 hours)
GPU
Inner Apptainer?True
Job statejobscript_error
Started2025-09-16 16:42:43
Input filesvd-protodune:np02vd_raw_run039324_0632_df-s03-d1_dw_0_20250906T041123.hdf5
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Max RSS bytes0 (0 MiB)
Outputting started 
Output files
Finished2025-09-16 17:33:23
Saved logsjustin-logs:47050.1-dunegpschedd01.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
%MSG-e OpHitFinder:  OpHitFinder:ophit@BeginModule 16-Sep-2025 19:31:54 CEST  run: 39324 subRun: 1 event: 331747
Error! unrecognized channel number -1. Ignoring pulse
%MSG
RawFrameSource: got 12288 raw::RawDigit objects
	input nticks=6400 keeping as is
[19:31:54.199] D [  main  ] executing 1 apps, thread limit 0:
[19:31:54.199] D [  main  ] executing 1 apps, thread limit 0:
[19:31:54.199] D [  main  ] executing app: "Pgrapher"
[19:31:54.199] D [ pgraph ] <Pgrapher:> executing graph 
[19:31:54.199] D [ pgraph ] executing with 26 nodes
[19:31:54.200] D [  glue  ] <FrameFanout:nfsp> call=26: input: frame: ident=331747 time=40 tick=512 with 12288 traces.  frame tags:[ "orig" ] 0 tagged trace sets:[ ] cmm:[ ] output 0: frame: ident=331747 time=40 tick=512 with 12288 traces.  frame tags:[ "orig0" ] 0 tagged trace sets:[ ] cmm:[ ] output 1: frame: ident=331747 time=40 tick=512 with 12288 traces.  frame tags:[ "orig1" ] 0 tagged trace sets:[ ] cmm:[ ] output 2: frame: ident=331747 time=40 tick=512 with 12288 traces.  frame tags:[ "orig2" ] 0 tagged trace sets:[ ] cmm:[ ] output 3: frame: ident=331747 time=40 tick=512 with 12288 traces.  frame tags:[ "orig3" ] 0 tagged trace sets:[ ] cmm:[ ] output 4: frame: ident=331747 time=40 tick=512 with 12288 traces.  frame tags:[ "orig4" ] 0 tagged trace sets:[ ] cmm:[ ] output 5: frame: ident=331747 time=40 tick=512 with 12288 traces.  frame tags:[ "orig5" ] 0 tagged trace sets:[ ] cmm:[ ] output 6: frame: ident=331747 time=40 tick=512 with 12288 traces.  frame tags:[ "orig6" ] 0 tagged trace sets:[ ] cmm:[ ] output 7: frame: ident=331747 time=40 tick=512 with 12288 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]  
[19:31:54.201] W [  glue  ] <ChannelSelector:chsel7> Untagged summary not supported, summary will be dropped. 
[19:31:54.201] D [  glue  ] <ChannelSelector:chsel7> input frame: ident=331747 time=40 tick=512 with 12288 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] output: frame: ident=331747 time=40 tick=512 with 1536 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] 
[19:31:54.201] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 input frame: frame: ident=331747 time=40 tick=512 with 1536 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] 
[19:31:54.201] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 init nticks=6400 tbinmin=0 tbinmax=6400 
[19:31:54.232] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 load plane index: 0, ntraces=1536, input bad regions: 0 
[19:31:55.779] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 load plane index: 1, ntraces=1536, input bad regions: 0 
[19:31:57.394] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=26 load plane index: 2, ntraces=1536, input bad regions: 0 

==================================================================================================================================
TimeTracker printout (sec)                          Min           Avg           Max         Median          RMS         nEvts   
==================================================================================================================================
Full event                                      7.1745e-05      201.34        286.063       203.022       69.3229        14     
----------------------------------------------------------------------------------------------------------------------------------
source:HDF5RawInput3(read)                       5.793e-05    8.33684e-05   0.000268565   6.9511e-05    5.16038e-05      14     
produce:tpcrawdecoder:PDVDTPCReader               51.2791       76.1592       117.402       74.7706       15.2289        14     
produce:triggerrawdecoder:PDVDTriggerReader4     0.0347255     0.0393805     0.0521838     0.0352583     0.005924        14     
produce:pdvddaphne:DAPHNEReaderPDVD               6.08745       8.79937       11.9114       8.86717       1.57973        14     
produce:ophit:OpHitFinder                        0.0379498     0.0473331     0.0577425     0.0464993    0.00568531       14     
produce:opflash:OpFlashFinderVerticalDrift      0.00655343     0.0125901     0.0160468     0.0134612    0.00261089       14     
produce:wclsdatavd:WireCellToolkit                43.9532       59.5925       80.5245       56.8386       10.2254        13     
produce:gaushit:GausHitFinder                    0.708856       1.17049       1.43945       1.14107      0.214306        13     
produce:nhitsfilter:NumberOfHitsFilter          0.000198614    0.0003498    0.000541099   0.000326485   9.09468e-05      13     
produce:reco3d:SpacePointSolver                   5.98713       12.0491       17.4819       11.9023       3.25275        13     
produce:hitpdune:DisambigFromSpacePoints         0.0689592     0.175235      0.275774      0.158962      0.0580364       13     
produce:pandora:StandardPandora                   18.5526       53.2603       120.938       42.643        29.9081        13     
produce:pandoraTrack:LArPandoraTrackCreation     0.293014      0.845092       1.33154      0.885628      0.330766        13     
produce:pandoraGnocalo:GnocchiCalorimetry        0.0166251     0.0278249     0.0361108     0.028843     0.00524063       13     
[art]:TriggerResults:TriggerResultInserter      2.2523e-05    3.77167e-05   0.000107132   2.9386e-05    2.22234e-05      13     
end_path:out1:RootOutput                         5.62e-06     1.06123e-05   3.4054e-05     9.107e-06    6.88409e-06      13     
end_path:out1:RootOutput(write)                   3.87739       4.65407       5.98012       4.26636      0.735918        13     
==================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 8589.84 MB
  Peak resident set size usage (VmHWM): 6711.62 MB
  Details saved in: 'mem.db'
====================================================================================================
%MSG-s ArtException:  PostEndJob 16-Sep-2025 19:33:03 CEST ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- ScheduleExecutionFailure BEGIN
    Path: ProcessingStopped.
    ---- BadAlloc BEGIN
      A bad_alloc exception was thrown while processing module WireCellToolkit/wclsdatavd run: 39324 subRun: 1 event: 331747
      The job has probably exhausted the virtual memory available to the process.
    ---- BadAlloc END
    Exception going through path produce
  ---- ScheduleExecutionFailure END
---- EventProcessorFailure END
%MSG
Art has completed and will exit with status 1.
Error in reco1
justIN time: 2025-09-18 19:28:55 UTC       justIN version: 01.05.00