justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 40398.45@dunegpschedd02.fnal.gov

Jobsub ID40398.45@dunegpschedd02.fnal.gov
Workflow ID2644
Stage ID1
User nameykermaid@fnal.gov
HTCondor Groupgroup_dune.prod_mcsim
RequestedProcessors1
GPUNo
RSS bytes4193255424 (3999 MiB)
Wall seconds limit18000 (5 hours)
Submitted time2025-09-16 12:44:56
SiteNL_NIKHEF
EntryVIRGO_NL_NIKHEF_juk
Last heartbeat2025-09-16 14:07:47
From worker nodeHostnamewn-snel-030.farm.nikhef.nl
cpuinfoAMD EPYC 7H12 64-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit129600 (36 hours)
GPU
Inner Apptainer?True
Job statejobscript_error
Started2025-09-16 13:53:51
Input filesvd-protodune:np02vd_raw_run039353_0986_df-s05-d3_dw_0_20250916T114453.hdf5
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Max RSS bytes0 (0 MiB)
Outputting started 
Output files
Finished2025-09-16 14:07:47
Saved logsjustin-logs:40398.45-dunegpschedd02.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

channel 9013
wclsFrameSaver: no samples within desired window for channel 9013
wclsFrameSaver: no samples within desired window for channel 9015
wclsFrameSaver: no samples within desired window for channel 9016
wclsFrameSaver: no samples within desired window for channel 9016
wclsFrameSaver: no samples within desired window for channel 9017
wclsFrameSaver: no samples within desired window for channel 9017
wclsFrameSaver: no samples within desired window for channel 9601
wclsFrameSaver: no samples within desired window for channel 9604
wclsFrameSaver: no samples within desired window for channel 9605
wclsFrameSaver: no samples within desired window for channel 9705
wclsFrameSaver: no samples within desired window for channel 9838
wclsFrameSaver: no samples within desired window for channel 9850
wclsFrameSaver: no samples within desired window for channel 10146
wclsFrameSaver: no samples within desired window for channel 10147
wclsFrameSaver: no samples within desired window for channel 10157
wclsFrameSaver: no samples within desired window for channel 10228
wclsFrameSaver: no samples within desired window for channel 10286
wclsFrameSaver: no samples within desired window for channel 10307
wclsFrameSaver: no samples within desired window for channel 10336
wclsFrameSaver: no samples within desired window for channel 10499
wclsFrameSaver: no samples within desired window for channel 10500
wclsFrameSaver: no samples within desired window for channel 11020
wclsFrameSaver: no samples within desired window for channel 11640
wclsFrameSaver: no samples within desired window for channel 11640
wclsFrameSaver: no samples within desired window for channel 11641
wclsFrameSaver: no samples within desired window for channel 11758
wclsFrameSaver: no samples within desired window for channel 11759
wclsFrameSaver: no samples within desired window for channel 11760
wclsFrameSaver: no samples within desired window for channel 11761
wclsFrameSaver: no samples within desired window for channel 12274
wclsFrameSaver: no samples within desired window for channel 12275
wclsFrameSaver: no samples within desired window for channel 12276
wclsFrameSaver: no samples within desired window for channel 12276
wclsFrameSaver: no samples within desired window for channel 12278
FrameSaver: q=9.37509e+06 n=1108623 tag=wiener
0 X, 0 U, 0 V bad channels
Finding XUV coincidences...
C:0 T:0 2070 XUs and 1863 XVs -> 90 XUVs
C:0 T:1 225 XUs and 290 XVs -> 20 XUVs
C:0 T:2 2184 XUs and 2302 XVs -> 83 XUVs
C:0 T:3 178 XUs and 234 XVs -> 8 XUVs
C:0 T:4 362 XUs and 560 XVs -> 16 XUVs
C:0 T:5 2534 XUs and 2581 XVs -> 103 XUVs
C:0 T:6 1269 XUs and 1187 XVs -> 37 XUVs
C:0 T:7 374 XUs and 387 XVs -> 32 XUVs
C:0 T:8 1376 XUs and 2166 XVs -> 92 XUVs
C:0 T:9 3136 XUs and 4103 XVs -> 185 XUVs
C:0 T:10 1860 XUs and 2324 XVs -> 131 XUVs
C:0 T:11 2903 XUs and 2441 XVs -> 135 XUVs
C:0 T:12 674 XUs and 1188 XVs -> 44 XUVs
C:0 T:13 1567 XUs and 2194 XVs -> 112 XUVs
C:0 T:14 3090 XUs and 2739 XVs -> 155 XUVs
C:0 T:15 1848 XUs and 1985 XVs -> 74 XUVs
1317 XUVs total
1163 collection wire objects
1317 potential space points
Neighbour search...
6393 tests to find 3368 neighbours
Iterating with no regularization...
Begin: 4.24088e+08
0 4.1633e+08
1 4.16227e+08
Now with regularization...
Begin: 4.10622e+08
0 4.1062e+08
BdtBeamParticleIdTool::SliceFeatures::GetLeadingCaloHits - empty calo hit list
RawFrameSource: got 12288 raw::RawDigit objects
	input nticks=10000 keeping as is
[16:06:41.661] D [  main  ] executing 1 apps, thread limit 0:
[16:06:41.661] D [  main  ] executing 1 apps, thread limit 0:
[16:06:41.661] D [  main  ] executing app: "Pgrapher"
[16:06:41.661] D [ pgraph ] <Pgrapher:> executing graph 
[16:06:41.661] D [ pgraph ] executing with 26 nodes
[16:06:41.662] D [  glue  ] <FrameFanout:nfsp> call=6: input: frame: ident=75911 time=12 tick=512 with 12288 traces.  frame tags:[ "orig" ] 0 tagged trace sets:[ ] cmm:[ ] output 0: frame: ident=75911 time=12 tick=512 with 12288 traces.  frame tags:[ "orig0" ] 0 tagged trace sets:[ ] cmm:[ ] output 1: frame: ident=75911 time=12 tick=512 with 12288 traces.  frame tags:[ "orig1" ] 0 tagged trace sets:[ ] cmm:[ ] output 2: frame: ident=75911 time=12 tick=512 with 12288 traces.  frame tags:[ "orig2" ] 0 tagged trace sets:[ ] cmm:[ ] output 3: frame: ident=75911 time=12 tick=512 with 12288 traces.  frame tags:[ "orig3" ] 0 tagged trace sets:[ ] cmm:[ ] output 4: frame: ident=75911 time=12 tick=512 with 12288 traces.  frame tags:[ "orig4" ] 0 tagged trace sets:[ ] cmm:[ ] output 5: frame: ident=75911 time=12 tick=512 with 12288 traces.  frame tags:[ "orig5" ] 0 tagged trace sets:[ ] cmm:[ ] output 6: frame: ident=75911 time=12 tick=512 with 12288 traces.  frame tags:[ "orig6" ] 0 tagged trace sets:[ ] cmm:[ ] output 7: frame: ident=75911 time=12 tick=512 with 12288 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]  
[16:06:41.663] W [  glue  ] <ChannelSelector:chsel7> Untagged summary not supported, summary will be dropped. 
[16:06:41.663] D [  glue  ] <ChannelSelector:chsel7> input frame: ident=75911 time=12 tick=512 with 12288 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] output: frame: ident=75911 time=12 tick=512 with 1536 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] 
[16:06:41.664] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=6 input frame: frame: ident=75911 time=12 tick=512 with 1536 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] 
[16:06:41.664] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=6 init nticks=10000 tbinmin=0 tbinmax=10000 
[16:06:41.710] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=6 load plane index: 0, ntraces=1536, input bad regions: 0 
[16:06:44.203] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=6 load plane index: 1, ntraces=1536, input bad regions: 0 
[16:06:45.885] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=6 load plane index: 2, ntraces=1536, input bad regions: 0 

==================================================================================================================================
TimeTracker printout (sec)                          Min           Avg           Max         Median          RMS         nEvts   
==================================================================================================================================
Full event                                      7.8186e-05      163.121       246.309       203.087       96.1585         4     
----------------------------------------------------------------------------------------------------------------------------------
source:HDF5RawInput3(read)                      7.5251e-05    0.000113954   0.000225723    7.742e-05    6.45385e-05       4     
produce:tpcrawdecoder:PDVDTPCReader               81.1303       87.9226       91.3993       89.5804       4.13588         4     
produce:triggerrawdecoder:PDVDTriggerReader4     0.292578      0.296927      0.307164      0.293984      0.0059779        4     
produce:pdvddaphne:DAPHNEReaderPDVD             0.000407865   0.000543046   0.000797536   0.000483391   0.000158548       4     
produce:ophit:OpHitFinder                       0.000131307   0.000333834   0.000836709   0.00018366    0.000292941       4     
produce:opflash:OpFlashFinderVerticalDrift      5.6466e-05    0.000134682   0.000364283   5.89905e-05   0.000132567       4     
produce:wclsdatavd:WireCellToolkit                56.0653       61.4109       72.0319       56.1354       7.51028         3     
produce:gaushit:GausHitFinder                    0.974093       1.16483       1.32141       1.19897      0.143832         3     
produce:nhitsfilter:NumberOfHitsFilter          0.000283211   0.000377261   0.000460804   0.000387767   7.28817e-05       3     
produce:reco3d:SpacePointSolver                   11.0932       13.4995       15.9832       13.4221       1.9971          3     
produce:hitpdune:DisambigFromSpacePoints         0.168809      0.180411      0.200023      0.172399      0.0139457        3     
produce:pandora:StandardPandora                   26.9239       47.8246       59.4514       57.0986       14.8102         3     
produce:pandoraTrack:LArPandoraTrackCreation     0.709746      0.897301       1.20278      0.779378      0.217868         3     
produce:pandoraGnocalo:GnocchiCalorimetry        0.0265278     0.0291841     0.0320615     0.028963     0.00226453        3     
[art]:TriggerResults:TriggerResultInserter      1.4006e-05    2.36977e-05   4.1678e-05    1.5409e-05    1.27269e-05       3     
end_path:out1:RootOutput                         3.407e-06     9.565e-06    1.6341e-05     8.947e-06    5.29834e-06       3     
end_path:out1:RootOutput(write)                   4.20221       4.25186       4.30076       4.25261      0.0402366        3     
==================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 8589.63 MB
  Peak resident set size usage (VmHWM): 6679.35 MB
  Details saved in: 'mem.db'
====================================================================================================
%MSG-s ArtException:  PostEndJob 16-Sep-2025 16:07:29 CEST ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- ScheduleExecutionFailure BEGIN
    Path: ProcessingStopped.
    ---- BadAlloc BEGIN
      A bad_alloc exception was thrown while processing module WireCellToolkit/wclsdatavd run: 39353 subRun: 1 event: 75911
      The job has probably exhausted the virtual memory available to the process.
    ---- BadAlloc END
    Exception going through path produce
  ---- ScheduleExecutionFailure END
---- EventProcessorFailure END
%MSG
Art has completed and will exit with status 1.
Error in reco1
justIN time: 2025-09-19 00:21:14 UTC       justIN version: 01.05.00