justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 231192.14@dunegpschedd02.fnal.gov

Jobsub ID231192.14@dunegpschedd02.fnal.gov
Workflow ID9173
Stage ID1
User nameykermaid@fnal.gov
HTCondor Groupgroup_dune.prod_mcsim
RequestedProcessors1
GPUNo
RSS bytes4193255424 (3999 MiB)
Wall seconds limit18000 (5 hours)
Submitted time2025-10-20 08:38:40
SiteFR_CCIN2P3
EntryDUNE_FR_CCIN2P3_cccondorce02
Last heartbeat2025-10-20 08:51:21
From worker nodeHostnameccwcondor0012
cpuinfoAMD EPYC 9334 32-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit106200 (29 hours)
GPU
Inner Apptainer?True
Job statejobscript_error
Started2025-10-20 08:39:50
Input filesvd-protodune:np02vd_raw_run040140_2307_df-s04-d0_dw_0_20251020T071245.hdf5
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Max RSS bytes0 (0 MiB)
Outputting started 
Output files
Finished2025-10-20 08:51:21
Saved logsjustin-logs:231192.14-dunegpschedd02.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

window for channel 12280
wclsFrameSaver: no samples within desired window for channel 12280
wclsFrameSaver: no samples within desired window for channel 12280
wclsFrameSaver: no samples within desired window for channel 12280
wclsFrameSaver: no samples within desired window for channel 12281
wclsFrameSaver: no samples within desired window for channel 12281
wclsFrameSaver: no samples within desired window for channel 12281
wclsFrameSaver: no samples within desired window for channel 12281
wclsFrameSaver: no samples within desired window for channel 12282
wclsFrameSaver: no samples within desired window for channel 12282
wclsFrameSaver: no samples within desired window for channel 12282
wclsFrameSaver: no samples within desired window for channel 12282
wclsFrameSaver: no samples within desired window for channel 12282
wclsFrameSaver: no samples within desired window for channel 12283
wclsFrameSaver: no samples within desired window for channel 12283
wclsFrameSaver: no samples within desired window for channel 12283
wclsFrameSaver: no samples within desired window for channel 12283
wclsFrameSaver: no samples within desired window for channel 12283
wclsFrameSaver: no samples within desired window for channel 12284
wclsFrameSaver: no samples within desired window for channel 12284
wclsFrameSaver: no samples within desired window for channel 12284
wclsFrameSaver: no samples within desired window for channel 12285
wclsFrameSaver: no samples within desired window for channel 12285
wclsFrameSaver: no samples within desired window for channel 12285
wclsFrameSaver: no samples within desired window for channel 12285
wclsFrameSaver: no samples within desired window for channel 12285
wclsFrameSaver: no samples within desired window for channel 12286
wclsFrameSaver: no samples within desired window for channel 12286
wclsFrameSaver: no samples within desired window for channel 12286
wclsFrameSaver: no samples within desired window for channel 12286
wclsFrameSaver: no samples within desired window for channel 12286
wclsFrameSaver: no samples within desired window for channel 12286
wclsFrameSaver: no samples within desired window for channel 12286
wclsFrameSaver: no samples within desired window for channel 12286
wclsFrameSaver: no samples within desired window for channel 12287
wclsFrameSaver: no samples within desired window for channel 12287
wclsFrameSaver: no samples within desired window for channel 12287
wclsFrameSaver: no samples within desired window for channel 12287
wclsFrameSaver: no samples within desired window for channel 12287
FrameSaver: q=1.8718e+07 n=1333245 tag=wiener
0 X, 0 U, 0 V bad channels
Finding XUV coincidences...
C:0 T:0 1451 XUs and 1623 XVs -> 98 XUVs
C:0 T:1 11331 XUs and 13267 XVs -> 869 XUVs
C:0 T:2 343 XUs and 291 XVs -> 9 XUVs
C:0 T:3 964 XUs and 846 XVs -> 39 XUVs
C:0 T:4 3233 XUs and 4154 XVs -> 157 XUVs
C:0 T:5 862 XUs and 1290 XVs -> 33 XUVs
C:0 T:6 1351 XUs and 1529 XVs -> 87 XUVs
C:0 T:7 11819 XUs and 15691 XVs -> 1181 XUVs
C:0 T:8 1406 XUs and 1421 XVs -> 76 XUVs
C:0 T:9 2305 XUs and 2352 XVs -> 109 XUVs
C:0 T:10 1357 XUs and 1028 XVs -> 50 XUVs
C:0 T:11 1132 XUs and 1544 XVs -> 41 XUVs
C:0 T:12 2006 XUs and 2116 XVs -> 124 XUVs
C:0 T:13 522 XUs and 413 XVs -> 23 XUVs
C:0 T:14 1190 XUs and 1315 XVs -> 100 XUVs
C:0 T:15 2592 XUs and 2015 XVs -> 124 XUVs
3120 XUVs total
1765 collection wire objects
3120 potential space points
Neighbour search...
88696 tests to find 22310 neighbours
Iterating with no regularization...
Begin: 7.47639e+09
0 7.06857e+09
1 7.04355e+09
2 7.04248e+09
Now with regularization...
Begin: 6.91489e+09
0 6.91451e+09
RawFrameSource: got 12288 raw::RawDigit objects
	input nticks=128 keeping as is
[10:50:55.111] D [  main  ] executing 1 apps, thread limit 0:
[10:50:55.111] D [  main  ] executing 1 apps, thread limit 0:
[10:50:55.111] D [  main  ] executing app: "Pgrapher"
[10:50:55.111] D [ pgraph ] <Pgrapher:> executing graph 
[10:50:55.111] D [ pgraph ] executing with 26 nodes
[10:50:55.112] D [  glue  ] <FrameFanout:nfsp> call=14: input: frame: ident=404500 time=22 tick=512 with 12288 traces.  frame tags:[ "orig" ] 0 tagged trace sets:[ ] cmm:[ ] output 0: frame: ident=404500 time=22 tick=512 with 12288 traces.  frame tags:[ "orig0" ] 0 tagged trace sets:[ ] cmm:[ ] output 1: frame: ident=404500 time=22 tick=512 with 12288 traces.  frame tags:[ "orig1" ] 0 tagged trace sets:[ ] cmm:[ ] output 2: frame: ident=404500 time=22 tick=512 with 12288 traces.  frame tags:[ "orig2" ] 0 tagged trace sets:[ ] cmm:[ ] output 3: frame: ident=404500 time=22 tick=512 with 12288 traces.  frame tags:[ "orig3" ] 0 tagged trace sets:[ ] cmm:[ ] output 4: frame: ident=404500 time=22 tick=512 with 12288 traces.  frame tags:[ "orig4" ] 0 tagged trace sets:[ ] cmm:[ ] output 5: frame: ident=404500 time=22 tick=512 with 12288 traces.  frame tags:[ "orig5" ] 0 tagged trace sets:[ ] cmm:[ ] output 6: frame: ident=404500 time=22 tick=512 with 12288 traces.  frame tags:[ "orig6" ] 0 tagged trace sets:[ ] cmm:[ ] output 7: frame: ident=404500 time=22 tick=512 with 12288 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]  
[10:50:55.112] W [  glue  ] <ChannelSelector:chsel7> Untagged summary not supported, summary will be dropped. 
[10:50:55.113] D [  glue  ] <ChannelSelector:chsel7> input frame: ident=404500 time=22 tick=512 with 12288 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] output: frame: ident=404500 time=22 tick=512 with 1536 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] 
[10:50:55.113] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=14 input frame: frame: ident=404500 time=22 tick=512 with 1536 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] 
[10:50:55.113] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=14 init nticks=128 tbinmin=0 tbinmax=128 
[10:50:55.126] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=14 load plane index: 0, ntraces=1536, input bad regions: 0 

==================================================================================================================================
TimeTracker printout (sec)                          Min           Avg           Max         Median          RMS         nEvts   
==================================================================================================================================
Full event                                        61.5011       90.8571       174.232       75.9594       36.6207         7     
----------------------------------------------------------------------------------------------------------------------------------
source:HDF5RawInput3(read)                      6.4488e-05    9.10433e-05   0.000211431   7.46585e-05   4.62568e-05       8     
produce:tpcrawdecoder:PDVDTPCReader               1.24938       5.67847       10.8115       5.30539       2.47668         8     
produce:triggerrawdecoder:PDVDTriggerReader4     0.046196      0.0495098     0.0565602     0.0475221     0.0038262        8     
produce:pdvddaphne:DAPHNEReaderPDVD             0.00026386      1.53778       1.99327       1.79833      0.610201         8     
produce:ophit:OpHitFinder                       6.2074e-05     0.0254714     0.0361656     0.0291911     0.0103095        8     
produce:opflash:OpFlashFinderVerticalDrift       3.963e-05    0.00234883    0.00398556    0.00265118    0.00138162        8     
produce:wclsdatavd:WireCellToolkit                34.2118       47.3757       95.975        36.5719       20.8682         7     
produce:gaushit:GausHitFinder                    0.448778      0.822741       1.14735      0.805973       0.24675         7     
produce:nhitsfilter:NumberOfHitsFilter          0.000116767   0.000260444   0.00045483    0.000236048   0.000125752       7     
produce:reco3d:SpacePointSolver                   4.62038       8.70379       14.1233       9.22987       2.8611          7     
produce:hitpdune:DisambigFromSpacePoints         0.0996679     0.146605      0.209432       0.13402      0.0334224        7     
produce:pandora:StandardPandora                   8.94731       20.1641       38.7197       21.099        9.59125         7     
produce:pandoraTrack:LArPandoraTrackCreation       0.384        0.87729       2.14317      0.787593      0.552344         7     
produce:pandoraGnocalo:GnocchiCalorimetry        0.0126796     0.0260276     0.0563121     0.0216442     0.0133685        7     
[art]:TriggerResults:TriggerResultInserter      1.9279e-05    4.27163e-05   9.6497e-05     2.633e-05    2.90942e-05       7     
end_path:out1:RootOutput                         7.652e-06    1.19079e-05   2.5108e-05     8.382e-06    6.2104e-06        7     
end_path:out1:RootOutput(write)                   2.99111       4.51719       8.95574       4.10553       1.90527         7     
==================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 5898.04 MB
  Peak resident set size usage (VmHWM): 3943.28 MB
  Details saved in: 'mem.db'
====================================================================================================
%MSG-s ArtException:  PostEndJob 20-Oct-2025 10:50:55 CEST ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- ScheduleExecutionFailure BEGIN
    Path: ProcessingStopped.
    ---- BadAlloc BEGIN
      A bad_alloc exception was thrown while processing module WireCellToolkit/wclsdatavd run: 40140 subRun: 1 event: 404500
      The job has probably exhausted the virtual memory available to the process.
    ---- BadAlloc END
    Exception going through path produce
  ---- ScheduleExecutionFailure END
---- EventProcessorFailure END
%MSG
Art has completed and will exit with status 1.
Error in reco1
justIN time: 2025-11-05 09:03:30 UTC       justIN version: 01.05.01