justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 239244.27@dunegpschedd01.fnal.gov

Jobsub ID239244.27@dunegpschedd01.fnal.gov
Workflow ID9410
Stage ID1
User nameykermaid@fnal.gov
HTCondor Groupgroup_dune.prod_mcsim
RequestedProcessors1
GPUNo
RSS bytes4193255424 (3999 MiB)
Wall seconds limit18000 (5 hours)
Submitted time2025-10-28 12:16:10
SiteNL_NIKHEF
EntryVIRGO_NL_NIKHEF_klomp
Last heartbeat2025-10-28 12:55:41
From worker nodeHostnamewn-sate-054.farm.nikhef.nl
cpuinfoAMD EPYC 7551P 32-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit129600 (36 hours)
GPU
Inner Apptainer?True
Job stateaborted
Started2025-10-28 12:21:06
Input filesvd-protodune:np02vd_raw_run040267_0421_df-s04-d2_dw_0_20251026T061914.hdf5
Outputting started 
Output files
Finished2025-10-28 12:55:15
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

071 XUs and 1561 XVs -> 53 XUVs
C:0 T:6 314 XUs and 285 XVs -> 9 XUVs
C:0 T:7 1258 XUs and 1165 XVs -> 45 XUVs
C:0 T:8 353 XUs and 641 XVs -> 155 XUVs
C:0 T:9 610 XUs and 935 XVs -> 111 XUVs
C:0 T:10 300 XUs and 830 XVs -> 63 XUVs
C:0 T:11 81 XUs and 111 XVs -> 12 XUVs
C:0 T:12 2214 XUs and 2813 XVs -> 113 XUVs
C:0 T:13 2444 XUs and 3293 XVs -> 186 XUVs
C:0 T:14 1478 XUs and 1631 XVs -> 66 XUVs
C:0 T:15 2105 XUs and 2585 XVs -> 171 XUVs
1141 XUVs total
882 collection wire objects
1141 potential space points
Neighbour search...
12633 tests to find 7302 neighbours
Iterating with no regularization...
Begin: 7.22759e+08
0 7.14379e+08
1 7.1419e+08
Now with regularization...
Begin: 7.06262e+08
0 7.06249e+08
BdtBeamParticleIdTool::SliceFeatures::GetLeadingCaloHits - empty calo hit list
BdtBeamParticleIdTool::SliceFeatures::GetLeadingCaloHits - empty calo hit list
RawFrameSource: got 12288 raw::RawDigit objects
	input nticks=9024 keeping as is
[13:47:36.010] D [  main  ] executing 1 apps, thread limit 0:
[13:47:36.010] D [  main  ] executing 1 apps, thread limit 0:
[13:47:36.010] D [  main  ] executing app: "Pgrapher"
[13:47:36.010] D [ pgraph ] <Pgrapher:> executing graph 
[13:47:36.011] D [ pgraph ] executing with 26 nodes
[13:47:36.012] D [  glue  ] <FrameFanout:nfsp> call=20: input: frame: ident=78022 time=49 tick=512 with 12288 traces.  frame tags:[ "orig" ] 0 tagged trace sets:[ ] cmm:[ ] output 0: frame: ident=78022 time=49 tick=512 with 12288 traces.  frame tags:[ "orig0" ] 0 tagged trace sets:[ ] cmm:[ ] output 1: frame: ident=78022 time=49 tick=512 with 12288 traces.  frame tags:[ "orig1" ] 0 tagged trace sets:[ ] cmm:[ ] output 2: frame: ident=78022 time=49 tick=512 with 12288 traces.  frame tags:[ "orig2" ] 0 tagged trace sets:[ ] cmm:[ ] output 3: frame: ident=78022 time=49 tick=512 with 12288 traces.  frame tags:[ "orig3" ] 0 tagged trace sets:[ ] cmm:[ ] output 4: frame: ident=78022 time=49 tick=512 with 12288 traces.  frame tags:[ "orig4" ] 0 tagged trace sets:[ ] cmm:[ ] output 5: frame: ident=78022 time=49 tick=512 with 12288 traces.  frame tags:[ "orig5" ] 0 tagged trace sets:[ ] cmm:[ ] output 6: frame: ident=78022 time=49 tick=512 with 12288 traces.  frame tags:[ "orig6" ] 0 tagged trace sets:[ ] cmm:[ ] output 7: frame: ident=78022 time=49 tick=512 with 12288 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]  
[13:47:36.012] W [  glue  ] <ChannelSelector:chsel7> Untagged summary not supported, summary will be dropped. 
[13:47:36.013] D [  glue  ] <ChannelSelector:chsel7> input frame: ident=78022 time=49 tick=512 with 12288 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] output: frame: ident=78022 time=49 tick=512 with 1536 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] 
[13:47:36.013] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=20 input frame: frame: ident=78022 time=49 tick=512 with 1536 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] 
[13:47:36.013] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=20 init nticks=9024 tbinmin=0 tbinmax=9024 
[13:47:36.041] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=20 load plane index: 0, ntraces=1536, input bad regions: 0 
[13:47:38.843] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=20 load plane index: 1, ntraces=1536, input bad regions: 0 
[13:47:41.649] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=20 load plane index: 2, ntraces=1536, input bad regions: 0 
[13:47:50.814] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=20 save plane index: 0, Qtot=570575597 Qloss=-60326046, 10429 indices spanning [54013,64441] "wiener" 
[13:47:51.128] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=20 save plane index: 0, Qtot=524583120 Qloss=-49029885, 8095 indices spanning [64442,72536] "gauss" 
[13:47:51.901] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=20 save plane index: 1, Qtot=450311703 Qloss=-42897379, 9349 indices spanning [72537,81885] "wiener" 
[13:47:52.209] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=20 save plane index: 1, Qtot=408795510 Qloss=-32837433, 7163 indices spanning [81886,89048] "gauss" 
[13:47:52.753] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=20 save plane index: 2, Qtot=401054692 Qloss=-25942332, 10620 indices spanning [89049,99668] "wiener" 
[13:47:53.289] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=20 save plane index: 2, Qtot=386888029 Qloss=-11535020, 8555 indices spanning [99669,108223] "gauss" 
[13:47:53.289] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=20 produce 108224 traces: 30398 wiener7, 0 decon_charge7, 23813 gauss7, frame tag: sigproc 
[13:47:53.289] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=20 output frame: frame: ident=78022 time=49 tick=512 with 108224 traces.  frame tags:[ "sigproc" ] 4 tagged trace sets:[ "gauss7":23813 [0] "mp2_roi7":43667 [0] "mp3_roi7":10346 [0] "wiener7":30398 [30398] ] cmm:[ ] 
[13:47:53.703] W [  glue  ] <ChannelSelector:chsel6> Untagged summary not supported, summary will be dropped. 
[13:47:53.703] D [  glue  ] <ChannelSelector:chsel6> input frame: ident=78022 time=49 tick=512 with 12288 traces.  frame tags:[ "orig6" ] 0 tagged trace sets:[ ] cmm:[ ] output: frame: ident=78022 time=49 tick=512 with 1536 traces.  frame tags:[ "orig6" ] 0 tagged trace sets:[ ] cmm:[ ] 
[13:47:53.704] D [sigproc ] <OmnibusSigProc:anode6sigproc6> call=20 input frame: frame: ident=78022 time=49 tick=512 with 1536 traces.  frame tags:[ "orig6" ] 0 tagged trace sets:[ ] cmm:[ ] 
[13:47:53.704] D [sigproc ] <OmnibusSigProc:anode6sigproc6> call=20 init nticks=9024 tbinmin=0 tbinmax=9024 
[13:47:53.739] D [sigproc ] <OmnibusSigProc:anode6sigproc6> call=20 load plane index: 0, ntraces=1536, input bad regions: 0 
[13:47:56.571] D [sigproc ] <OmnibusSigProc:anode6sigproc6> call=20 load plane index: 1, ntraces=1536, input bad regions: 0 
[13:47:59.369] D [sigproc ] <OmnibusSigProc:anode6sigproc6> call=20 load plane index: 2, ntraces=1536, input bad regions: 0 

==================================================================================================================================
TimeTracker printout (sec)                          Min           Avg           Max         Median          RMS         nEvts   
==================================================================================================================================
Full event                                      7.0302e-05      137.112       185.275       149.106       47.3993        11     
----------------------------------------------------------------------------------------------------------------------------------
source:HDF5RawInput3(read)                      5.9171e-05    8.10395e-05   0.000213772   6.6946e-05    4.23662e-05      11     
produce:tpcrawdecoder:PDVDTPCReader               26.3719       34.9228       52.2037       34.1073       7.38111        11     
produce:triggerrawdecoder:PDVDTriggerReader4     0.293241      0.316605      0.384021      0.295997      0.0298548       11     
produce:pdvddaphne:DAPHNEReaderPDVD             0.000370847   0.000415359   0.000772732   0.000382309   0.00011312       11     
produce:ophit:OpHitFinder                       6.2547e-05    0.000112952   0.000577415   6.5743e-05    0.000146916      11     
produce:opflash:OpFlashFinderVerticalDrift      5.1477e-05    8.13666e-05   0.000347022   5.5324e-05    8.40265e-05      11     
produce:wclsdatavd:WireCellToolkit                52.627        68.3952       94.1837       64.5653       12.079         10     
produce:gaushit:GausHitFinder                    0.683995       1.25061       1.61461       1.28011      0.265919        10     
produce:nhitsfilter:NumberOfHitsFilter           0.0001701    0.000298261   0.000517252   0.000286087   8.72782e-05      10     
produce:reco3d:SpacePointSolver                   5.48534       12.0624       17.2735       12.1431       3.29545        10     
produce:hitpdune:DisambigFromSpacePoints         0.0697091     0.167689      0.301107      0.154428      0.0647709       10     
produce:pandora:StandardPandora                   11.5795       27.9621       62.0237       25.0803       13.486         10     
produce:pandoraTrack:LArPandoraTrackCreation     0.289383       0.97554       1.88097       0.81357      0.481155        10     
produce:pandoraGnocalo:GnocchiCalorimetry        0.014647      0.0333287     0.0455908     0.0346145    0.00782241       10     
[art]:TriggerResults:TriggerResultInserter      2.1871e-05    3.34028e-05   8.3086e-05    2.5984e-05    1.75319e-05      10     
end_path:out1:RootOutput                         6.883e-06    9.8304e-06    2.7071e-05     8.11e-06     5.7764e-06       10     
end_path:out1:RootOutput(write)                   4.3594        4.96168       5.72017       4.82995      0.409854        10     
==================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 8589.87 MB
  Peak resident set size usage (VmHWM): 6703.92 MB
  Details saved in: 'mem.db'
====================================================================================================
%MSG-s ArtException:  PostEndJob 28-Oct-2025 13:49:26 CET ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- ScheduleExecutionFailure BEGIN
    Path: ProcessingStopped.
    ---- BadAlloc BEGIN
      A bad_alloc exception was thrown while processing module WireCellToolkit/wclsdatavd run: 40267 subRun: 1 event: 78022
      The job has probably exhausted the virtual memory available to the process.
    ---- BadAlloc END
    Exception going through path produce
  ---- ScheduleExecutionFailure END
---- EventProcessorFailure END
%MSG
Art has completed and will exit with status 1.
Error in reco1
justIN time: 2025-11-05 09:06:10 UTC       justIN version: 01.05.01