
Jobsub ID 233605.0@dunegpschedd02.fnal.gov

Jobsub ID: 233605.0@dunegpschedd02.fnal.gov
Workflow ID: 9152
Stage ID: 1
User name: ykermaid@fnal.gov
HTCondor Group: group_dune.prod_mcsim
Requested:
  Processors: 1
  GPU: No
  RSS bytes: 4193255424 (3999 MiB)
  Wall seconds limit: 18000 (5 hours)
Submitted time: 2025-10-28 10:30:03
Site: NL_NIKHEF
Entry: VIRGO_NL_NIKHEF_juk
Last heartbeat: 2025-10-28 12:06:12
From worker node:
  Hostname: wn-lot-030.farm.nikhef.nl
  cpuinfo: AMD EPYC 7702P 64-Core Processor
  OS release: Scientific Linux release 7.9 (Nitrogen)
  Processors: 1
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 129600 (36 hours)
  GPU:
  Inner Apptainer?: True
Job state: jobscript_error
Started: 2025-10-28 10:30:18
Input files: vd-protodune:np02vd_raw_run040140_1655_df-s04-d1_dw_0_20251019T134635.hdf5
Jobscript:
  Exit code: 1
  Real time: 0m (0s)
  CPU time: 0m (0s = 0%)
  Max RSS bytes: 0 (0 MiB)
Outputting started:
Output files:
Finished: 2025-10-28 12:06:12
Saved logs: justin-logs:233605.0-dunegpschedd02.fnal.gov.logs.tgz

Jobscript log (last 10,000 characters)

OS
[12:54:28.940] D [ pgraph ] <Pgrapher:> graph execution complete 
[12:54:28.940] I [ timer  ] Timer: WireCell::SigProc::OmnibusSigProc : 42.51 sec
[12:54:28.940] I [ timer  ] Timer: WireCell::SigProc::OmnibusSigProc : 18.09 sec
[12:54:28.940] I [ timer  ] Timer: WireCell::SigProc::OmnibusSigProc : 12.98 sec
[12:54:28.940] I [ timer  ] Timer: WireCell::SigProc::OmnibusSigProc : 12.8 sec
[12:54:28.940] I [ timer  ] Timer: WireCell::SigProc::OmnibusSigProc : 12.55 sec
[12:54:28.940] I [ timer  ] Timer: WireCell::SigProc::OmnibusSigProc : 12.52 sec
[12:54:28.940] I [ timer  ] Timer: WireCell::SigProc::OmnibusSigProc : 11.91 sec
[12:54:28.940] I [ timer  ] Timer: WireCell::SigProc::OmnibusSigProc : 11.75 sec
[12:54:28.940] I [ timer  ] Timer: WireCell::Aux::Resampler : 0.69 sec
[12:54:28.940] I [ timer  ] Timer: WireCell::Aux::Resampler : 0.68 sec
[12:54:28.940] I [ timer  ] Timer: WireCell::Aux::Resampler : 0.68 sec
[12:54:28.940] I [ timer  ] Timer: WireCell::Aux::Resampler : 0.67 sec
[12:54:28.940] I [ timer  ] Timer: WireCell::Gen::FrameFanin : 0.04 sec
[12:54:28.940] I [ timer  ] Timer: WireCell::SigProc::ChannelSelector : 0.01 sec
[12:54:28.940] I [ timer  ] Timer: WireCell::SigProc::ChannelSelector : 0.01 sec
[12:54:28.940] I [ timer  ] Timer: WireCell::SigProc::ChannelSelector : 0.01 sec
[12:54:28.941] I [ timer  ] Timer: WireCell::Gen::Retagger : 0.01 sec
[12:54:28.941] I [ timer  ] Timer: WireCell::Gen::FrameFanout : 0.01 sec
[12:54:28.941] I [ timer  ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[12:54:28.941] I [ timer  ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[12:54:28.941] I [ timer  ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[12:54:28.941] I [ timer  ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[12:54:28.941] I [ timer  ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[12:54:28.941] I [ timer  ] Timer: WireCell::Gen::DumpFrames : 0 sec
[12:54:28.941] I [ timer  ] Timer: wcls::RawFrameSource : 0 sec
[12:54:28.941] I [ timer  ] Timer: wcls::FrameSaver : 0 sec
[12:54:28.941] I [ timer  ] Timer: Total node execution : 137.9199987296015 sec
wclsFrameSaver saving cooked to 10000 ticks
wclsFrameSaver: saving 106392 traces tagged "gauss"
FrameSaver: q=7.40912e+07 n=2883769 tag=gauss
wclsFrameSaver: saving 134922 traces tagged "wiener"
FrameSaver: q=7.69596e+07 n=2761739 tag=wiener
0 X, 0 U, 0 V bad channels
Finding XUV coincidences...
C:0 T:0 2859 XUs and 2828 XVs -> 152 XUVs
C:0 T:1 2261 XUs and 3084 XVs -> 132 XUVs
C:0 T:2 1241 XUs and 1489 XVs -> 32 XUVs
C:0 T:3 1509 XUs and 1859 XVs -> 59 XUVs
C:0 T:4 52340 XUs and 64868 XVs -> 3940 XUVs
C:0 T:5 1685 XUs and 2724 XVs -> 75 XUVs
C:0 T:6 12532 XUs and 12458 XVs -> 614 XUVs
C:0 T:7 1496 XUs and 2063 XVs -> 57 XUVs
C:0 T:8 1771 XUs and 2088 XVs -> 91 XUVs
C:0 T:9 3566 XUs and 3755 XVs -> 240 XUVs
C:0 T:10 1276 XUs and 1316 XVs -> 95 XUVs
C:0 T:11 976 XUs and 1075 XVs -> 69 XUVs
C:0 T:12 76883 XUs and 135912 XVs -> 5637 XUVs
C:0 T:13 7253 XUs and 6881 XVs -> 297 XUVs
C:0 T:14 4576 XUs and 4651 XVs -> 319 XUVs
C:0 T:15 2064 XUs and 1565 XVs -> 113 XUVs
11922 XUVs total
6485 collection wire objects
11922 potential space points
Neighbour search...
252618 tests to find 65386 neighbours
Iterating with no regularization...
Begin: 4.43441e+12
0 4.36916e+12
1 4.36851e+12
Now with regularization...
Begin: 4.36205e+12
0 4.36204e+12
RawFrameSource: got 12288 raw::RawDigit objects
	input nticks=8000 keeping as is
[13:05:03.133] D [  main  ] executing 1 apps, thread limit 0:
[13:05:03.134] D [  main  ] executing 1 apps, thread limit 0:
[13:05:03.134] D [  main  ] executing app: "Pgrapher"
[13:05:03.134] D [ pgraph ] <Pgrapher:> executing graph 
[13:05:03.134] D [ pgraph ] executing with 26 nodes
[13:05:03.135] D [  glue  ] <FrameFanout:nfsp> call=38: input: frame: ident=290597 time=83 tick=512 with 12288 traces.  frame tags:[ "orig" ] 0 tagged trace sets:[ ] cmm:[ ] output 0: frame: ident=290597 time=83 tick=512 with 12288 traces.  frame tags:[ "orig0" ] 0 tagged trace sets:[ ] cmm:[ ] output 1: frame: ident=290597 time=83 tick=512 with 12288 traces.  frame tags:[ "orig1" ] 0 tagged trace sets:[ ] cmm:[ ] output 2: frame: ident=290597 time=83 tick=512 with 12288 traces.  frame tags:[ "orig2" ] 0 tagged trace sets:[ ] cmm:[ ] output 3: frame: ident=290597 time=83 tick=512 with 12288 traces.  frame tags:[ "orig3" ] 0 tagged trace sets:[ ] cmm:[ ] output 4: frame: ident=290597 time=83 tick=512 with 12288 traces.  frame tags:[ "orig4" ] 0 tagged trace sets:[ ] cmm:[ ] output 5: frame: ident=290597 time=83 tick=512 with 12288 traces.  frame tags:[ "orig5" ] 0 tagged trace sets:[ ] cmm:[ ] output 6: frame: ident=290597 time=83 tick=512 with 12288 traces.  frame tags:[ "orig6" ] 0 tagged trace sets:[ ] cmm:[ ] output 7: frame: ident=290597 time=83 tick=512 with 12288 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]  
[13:05:03.135] W [  glue  ] <ChannelSelector:chsel7> Untagged summary not supported, summary will be dropped. 
[13:05:03.136] D [  glue  ] <ChannelSelector:chsel7> input frame: ident=290597 time=83 tick=512 with 12288 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] output: frame: ident=290597 time=83 tick=512 with 1536 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] 
[13:05:03.136] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=38 input frame: frame: ident=290597 time=83 tick=512 with 1536 traces.  frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] 
[13:05:03.136] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=38 init nticks=8000 tbinmin=0 tbinmax=8000 
[13:05:03.165] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=38 load plane index: 0, ntraces=1536, input bad regions: 0 
[13:05:04.826] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=38 load plane index: 1, ntraces=1536, input bad regions: 0 
[13:05:06.457] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=38 load plane index: 2, ntraces=1536, input bad regions: 0 

==================================================================================================================================
TimeTracker printout (sec)                          Min           Avg           Max         Median          RMS         nEvts   
==================================================================================================================================
Full event                                      8.8636e-05      280.261       1825.47       134.188       391.82         20     
----------------------------------------------------------------------------------------------------------------------------------
source:HDF5RawInput3(read)                      6.3088e-05    8.76087e-05   0.000217626   8.11315e-05   3.1108e-05       20     
produce:tpcrawdecoder:PDVDTPCReader               17.0307       30.7437       76.6901       23.9898       15.5303        20     
produce:triggerrawdecoder:PDVDTriggerReader4     0.292811      0.314179      0.354022      0.311808      0.0133153       20     
produce:pdvddaphne:DAPHNEReaderPDVD               3.31182       3.97872       5.7556        3.90702      0.590291        20     
produce:ophit:OpHitFinder                        0.0406384     0.0494577     0.0603477     0.049576     0.00511892       20     
produce:opflash:OpFlashFinderVerticalDrift      0.000572539   0.00669357     0.0158551    0.00651825    0.00348707       20     
produce:wclsdatavd:WireCellToolkit                49.5747       77.1254       139.77        59.5274       27.4881        19     
produce:gaushit:GausHitFinder                    0.873017       1.29565       2.76965       1.13715      0.448088        19     
produce:nhitsfilter:NumberOfHitsFilter          0.000242603   0.000438334    0.0017972    0.000336779   0.000332606      19     
produce:reco3d:SpacePointSolver                   8.63056       15.7091       52.0633       11.8709       10.5467        19     
produce:hitpdune:DisambigFromSpacePoints         0.0979448     0.270297       1.1616       0.173858      0.251395        19     
produce:pandora:StandardPandora                   14.8757       159.49        1623.72       25.2587       368.816        19     
produce:pandoraTrack:LArPandoraTrackCreation     0.370433       1.73263        12.45       0.795228       2.64409        19     
produce:pandoraGnocalo:GnocchiCalorimetry        0.0209817     0.0338613     0.0728864     0.0327556     0.0109207       19     
[art]:TriggerResults:TriggerResultInserter      1.5088e-05    2.05958e-05   5.1897e-05    1.8094e-05    7.7909e-06       19     
end_path:out1:RootOutput                         4.028e-06    5.22989e-06   2.1069e-05     4.358e-06    3.73871e-06      19     
end_path:out1:RootOutput(write)                   4.00497       4.45079       5.29986       4.29872      0.423099        19     
==================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 8589.88 MB
  Peak resident set size usage (VmHWM): 6702.58 MB
  Details saved in: 'mem.db'
====================================================================================================
%MSG-s ArtException:  PostEndJob 28-Oct-2025 13:05:48 CET ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- ScheduleExecutionFailure BEGIN
    Path: ProcessingStopped.
    ---- BadAlloc BEGIN
      A bad_alloc exception was thrown while processing module WireCellToolkit/wclsdatavd run: 40140 subRun: 1 event: 290597
      The job has probably exhausted the virtual memory available to the process.
    ---- BadAlloc END
    Exception going through path produce
  ---- ScheduleExecutionFailure END
---- EventProcessorFailure END
%MSG
Art has completed and will exit with status 1.
Error in reco1
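
The memory figures on this page use two different units: the requested RSS is given in bytes and MiB, while the MemoryTracker summary reports base-10 MB. As a rough aid for reading the BadAlloc message, the Python sketch below converts the values copied from this page to a common unit. It is not part of the job output, the variable names are illustrative, and it only compares the numbers. It does not establish which limit the process actually hit.

```python
# Unit constants: the job page reports MiB (binary), MemoryTracker reports MB (base-10).
MIB = 1024 * 1024
MB = 1000 * 1000

# Values copied from this page.
requested_rss_bytes = 4193255424   # requested "RSS bytes" (3999 MiB)
peak_rss_mb = 6702.58              # MemoryTracker VmHWM, base-10 MB

# Convert both figures to bytes and MiB so they can be compared directly.
peak_rss_bytes = peak_rss_mb * MB
peak_rss_mib = peak_rss_bytes / MIB
requested_rss_mib = requested_rss_bytes / MIB

# Ratio of the peak resident set size to the requested RSS.
ratio = peak_rss_bytes / requested_rss_bytes

print(f"requested RSS: {requested_rss_mib:.0f} MiB")
print(f"peak RSS:      {peak_rss_mib:.0f} MiB")
print(f"peak / requested: {ratio:.2f}")
```

Under these numbers the reported peak resident set size is well above the RSS requested for the job, which is at least consistent with the memory-exhaustion hint in the exception text.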
justIN time: 2025-11-05 05:39:09 UTC       justIN version: 01.05.01