Jobsub ID 241354.0@dunegpschedd01.fnal.gov
| Jobsub ID | 241354.0@dunegpschedd01.fnal.gov |
| Workflow ID | 9373 |
| Stage ID | 1 |
| User name | ykermaid@fnal.gov |
| HTCondor Group | group_dune.prod_mcsim |
| Requested | Processors | 1 |
| Requested | GPU | No |
| Requested | RSS bytes | 4193255424 (3999 MiB) |
| Requested | Wall seconds limit | 18000 (5 hours) |
| Submitted time | 2025-10-31 09:44:48 |
| Site | NL_NIKHEF |
| Entry | VIRGO_NL_NIKHEF_juk |
| Last heartbeat | 2025-10-31 10:32:54 |
| From worker node | Hostname | wn-sate-038.farm.nikhef.nl |
| From worker node | cpuinfo | AMD EPYC 7551P 32-Core Processor |
| From worker node | OS release | Scientific Linux release 7.9 (Nitrogen) |
| From worker node | Processors | 1 |
| From worker node | RSS bytes | 4194304000 (4000 MiB) |
| From worker node | Wall seconds limit | 129600 (36 hours) |
| From worker node | GPU | |
| From worker node | Inner Apptainer? | True |
| Job state | jobscript_error |
| Started | 2025-10-31 09:45:46 |
| Input files | vd-protodune:np02vd_raw_run040266_0240_df-s04-d1_dw_0_20251025T035257.hdf5 |
| Jobscript | Exit code | 1 |
| Jobscript | Real time | 0m (0s) |
| Jobscript | CPU time | 0m (0s = 0%) |
| Jobscript | Max RSS bytes | 0 (0 MiB) |
| Outputting started | |
| Output files | |
| Finished | 2025-10-31 10:32:54 |
| Saved logs | justin-logs:241354.0-dunegpschedd01.fnal.gov.logs.tgz |
Jobscript log (last 10,000 characters)
glue ] frame sink sees EOS
[11:30:12.840] D [ pgraph ] <Pgrapher:> graph execution complete
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 15.43 sec
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 14.4 sec
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 13.64 sec
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 13.37 sec
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 12.97 sec
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 12.94 sec
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 12.16 sec
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 12.04 sec
[11:30:12.840] I [ timer ] Timer: WireCell::Aux::Resampler : 0.49 sec
[11:30:12.840] I [ timer ] Timer: WireCell::Aux::Resampler : 0.48 sec
[11:30:12.840] I [ timer ] Timer: WireCell::Aux::Resampler : 0.48 sec
[11:30:12.840] I [ timer ] Timer: WireCell::Aux::Resampler : 0.47 sec
[11:30:12.840] I [ timer ] Timer: WireCell::Gen::FrameFanin : 0.01 sec
[11:30:12.840] I [ timer ] Timer: WireCell::Gen::Retagger : 0.01 sec
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[11:30:12.840] I [ timer ] Timer: WireCell::Gen::DumpFrames : 0 sec
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[11:30:12.840] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[11:30:12.840] I [ timer ] Timer: WireCell::Gen::FrameFanout : 0 sec
[11:30:12.840] I [ timer ] Timer: wcls::RawFrameSource : 0 sec
[11:30:12.840] I [ timer ] Timer: wcls::FrameSaver : 0 sec
[11:30:12.840] I [ timer ] Timer: Total node execution : 108.88999979570508 sec
wclsFrameSaver saving cooked to 10000 ticks
wclsFrameSaver: saving 54211 traces tagged "gauss"
FrameSaver: q=1.35824e+07 n=1152461 tag=gauss
wclsFrameSaver: saving 67163 traces tagged "wiener"
FrameSaver: q=1.46767e+07 n=1101658 tag=wiener
0 X, 0 U, 0 V bad channels
Finding XUV coincidences...
C:0 T:0 486 XUs and 437 XVs -> 16 XUVs
C:0 T:1 1052 XUs and 1171 XVs -> 58 XUVs
C:0 T:2 4542 XUs and 3215 XVs -> 154 XUVs
C:0 T:3 23796 XUs and 33213 XVs -> 2556 XUVs
C:0 T:4 835 XUs and 991 XVs -> 37 XUVs
C:0 T:5 1559 XUs and 2402 XVs -> 40 XUVs
C:0 T:6 277 XUs and 254 XVs -> 10 XUVs
C:0 T:7 1482 XUs and 1949 XVs -> 45 XUVs
C:0 T:8 13494 XUs and 30087 XVs -> 7205 XUVs
C:0 T:9 4037 XUs and 8290 XVs -> 700 XUVs
C:0 T:10 1284 XUs and 2020 XVs -> 485 XUVs
C:0 T:11 2081 XUs and 1601 XVs -> 108 XUVs
C:0 T:12 547 XUs and 859 XVs -> 43 XUVs
C:0 T:13 1970 XUs and 1614 XVs -> 101 XUVs
C:0 T:14 363 XUs and 431 XVs -> 13 XUVs
C:0 T:15 990 XUs and 1295 XVs -> 61 XUVs
11632 XUVs total
1938 collection wire objects
11632 potential space points
Neighbour search...
1939074 tests to find 707774 neighbours
Iterating with no regularization...
Begin: 1.63372e+10
0 1.56285e+10
1 1.5573e+10
2 1.55721e+10
Now with regularization...
Begin: 1.54014e+10
0 1.54006e+10
RawFrameSource: got 12288 raw::RawDigit objects
input nticks=8448 keeping as is
[11:31:32.794] D [ main ] executing 1 apps, thread limit 0:
[11:31:32.794] D [ main ] executing 1 apps, thread limit 0:
[11:31:32.794] D [ main ] executing app: "Pgrapher"
[11:31:32.794] D [ pgraph ] <Pgrapher:> executing graph
[11:31:32.794] D [ pgraph ] executing with 26 nodes
[11:31:32.795] D [ glue ] <FrameFanout:nfsp> call=36: input: frame: ident=44373 time=81 tick=512 with 12288 traces. frame tags:[ "orig" ] 0 tagged trace sets:[ ] cmm:[ ] output 0: frame: ident=44373 time=81 tick=512 with 12288 traces. frame tags:[ "orig0" ] 0 tagged trace sets:[ ] cmm:[ ] output 1: frame: ident=44373 time=81 tick=512 with 12288 traces. frame tags:[ "orig1" ] 0 tagged trace sets:[ ] cmm:[ ] output 2: frame: ident=44373 time=81 tick=512 with 12288 traces. frame tags:[ "orig2" ] 0 tagged trace sets:[ ] cmm:[ ] output 3: frame: ident=44373 time=81 tick=512 with 12288 traces. frame tags:[ "orig3" ] 0 tagged trace sets:[ ] cmm:[ ] output 4: frame: ident=44373 time=81 tick=512 with 12288 traces. frame tags:[ "orig4" ] 0 tagged trace sets:[ ] cmm:[ ] output 5: frame: ident=44373 time=81 tick=512 with 12288 traces. frame tags:[ "orig5" ] 0 tagged trace sets:[ ] cmm:[ ] output 6: frame: ident=44373 time=81 tick=512 with 12288 traces. frame tags:[ "orig6" ] 0 tagged trace sets:[ ] cmm:[ ] output 7: frame: ident=44373 time=81 tick=512 with 12288 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]
[11:31:32.796] W [ glue ] <ChannelSelector:chsel7> Untagged summary not supported, summary will be dropped.
[11:31:32.796] D [ glue ] <ChannelSelector:chsel7> input frame: ident=44373 time=81 tick=512 with 12288 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] output: frame: ident=44373 time=81 tick=512 with 1536 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]
[11:31:32.797] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=36 input frame: frame: ident=44373 time=81 tick=512 with 1536 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]
[11:31:32.797] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=36 init nticks=8448 tbinmin=0 tbinmax=8448
[11:31:32.828] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=36 load plane index: 0, ntraces=1536, input bad regions: 0
[11:31:34.704] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=36 load plane index: 1, ntraces=1536, input bad regions: 0
[11:31:36.634] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=36 load plane index: 2, ntraces=1536, input bad regions: 0
==================================================================================================================================
TimeTracker printout (sec) Min Avg Max Median RMS nEvts
==================================================================================================================================
Full event 6.397e-05 141.514 291.647 141.759 53.0298 19
----------------------------------------------------------------------------------------------------------------------------------
source:HDF5RawInput3(read) 6.2678e-05 8.28285e-05 0.000171152 7.7275e-05 2.33956e-05 19
produce:tpcrawdecoder:PDVDTPCReader 12.2258 13.1951 18.5574 12.7827 1.37916 19
produce:triggerrawdecoder:PDVDTriggerReader4 0.514682 0.528097 0.554924 0.523135 0.0111902 19
produce:pdvddaphne:DAPHNEReaderPDVD 0.000362663 0.000418679 0.000732388 0.0003825 8.0929e-05 19
produce:ophit:OpHitFinder 5.7959e-05 0.000102246 0.00059494 7.483e-05 0.000117264 19
produce:opflash:OpFlashFinderVerticalDrift 4.8361e-05 7.10352e-05 0.000335411 5.3331e-05 6.33256e-05 19
produce:wclsdatavd:WireCellToolkit 57.7267 79.5383 140.073 66.9112 21.1135 18
produce:gaushit:GausHitFinder 0.772701 1.33901 1.79029 1.33033 0.254638 18
produce:nhitsfilter:NumberOfHitsFilter 0.000243778 0.000395497 0.000784918 0.000378764 0.000131247 18
produce:reco3d:SpacePointSolver 6.4767 13.1085 16.8965 13.589 3.03458 18
produce:hitpdune:DisambigFromSpacePoints 0.0813483 0.197073 0.303041 0.190769 0.057246 18
produce:pandora:StandardPandora 11.4888 34.9263 113.078 31.6071 21.0112 18
produce:pandoraTrack:LArPandoraTrackCreation 0.570375 1.36674 2.93607 1.26082 0.521114 18
produce:pandoraGnocalo:GnocchiCalorimetry 0.0251868 0.0383063 0.05397 0.0392027 0.00715731 18
[art]:TriggerResults:TriggerResultInserter 2.2863e-05 3.99887e-05 8.3888e-05 3.42795e-05 1.54676e-05 18
end_path:out1:RootOutput 7.705e-06 1.08149e-05 2.1581e-05 9.9435e-06 3.03694e-06 18
end_path:out1:RootOutput(write) 4.52228 5.05801 6.0739 4.99718 0.415224 18
==================================================================================================================================
====================================================================================================
MemoryTracker summary (base-10 MB units used)
Peak virtual memory usage (VmPeak) : 8589.43 MB
Peak resident set size usage (VmHWM): 6654.91 MB
Details saved in: 'mem.db'
====================================================================================================
%MSG-s ArtException: PostEndJob 31-Oct-2025 11:32:31 CET ModuleEndJob
---- EventProcessorFailure BEGIN
EventProcessor: an exception occurred during current event processing
---- ScheduleExecutionFailure BEGIN
Path: ProcessingStopped.
---- BadAlloc BEGIN
A bad_alloc exception was thrown while processing module WireCellToolkit/wclsdatavd run: 40266 subRun: 1 event: 44373
The job has probably exhausted the virtual memory available to the process.
---- BadAlloc END
Exception going through path produce
---- ScheduleExecutionFailure END
---- EventProcessorFailure END
%MSG
Art has completed and will exit with status 1.
Error in reco1
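
The MemoryTracker summary reports memory in base-10 MB, while the job table reports RSS limits in bytes and MiB, so the two are not directly comparable as printed. The following is a minimal sketch that converts the figures copied from this page into MiB and compares the peak resident set size with the worker node RSS limit. The variable and function names are illustrative only, and the comparison does not by itself establish the cause of the `bad_alloc`.

```python
# Values copied from the job table and the MemoryTracker summary on this page.
REQUESTED_RSS_BYTES = 4193255424     # "Requested" RSS bytes (3999 MiB)
WORKER_RSS_LIMIT_BYTES = 4194304000  # "From worker node" RSS bytes (4000 MiB)
PEAK_RSS_MB = 6654.91                # VmHWM, base-10 MB
PEAK_VIRTUAL_MB = 8589.43            # VmPeak, base-10 MB

MIB = 1024 * 1024  # bytes per MiB


def mb_to_mib(megabytes):
    """Convert base-10 megabytes to MiB."""
    return megabytes * 1_000_000 / MIB


def bytes_to_mib(nbytes):
    """Convert a byte count to MiB."""
    return nbytes / MIB


peak_rss_mib = mb_to_mib(PEAK_RSS_MB)
peak_virtual_mib = mb_to_mib(PEAK_VIRTUAL_MB)
requested_mib = bytes_to_mib(REQUESTED_RSS_BYTES)
limit_mib = bytes_to_mib(WORKER_RSS_LIMIT_BYTES)

# Ratio of the observed peak RSS to the worker node RSS limit.
ratio = peak_rss_mib / limit_mib

print(f"Requested RSS:        {requested_mib:.0f} MiB")
print(f"Worker RSS limit:     {limit_mib:.0f} MiB")
print(f"Peak RSS (VmHWM):     {peak_rss_mib:.1f} MiB")
print(f"Peak virtual (VmPeak): {peak_virtual_mib:.1f} MiB")
print(f"Peak RSS / limit:     {ratio:.2f}")
```

With these inputs the peak RSS comes out well above the 4000 MiB limit, which suggests the memory request for this stage may be too low for the `wclsdatavd` step.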