Jobsub ID 241274.0@dunegpschedd01.fnal.gov
| Jobsub ID | 241274.0@dunegpschedd01.fnal.gov |
| Workflow ID | 9374 |
| Stage ID | 1 |
| User name | ykermaid@fnal.gov |
| HTCondor Group | group_dune.prod_mcsim |
| Requested: Processors | 1 |
| Requested: GPU | No |
| Requested: RSS bytes | 4193255424 (3999 MiB) |
| Requested: Wall seconds limit | 18000 (5 hours) |
| Submitted time | 2025-10-31 01:32:23 |
| Site | NL_NIKHEF |
| Entry | VIRGO_NL_NIKHEF_klomp |
| Last heartbeat | 2025-10-31 03:05:54 |
| From worker node: Hostname | wn-sate-053.farm.nikhef.nl |
| From worker node: cpuinfo | AMD EPYC 7551P 32-Core Processor |
| From worker node: OS release | Scientific Linux release 7.9 (Nitrogen) |
| From worker node: Processors | 1 |
| From worker node: RSS bytes | 4194304000 (4000 MiB) |
| From worker node: Wall seconds limit | 129600 (36 hours) |
| From worker node: GPU | |
| From worker node: Inner Apptainer? | True |
| Job state | jobscript_error |
| Started | 2025-10-31 01:33:00 |
| Input files | vd-protodune:np02vd_raw_run040266_0287_df-s03-d3_dw_0_20251025T051201.hdf5 |
| Jobscript: Exit code | 1 |
| Jobscript: Real time | 0m (0s) |
| Jobscript: CPU time | 0m (0s = 0%) |
| Jobscript: Max RSS bytes | 0 (0 MiB) |
| Outputting started | |
| Output files | |
| Finished | 2025-10-31 03:05:54 |
| Saved logs | justin-logs:241274.0-dunegpschedd01.fnal.gov.logs.tgz |
Jobscript log (last 10,000 characters)
[ glue ] frame sink sees EOS
[03:59:08.466] D [ pgraph ] <Pgrapher:> graph execution complete
[03:59:08.466] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 68.75 sec
[03:59:08.466] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 40.83 sec
[03:59:08.466] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 22.05 sec
[03:59:08.466] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 9.63 sec
[03:59:08.466] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 9.46 sec
[03:59:08.466] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 8.56 sec
[03:59:08.466] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 8.53 sec
[03:59:08.466] I [ timer ] Timer: WireCell::SigProc::OmnibusSigProc : 7.59 sec
[03:59:08.466] I [ timer ] Timer: WireCell::Aux::Resampler : 0.26 sec
[03:59:08.467] I [ timer ] Timer: WireCell::Aux::Resampler : 0.25 sec
[03:59:08.467] I [ timer ] Timer: WireCell::Aux::Resampler : 0.25 sec
[03:59:08.467] I [ timer ] Timer: WireCell::Aux::Resampler : 0.25 sec
[03:59:08.467] I [ timer ] Timer: WireCell::Gen::FrameFanin : 0.03 sec
[03:59:08.467] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0.02 sec
[03:59:08.467] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0.01 sec
[03:59:08.467] I [ timer ] Timer: WireCell::Gen::Retagger : 0.01 sec
[03:59:08.467] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[03:59:08.467] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[03:59:08.467] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[03:59:08.467] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[03:59:08.467] I [ timer ] Timer: WireCell::Gen::DumpFrames : 0 sec
[03:59:08.467] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[03:59:08.467] I [ timer ] Timer: WireCell::SigProc::ChannelSelector : 0 sec
[03:59:08.467] I [ timer ] Timer: WireCell::Gen::FrameFanout : 0 sec
[03:59:08.467] I [ timer ] Timer: wcls::RawFrameSource : 0 sec
[03:59:08.467] I [ timer ] Timer: wcls::FrameSaver : 0 sec
[03:59:08.467] I [ timer ] Timer: Total node execution : 176.48000151477754 sec
wclsFrameSaver saving cooked to 10000 ticks
wclsFrameSaver: saving 103732 traces tagged "gauss"
FrameSaver: q=7.82255e+08 n=2525142 tag=gauss
wclsFrameSaver: saving 97524 traces tagged "wiener"
FrameSaver: q=8.32797e+08 n=2527170 tag=wiener
0 X, 0 U, 0 V bad channels
Finding XUV coincidences...
C:0 T:0 534 XUs and 656 XVs -> 21 XUVs
C:0 T:1 603 XUs and 608 XVs -> 13 XUVs
C:0 T:2 819 XUs and 726 XVs -> 24 XUVs
C:0 T:3 831 XUs and 923 XVs -> 24 XUVs
C:0 T:4 640 XUs and 748 XVs -> 13 XUVs
C:0 T:5 7828 XUs and 13600 XVs -> 670 XUVs
C:0 T:6 755 XUs and 872 XVs -> 16 XUVs
C:0 T:7 404 XUs and 419 XVs -> 9 XUVs
C:0 T:8 52030 XUs and 138914 XVs -> 7902 XUVs
C:0 T:9 1046 XUs and 3510 XVs -> 18 XUVs
C:0 T:10 41530 XUs and 96176 XVs -> 6443 XUVs
C:0 T:11 64004 XUs and 144334 XVs -> 9418 XUVs
C:0 T:12 131 XUs and 209 XVs -> 6 XUVs
C:0 T:13 34 XUs and 59 XVs -> 3 XUVs
C:0 T:14 33875 XUs and 38082 XVs -> 2008 XUVs
C:0 T:15 1726 XUs and 1805 XVs -> 98 XUVs
26686 XUVs total
5093 collection wire objects
26686 potential space points
Neighbour search...
1545632 tests to find 391408 neighbours
Iterating with no regularization...
Begin: 3.84656e+13
0 3.77173e+13
1 3.7655e+13
2 3.76543e+13
Now with regularization...
Begin: 3.76513e+13
0 3.76511e+13
RawFrameSource: got 12288 raw::RawDigit objects
input nticks=8640 keeping as is
[04:04:16.929] D [ main ] executing 1 apps, thread limit 0:
[04:04:16.929] D [ main ] executing 1 apps, thread limit 0:
[04:04:16.929] D [ main ] executing app: "Pgrapher"
[04:04:16.929] D [ pgraph ] <Pgrapher:> executing graph
[04:04:16.929] D [ pgraph ] executing with 26 nodes
[04:04:16.931] D [ glue ] <FrameFanout:nfsp> call=44: input: frame: ident=53043 time=98 tick=512 with 12288 traces. frame tags:[ "orig" ] 0 tagged trace sets:[ ] cmm:[ ] output 0: frame: ident=53043 time=98 tick=512 with 12288 traces. frame tags:[ "orig0" ] 0 tagged trace sets:[ ] cmm:[ ] output 1: frame: ident=53043 time=98 tick=512 with 12288 traces. frame tags:[ "orig1" ] 0 tagged trace sets:[ ] cmm:[ ] output 2: frame: ident=53043 time=98 tick=512 with 12288 traces. frame tags:[ "orig2" ] 0 tagged trace sets:[ ] cmm:[ ] output 3: frame: ident=53043 time=98 tick=512 with 12288 traces. frame tags:[ "orig3" ] 0 tagged trace sets:[ ] cmm:[ ] output 4: frame: ident=53043 time=98 tick=512 with 12288 traces. frame tags:[ "orig4" ] 0 tagged trace sets:[ ] cmm:[ ] output 5: frame: ident=53043 time=98 tick=512 with 12288 traces. frame tags:[ "orig5" ] 0 tagged trace sets:[ ] cmm:[ ] output 6: frame: ident=53043 time=98 tick=512 with 12288 traces. frame tags:[ "orig6" ] 0 tagged trace sets:[ ] cmm:[ ] output 7: frame: ident=53043 time=98 tick=512 with 12288 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]
[04:04:16.931] W [ glue ] <ChannelSelector:chsel7> Untagged summary not supported, summary will be dropped.
[04:04:16.932] D [ glue ] <ChannelSelector:chsel7> input frame: ident=53043 time=98 tick=512 with 12288 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ] output: frame: ident=53043 time=98 tick=512 with 1536 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]
[04:04:16.932] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=44 input frame: frame: ident=53043 time=98 tick=512 with 1536 traces. frame tags:[ "orig7" ] 0 tagged trace sets:[ ] cmm:[ ]
[04:04:16.932] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=44 init nticks=8640 tbinmin=0 tbinmax=8640
[04:04:16.959] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=44 load plane index: 0, ntraces=1536, input bad regions: 0
[04:04:19.141] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=44 load plane index: 1, ntraces=1536, input bad regions: 0
[04:04:21.372] D [sigproc ] <OmnibusSigProc:anode7sigproc7> call=44 load plane index: 2, ntraces=1536, input bad regions: 0
==================================================================================================================================
TimeTracker printout (sec) Min Avg Max Median RMS nEvts
==================================================================================================================================
Full event 7.969e-05 234.896 532.809 224.718 92.1537 23
----------------------------------------------------------------------------------------------------------------------------------
source:HDF5RawInput3(read) 5.8651e-05 7.66444e-05 0.000204435 7.0372e-05 2.82346e-05 23
produce:tpcrawdecoder:PDVDTPCReader 26.8544 71.1307 94.0677 73.1109 15.3893 23
produce:triggerrawdecoder:PDVDTriggerReader4 0.520077 0.728774 1.90797 0.679486 0.270605 23
produce:pdvddaphne:DAPHNEReaderPDVD 0.000375637 0.000418712 0.000751405 0.000398199 7.36963e-05 23
produce:ophit:OpHitFinder 6.2398e-05 0.000100151 0.000546399 7.5843e-05 9.7174e-05 23
produce:opflash:OpFlashFinderVerticalDrift 5.2919e-05 7.89457e-05 0.000328979 6.0454e-05 5.62434e-05 23
produce:wclsdatavd:WireCellToolkit 55.3556 96.4719 178.916 92.9318 29.202 22
produce:gaushit:GausHitFinder 0.72546 2.2095 14.9027 1.47545 2.90097 22
produce:nhitsfilter:NumberOfHitsFilter 0.000200518 0.000379128 0.000615369 0.000359722 9.62583e-05 22
produce:reco3d:SpacePointSolver 5.47695 16.3674 48.48 14.7157 8.20252 22
produce:hitpdune:DisambigFromSpacePoints 0.0845686 0.257072 0.834101 0.246494 0.151261 22
produce:pandora:StandardPandora 13.9456 49.2365 215.954 37.0431 41.945 22
produce:pandoraTrack:LArPandoraTrackCreation 0.504938 1.78365 3.57187 1.55508 0.777059 22
produce:pandoraGnocalo:GnocchiCalorimetry 0.0221032 0.0434864 0.0660228 0.0407799 0.0115675 22
[art]:TriggerResults:TriggerResultInserter 2.1431e-05 5.11891e-05 8.7745e-05 5.54995e-05 1.80565e-05 22
end_path:out1:RootOutput 7.254e-06 1.16539e-05 2.5759e-05 1.07855e-05 4.1331e-06 22
end_path:out1:RootOutput(write) 4.62263 5.25667 6.24551 5.22949 0.430367 22
==================================================================================================================================
====================================================================================================
MemoryTracker summary (base-10 MB units used)
Peak virtual memory usage (VmPeak) : 8589.9 MB
Peak resident set size usage (VmHWM): 6708.61 MB
Details saved in: 'mem.db'
====================================================================================================
%MSG-s ArtException: PostEndJob 31-Oct-2025 04:05:25 CET ModuleEndJob
---- EventProcessorFailure BEGIN
EventProcessor: an exception occurred during current event processing
---- ScheduleExecutionFailure BEGIN
Path: ProcessingStopped.
---- BadAlloc BEGIN
A bad_alloc exception was thrown while processing module WireCellToolkit/wclsdatavd run: 40266 subRun: 1 event: 53043
The job has probably exhausted the virtual memory available to the process.
---- BadAlloc END
Exception going through path produce
---- ScheduleExecutionFailure END
---- EventProcessorFailure END
%MSG
Art has completed and will exit with status 1.
Error in reco1