justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 240229.132@dunegpschedd01.fnal.gov

Jobsub ID240229.132@dunegpschedd01.fnal.gov
Workflow ID9431
Stage ID1
User namedrivera@fnal.gov
HTCondor Groupgroup_dune
RequestedProcessors1
GPUNo
RSS bytes6815744000 (6500 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2025-10-29 12:46:29
SiteNL_SURFsara
EntryDUNE_SurfSARA_arc03
Last heartbeat2025-10-29 20:53:08
From worker nodeHostnamewn-la-01.gina.surf.nl
cpuinfoAMD EPYC 9754 128-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes7864320000 (7500 MiB)
Wall seconds limit129600 (36 hours)
GPU
Inner Apptainer?True
Job stateoutputting_failed
Started2025-10-29 13:31:01
Input fileshd-protodune:np04hd_raw_run032942_0001_dataflow7_datawriter_0_20241204T073553.hdf5
JobscriptExit code0
Real time7h (26107s)
CPU time7h (25554s = 97%)
Max RSS bytes2467758080 (2353 MiB)
Outputting started2025-10-29 20:46:08
Output files
Finished2025-10-29 20:53:08
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

esired window for channel 9395
wclsFrameSaver: no samples within desired window for channel 9396
wclsFrameSaver: no samples within desired window for channel 9397
wclsFrameSaver: no samples within desired window for channel 9398
wclsFrameSaver: no samples within desired window for channel 9398
wclsFrameSaver: no samples within desired window for channel 9399
wclsFrameSaver: no samples within desired window for channel 9403
wclsFrameSaver: no samples within desired window for channel 9404
wclsFrameSaver: no samples within desired window for channel 9507
wclsFrameSaver: no samples within desired window for channel 9654
wclsFrameSaver: no samples within desired window for channel 9655
wclsFrameSaver: no samples within desired window for channel 9656
wclsFrameSaver: no samples within desired window for channel 9656
wclsFrameSaver: no samples within desired window for channel 9657
wclsFrameSaver: no samples within desired window for channel 9658
wclsFrameSaver: no samples within desired window for channel 9658
wclsFrameSaver: no samples within desired window for channel 9658
wclsFrameSaver: no samples within desired window for channel 9659
wclsFrameSaver: no samples within desired window for channel 9659
wclsFrameSaver: no samples within desired window for channel 9659
wclsFrameSaver: no samples within desired window for channel 9660
wclsFrameSaver: no samples within desired window for channel 9660
wclsFrameSaver: no samples within desired window for channel 9661
wclsFrameSaver: no samples within desired window for channel 9661
wclsFrameSaver: no samples within desired window for channel 9661
wclsFrameSaver: no samples within desired window for channel 9661
wclsFrameSaver: no samples within desired window for channel 9662
wclsFrameSaver: no samples within desired window for channel 9662
wclsFrameSaver: no samples within desired window for channel 9662
wclsFrameSaver: no samples within desired window for channel 9663
wclsFrameSaver: no samples within desired window for channel 9664
wclsFrameSaver: no samples within desired window for channel 9664
wclsFrameSaver: no samples within desired window for channel 9666
wclsFrameSaver: no samples within desired window for channel 9667
wclsFrameSaver: no samples within desired window for channel 9691
wclsFrameSaver: no samples within desired window for channel 9692
wclsFrameSaver: no samples within desired window for channel 9692
wclsFrameSaver: no samples within desired window for channel 9693
wclsFrameSaver: no samples within desired window for channel 9694
wclsFrameSaver: no samples within desired window for channel 9694
wclsFrameSaver: no samples within desired window for channel 9694
wclsFrameSaver: no samples within desired window for channel 9695
wclsFrameSaver: no samples within desired window for channel 9695
wclsFrameSaver: no samples within desired window for channel 9695
wclsFrameSaver: no samples within desired window for channel 9696
wclsFrameSaver: no samples within desired window for channel 9696
wclsFrameSaver: no samples within desired window for channel 9697
wclsFrameSaver: no samples within desired window for channel 9697
wclsFrameSaver: no samples within desired window for channel 9698
wclsFrameSaver: no samples within desired window for channel 9698
wclsFrameSaver: no samples within desired window for channel 9698
wclsFrameSaver: no samples within desired window for channel 9699
wclsFrameSaver: no samples within desired window for channel 9699
wclsFrameSaver: no samples within desired window for channel 9700
wclsFrameSaver: no samples within desired window for channel 9701
wclsFrameSaver: no samples within desired window for channel 9701
wclsFrameSaver: no samples within desired window for channel 9702
wclsFrameSaver: no samples within desired window for channel 9703
wclsFrameSaver: no samples within desired window for channel 9704
wclsFrameSaver: no samples within desired window for channel 9704
wclsFrameSaver: no samples within desired window for channel 9705
wclsFrameSaver: no samples within desired window for channel 9705
wclsFrameSaver: no samples within desired window for channel 9705
wclsFrameSaver: no samples within desired window for channel 9706
wclsFrameSaver: no samples within desired window for channel 9706
wclsFrameSaver: no samples within desired window for channel 9706
wclsFrameSaver: no samples within desired window for channel 9707
wclsFrameSaver: no samples within desired window for channel 9707
wclsFrameSaver: no samples within desired window for channel 9708
wclsFrameSaver: no samples within desired window for channel 9708
wclsFrameSaver: no samples within desired window for channel 9708
wclsFrameSaver: no samples within desired window for channel 9709
wclsFrameSaver: no samples within desired window for channel 9709
wclsFrameSaver: no samples within desired window for channel 9710
wclsFrameSaver: no samples within desired window for channel 9710
wclsFrameSaver: no samples within desired window for channel 9711
wclsFrameSaver: no samples within desired window for channel 9711
wclsFrameSaver: no samples within desired window for channel 9711
wclsFrameSaver: no samples within desired window for channel 9712
wclsFrameSaver: no samples within desired window for channel 9713
FrameSaver: q=2.17435e+07 n=1978464 tag=wiener
29-Oct-2025 21:46:05 CET  Closed output file "np04hd_raw_run032942_0001_dataflow7_datawriter_0_20241204T073553_1_240229_132_1761744674_deco_reco.root"

====================================================================================================================================
TimeTracker printout (sec)                            Min           Avg           Max         Median          RMS         nEvts   
====================================================================================================================================
Full event                                          93.4082       666.904       4487.55       273.938       873.874        39     
------------------------------------------------------------------------------------------------------------------------------------
source:HDF5RawInput3(read)                        6.2664e-05    8.60125e-05   0.000194952   8.1252e-05    2.05414e-05      39     
produce:tpcrawdecoder:PDHDTPCReader                 15.2481       16.192        20.1915       16.0726      0.774731        39     
produce:fembfilter:PDHDFEMBFilter                 9.2467e-05    0.000121759   0.000323555   0.000109624   3.91425e-05      39     
produce:wclsdatahd:WireCellToolkit                  67.6366       89.1547       107.908       88.4027       9.61425        39     
produce:gaushit:GausHitFinder                      0.304801       1.24757       2.2967        1.18182      0.487337        39     
produce:nhitsfilter:NumberOfHitsFilter            0.000184596   0.000635684   0.00130235    0.000717924   0.000326472      39     
produce:reco3d:SpacePointSolver                    0.272203       79.3134       263.396       28.2879       83.2783        39     
produce:hitpdune:DisambigFromSpacePoints           0.164604       6.42045       18.4259       5.83474       5.8963         39     
produce:pandora:StandardPandora                     5.37254       451.357       4055.4        105.45        794.346        39     
produce:pandoraWriter:StandardPandora              0.0980651     0.331711      0.539274      0.326134      0.130597        39     
produce:pandoraTrack:LArPandoraTrackCreation       0.805127       6.64299       15.7161       5.45182       4.26047        39     
produce:pandoraShower:LArPandoraShowerCreation     0.803597       8.05698       19.4686       6.54723       5.15161        39     
produce:pandoracalo:Calorimetry                    0.369427       3.18758       7.33306       2.64627       2.08351        39     
produce:pandoracalonosce:Calorimetry               0.332777       2.94461       7.06043       2.3727        1.88499        39     
[art]:TriggerResults:TriggerResultInserter        3.2238e-05    5.13374e-05   0.000129353   4.7341e-05    1.63326e-05      39     
end_path:out1:RootOutput                           6.74e-06     1.00541e-05   3.8639e-05     9.023e-06    4.8632e-06       39     
end_path:out1:RootOutput(write)                    0.432344       1.5496        2.60151       1.50447       0.64642        39     
====================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 4364.15 MB
  Peak resident set size usage (VmHWM): 2467.76 MB
  Details saved in: 'mem.db'
====================================================================================================

{'art_out': 'np04hd_raw_run032942_0001_dataflow7_datawriter_0_20241204T073553_1_240229_132_1761744674_deco_reco.root', 'start_time': 1761744674.2431142, 'end_time': 1761770766.9035578}
lsing
total 993M
-rw-r--r--. 1 dune009 dune 714M Oct 29 21:46 np04hd_raw_run032942_0001_dataflow7_datawriter_0_20241204T073553_1_240229_132_1761744674_deco_reco.root
-rw-r--r--. 1 dune009 dune 258M Oct 29 21:45 Pandora_Events.pndr
-rw-r--r--. 1 dune009 dune 6.3M Oct 29 21:46 jobscript.log
-rw-r--r--. 1 dune009 dune 6.1M Oct 29 21:46 np04hd_raw_run032942_0001_dataflow7_datawriter_0_20241204T073553_1_240229_132_1761744674_deco_reco.err
-rw-r--r--. 1 dune009 dune 280K Oct 29 21:46 mem.db
-rw-r--r--. 1 dune009 dune 100K Oct 29 21:46 np04hd_raw_run032942_0001_dataflow7_datawriter_0_20241204T073553_1_240229_132_1761744674_deco_reco.log
-rw-r--r--. 1 dune009 dune  56K Oct 29 21:46 time.db
-rw-r--r--. 1 dune009 dune  19K Oct 29 21:46 pdhd_keepup_decoder.root
-rw-r--r--. 1 dune009 dune 3.5K Oct 29 21:46 Pandora_Geometry.xml
-rw-r--r--. 1 dune009 dune   83 Oct 29 14:31 all-input-dids.txt
-rw-r--r--. 1 dune009 dune    0 Oct 29 14:31 debugprod.log
disk usage
993M	.
justIN time: 2025-11-03 18:30:34 UTC       justIN version: 01.05.01