
Jobsub ID 36357.44@dunegpschedd02.fnal.gov

Jobsub ID: 36357.44@dunegpschedd02.fnal.gov
Workflow ID: 2332
Stage ID: 1
User name: ykermaid@fnal.gov
HTCondor Group: group_dune.prod.mcsim
Requested:
  Processors: 1
  GPU: No
  RSS bytes: 4193255424 (3999 MiB)
  Wall seconds limit: 18000 (5 hours)
Submitted time: 2025-09-08 12:27:58
Site: NL_SURFsara
Entry: DUNE_SurfSARA_arc01
Last heartbeat: 2025-09-08 15:52:42
From worker node:
  Hostname: wn-da-01.gina.surf.nl
  cpuinfo: AMD EPYC 7702P 64-Core Processor
  OS release: Scientific Linux release 7.9 (Nitrogen)
  Processors: 1
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 129600 (36 hours)
  GPU:
  Inner Apptainer?: True
Job state: outputting_failed
Started: 2025-09-08 14:31:37
Input files: vd-protodune:np02vd_raw_run039338_0015_df-s03-d2_dw_0_20250908T120011.hdf5
Jobscript:
  Exit code: 0
  Real time: 1h (4366s)
  CPU time: 1h (3983s = 91%)
  Max RSS bytes: 2829189120 (2698 MiB)
Outputting started: 2025-09-08 15:44:24
Output files:
Finished: 2025-09-08 15:52:42
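
The derived figures in the summary above (MiB values, the 91% CPU efficiency, and the time spent in each phase) can be reproduced from the raw numbers. The sketch below is only a cross-check: the helper names are hypothetical, and truncating to whole MiB is an assumption about how the page rounds (rounding to nearest gives the same values here).

```python
from datetime import datetime

MIB = 2 ** 20  # bytes per mebibyte
TIME_FORMAT = "%Y-%m-%d %H:%M:%S"


def to_mib(n_bytes: int) -> int:
    """Convert bytes to whole MiB (truncation is an assumption, see lead-in)."""
    return n_bytes // MIB


def cpu_efficiency_percent(cpu_seconds: int, real_seconds: int) -> int:
    """CPU time as a rounded percentage of real (wall-clock) time."""
    return round(100 * cpu_seconds / real_seconds)


def seconds_between(start: str, end: str) -> int:
    """Elapsed seconds between two timestamps in the page's format."""
    delta = datetime.strptime(end, TIME_FORMAT) - datetime.strptime(start, TIME_FORMAT)
    return int(delta.total_seconds())


# Values copied from the job summary.
print("Requested RSS (MiB):", to_mib(4193255424))          # 3999
print("Max RSS (MiB):", to_mib(2829189120))                # 2698
print("CPU efficiency (%):", cpu_efficiency_percent(3983, 4366))  # 91
print("Submitted -> started (s):",
      seconds_between("2025-09-08 12:27:58", "2025-09-08 14:31:37"))
print("Outputting phase (s):",
      seconds_between("2025-09-08 15:44:24", "2025-09-08 15:52:42"))  # 498
```

The outputting phase works out to 498 seconds, ending at the same time as the last heartbeat. The page itself does not state why the job ended in outputting_failed.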

Jobscript log (last 10,000 characters)

es within desired window for channel 10542
wclsFrameSaver: no samples within desired window for channel 10662
wclsFrameSaver: no samples within desired window for channel 10942
wclsFrameSaver: no samples within desired window for channel 11030
wclsFrameSaver: no samples within desired window for channel 11031
wclsFrameSaver: no samples within desired window for channel 11031
wclsFrameSaver: no samples within desired window for channel 11059
wclsFrameSaver: no samples within desired window for channel 11217
wclsFrameSaver: no samples within desired window for channel 11218
wclsFrameSaver: no samples within desired window for channel 11219
wclsFrameSaver: no samples within desired window for channel 11219
wclsFrameSaver: no samples within desired window for channel 11219
wclsFrameSaver: no samples within desired window for channel 11220
wclsFrameSaver: no samples within desired window for channel 11220
wclsFrameSaver: no samples within desired window for channel 11220
wclsFrameSaver: no samples within desired window for channel 11221
wclsFrameSaver: no samples within desired window for channel 11222
wclsFrameSaver: no samples within desired window for channel 11223
wclsFrameSaver: no samples within desired window for channel 11223
wclsFrameSaver: no samples within desired window for channel 11224
wclsFrameSaver: no samples within desired window for channel 11224
wclsFrameSaver: no samples within desired window for channel 11225
wclsFrameSaver: no samples within desired window for channel 11225
wclsFrameSaver: no samples within desired window for channel 11226
wclsFrameSaver: no samples within desired window for channel 11226
wclsFrameSaver: no samples within desired window for channel 11227
wclsFrameSaver: no samples within desired window for channel 11228
wclsFrameSaver: no samples within desired window for channel 11229
wclsFrameSaver: no samples within desired window for channel 11229
wclsFrameSaver: no samples within desired window for channel 12156
wclsFrameSaver: no samples within desired window for channel 12157
wclsFrameSaver: no samples within desired window for channel 12158
wclsFrameSaver: no samples within desired window for channel 12158
wclsFrameSaver: no samples within desired window for channel 12159
wclsFrameSaver: no samples within desired window for channel 12159
wclsFrameSaver: no samples within desired window for channel 12226
wclsFrameSaver: no samples within desired window for channel 12226
wclsFrameSaver: no samples within desired window for channel 12226
wclsFrameSaver: no samples within desired window for channel 12227
FrameSaver: q=5.5865e+06 n=732760 tag=wiener
0 X, 0 U, 0 V bad channels
Finding XUV coincidences...
C:0 T:0 4049 XUs and 3686 XVs -> 191 XUVs
C:0 T:1 1169 XUs and 771 XVs -> 87 XUVs
C:0 T:2 329 XUs and 359 XVs -> 23 XUVs
C:0 T:3 504 XUs and 579 XVs -> 31 XUVs
C:0 T:4 303 XUs and 293 XVs -> 29 XUVs
C:0 T:5 1424 XUs and 1461 XVs -> 59 XUVs
C:0 T:6 1074 XUs and 1391 XVs -> 68 XUVs
C:0 T:7 1039 XUs and 1227 XVs -> 59 XUVs
C:0 T:8 128 XUs and 176 XVs -> 11 XUVs
C:0 T:9 1060 XUs and 1169 XVs -> 71 XUVs
C:0 T:10 80 XUs and 87 XVs -> 6 XUVs
C:0 T:11 1360 XUs and 1592 XVs -> 57 XUVs
C:0 T:12 1808 XUs and 1801 XVs -> 83 XUVs
C:0 T:13 898 XUs and 1089 XVs -> 92 XUVs
C:0 T:14 1303 XUs and 988 XVs -> 58 XUVs
C:0 T:15 975 XUs and 765 XVs -> 68 XUVs
993 XUVs total
807 collection wire objects
993 potential space points
Neighbour search...
9923 tests to find 5300 neighbours
Iterating with no regularization...
Begin: 3.05125e+08
0 2.99296e+08
1 2.99089e+08
Now with regularization...
Begin: 2.9545e+08
0 2.95439e+08
08-Sep-2025 17:42:12 CEST  Closed output file "np02vd_raw_run039338_0015_df-s03-d2_dw_0_20250908T120011_reco_stage1_20250908T154212_keepup.root"

==================================================================================================================================
TimeTracker printout (sec)                          Min           Avg           Max         Median          RMS         nEvts   
==================================================================================================================================
Full event                                        35.9105       208.855       385.061       209.328       60.4435        20     
----------------------------------------------------------------------------------------------------------------------------------
source:HDF5RawInput3(read)                      8.7165e-05    0.000114217   0.000329807   0.000101748   5.1112e-05       20     
produce:tpcrawdecoder:PDVDTPCReader               9.8994        16.5888       19.7427       16.8675       1.79282        20     
produce:triggerrawdecoder:PDVDTriggerReader4     0.551594      0.632769      0.803212      0.625069      0.0661335       20     
produce:pdvddaphne:DAPHNEReaderPDVD             0.000345988   0.000422537   0.00102971    0.000380192   0.000146119      20     
produce:ophit:OpHitFinder                       0.00015733    0.000239033   0.00111546    0.000182205   0.000205828      20     
produce:opflash:OpFlashFinderVerticalDrift      4.5847e-05    7.2875e-05    0.000447249   5.23145e-05   8.60246e-05      20     
produce:wclsdatavd:WireCellToolkit                15.9384       109.293       125.181       115.816       22.9998        20     
produce:gaushit:GausHitFinder                     0.50916       1.36865       1.80007       1.46002      0.286846        20     
produce:nhitsfilter:NumberOfHitsFilter          0.000338172   0.00136951    0.00202587    0.00144462    0.000425586      20     
produce:reco3d:SpacePointSolver                   2.76085       14.3479       22.1646       15.1076       4.04278        20     
produce:hitpdune:DisambigFromSpacePoints         0.0467114     0.207953      0.307084      0.221232      0.061447        20     
produce:pandora:StandardPandora                   4.56917       59.8632       229.408       52.2518       44.0778        20     
produce:pandoraTrack:LArPandoraTrackCreation     0.352525       1.83825       5.91485       1.3744        1.30037        20     
produce:pandoraGnocalo:GnocchiCalorimetry        0.014449      0.0377657     0.0597789     0.0393305     0.0092832       20     
[art]:TriggerResults:TriggerResultInserter      1.9647e-05    7.17037e-05   0.000237131   6.5604e-05    4.3029e-05       20     
end_path:out1:RootOutput                         6.152e-06    1.27041e-05   5.9453e-05     8.757e-06    1.20535e-05      20     
end_path:out1:RootOutput(write)                   1.16298       4.59139       5.0175        4.72385      0.800148        20     
==================================================================================================================================
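
To see which modules dominate the per-event time in the TimeTracker table above, the rows can be parsed and each average expressed as a share of the "Full event" average. This is a minimal sketch, not part of art or justIN: the function name is hypothetical and only a few rows are copied in as sample input.

```python
SAMPLE = """\
TimeTracker printout (sec)                          Min           Avg           Max         Median          RMS         nEvts
Full event                                        35.9105       208.855       385.061       209.328       60.4435        20
produce:tpcrawdecoder:PDVDTPCReader               9.8994        16.5888       19.7427       16.8675       1.79282        20
produce:wclsdatavd:WireCellToolkit                15.9384       109.293       125.181       115.816       22.9998        20
produce:reco3d:SpacePointSolver                   2.76085       14.3479       22.1646       15.1076       4.04278        20
produce:pandora:StandardPandora                   4.56917       59.8632       229.408       52.2518       44.0778        20
"""


def parse_time_tracker(text: str) -> dict:
    """Map each row label to its Avg column (seconds); skip headers and rulers."""
    averages = {}
    for line in text.splitlines():
        # Split from the right so labels containing spaces ("Full event") survive.
        parts = line.strip().rsplit(None, 6)
        if len(parts) != 7:
            continue
        label, _min, avg, _max, _median, _rms, _n_events = parts
        try:
            averages[label.strip()] = float(avg)
        except ValueError:
            continue  # header row: the Avg column holds text, not a number
    return averages


averages = parse_time_tracker(SAMPLE)
full_event = averages.pop("Full event")

# Print modules from slowest to fastest with their share of the event time.
for label, avg in sorted(averages.items(), key=lambda item: item[1], reverse=True):
    print(f"{label:45s} {avg:9.3f} s  {100 * avg / full_event:5.1f}%")
```

On these rows, WireCellToolkit accounts for roughly half of the average event time and StandardPandora for a bit under a third.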

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 4718.56 MB
  Peak resident set size usage (VmHWM): 2829.19 MB
  Details saved in: 'mem.db'
====================================================================================================
Art has completed and will exit with status 0.
Output files:
\tReco: np02vd_raw_run039338_0015_df-s03-d2_dw_0_20250908T120011_reco_stage1_20250908T154212_keepup.root
\tHists: np02vd_raw_run039338_0015_df-s03-d2_dw_0_20250908T120011_reco_stage1_20250908T154212_keepup_hists.root
Forming reco metadata
Successfully opened file np02vd_raw_run039338_0015_df-s03-d2_dw_0_20250908T120011_reco_stage1_20250908T154212_keepup.root
Ran successfully
{
  "name": "np02vd_raw_run039338_0015_df-s03-d2_dw_0_20250908T120011_reco_stage1_20250908T154212_keepup.root",
  "namespace": "vd-protodune-det-reco",
  "metadata": {
    "core.file_format": "artroot",
    "core.application.name": "reco",
    "core.application.family": "dunesw",
    "core.application.version": "v10_10_00d00",
    "core.data_tier": "full-reconstructed",
    "dune.config_file": "standard_reco_stage1_protodunevd_keepup_all.fcl",
    "dune.campaign": "vd-protodune-reco-keepup-v0",
    "core.start_time": 1757346132.0,
    "core.end_time": 1757346133.0,
    "core.events": [
      8528,
      8548,
      8568,
      8588,
      8608,
      8628,
      8648,
      8668,
      8688,
      8708,
      8728,
      8748,
      8768,
      8788,
      8808,
      8828,
      8848,
      8868,
      8888,
      8908
    ],
    "core.event_count": 20,
    "core.first_event_number": 8528,
    "core.last_event_number": 8908,
    "core.data_stream": "physics",
    "core.file_content_status": "good",
    "core.file_type": "detector",
    "core.run_type": "vd-protodune",
    "core.runs": [
      39338
    ],
    "core.runs_subruns": [
      3933800001
    ],
    "dune.daq_test": false,
    "retention.status": "active",
    "retention.class": "physics"
  },
  "parents": [
    {
      "did": "vd-protodune:np02vd_raw_run039338_0015_df-s03-d2_dw_0_20250908T120011.hdf5"
    }
  ]
}
Forming hist metadata
formed
{
  "name": "np02vd_raw_run039338_0015_df-s03-d2_dw_0_20250908T120011_reco_stage1_20250908T154212_keepup_hists.root",
  "namespace": "vd-protodune-det-reco",
  "metadata": {
    "core.file_format": "root",
    "core.application.name": "reco",
    "core.application.family": "dunesw",
    "core.application.version": "v10_10_00d00",
    "core.data_tier": "root-tuple-virtual",
    "dune.config_file": "standard_reco_stage1_protodunevd_keepup_all.fcl",
    "dune.campaign": "vd-protodune-reco-keepup-v0",
    "core.start_time": 1757346132.0,
    "core.end_time": 1757346133.0,
    "core.data_stream": "physics",
    "core.file_content_status": "good",
    "core.file_type": "detector",
    "core.run_type": "vd-protodune",
    "core.runs": [
      39338
    ],
    "core.runs_subruns": [
      3933800001
    ],
    "dune.daq_test": false,
    "retention.status": "active",
    "retention.class": "physics"
  },
  "parents": [
    {
      "did": "vd-protodune:np02vd_raw_run039338_0015_df-s03-d2_dw_0_20250908T120011.hdf5"
    }
  ]
}
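
The metadata blocks printed in the log above are internally redundant in a few places, which makes them easy to sanity-check. The sketch below works on an abridged copy of the reco metadata; the function name is hypothetical, and reading `core.runs_subruns` as run * 100000 + subrun is an assumption inferred from the values shown, not something the log states.

```python
from datetime import datetime, timezone

# Abridged copy of the reco file's metadata from the log.
FILE_NAME = ("np02vd_raw_run039338_0015_df-s03-d2_dw_0_20250908T120011"
             "_reco_stage1_20250908T154212_keepup.root")
metadata = {
    "core.start_time": 1757346132.0,
    "core.end_time": 1757346133.0,
    # Shorthand for the 20 event numbers listed in the log: 8528, 8548, ..., 8908.
    "core.events": list(range(8528, 8909, 20)),
    "core.event_count": 20,
    "core.first_event_number": 8528,
    "core.last_event_number": 8908,
    "core.runs": [39338],
    "core.runs_subruns": [3933800001],
}


def check_metadata(md: dict, file_name: str) -> list:
    """Return a list of human-readable problems; an empty list means consistent."""
    problems = []
    events = md["core.events"]
    if md["core.event_count"] != len(events):
        problems.append("event_count does not match len(events)")
    if md["core.first_event_number"] != min(events):
        problems.append("first_event_number is not the smallest event")
    if md["core.last_event_number"] != max(events):
        problems.append("last_event_number is not the largest event")
    # Assumed encoding: run * 100000 + subrun.
    decoded_runs = {value // 100000 for value in md["core.runs_subruns"]}
    if decoded_runs != set(md["core.runs"]):
        problems.append("runs_subruns does not decode to the listed runs")
    # The file name embeds a UTC timestamp that should match core.start_time.
    stamp = datetime.fromtimestamp(md["core.start_time"], tz=timezone.utc)
    if stamp.strftime("%Y%m%dT%H%M%S") not in file_name:
        problems.append("start_time does not match the timestamp in the file name")
    return problems


print(check_metadata(metadata, FILE_NAME))
```

For this job the start time decodes to 2025-09-08 15:42:12 UTC, which agrees with both the file name and the "17:42:12 CEST" line where art closed the output file.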
justIN time: 2025-09-18 17:59:36 UTC       justIN version: 01.05.00