justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 261203.66@dunegpschedd02.fnal.gov

Jobsub ID261203.66@dunegpschedd02.fnal.gov
Workflow ID11160
Stage ID1
User namehiguera@fnal.gov
HTCondor Groupgroup_dune.prod_mcsim
RequestedProcessors1
GPUNo
RSS bytes9437184000 (9000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2025-12-11 20:02:38
SiteUK_Durham
EntryDUNE_UK_SGridDurham_ce4
Last heartbeat2025-12-12 05:18:47
From worker nodeHostnamen265.dur.scotgrid.ac.uk
cpuinfoAMD EPYC 9534 64-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes10485760000 (10000 MiB)
Wall seconds limit171000 (47 hours)
GPU
Inner Apptainer?True
Job statejobscript_error
Started2025-12-12 01:51:05
Input fileshd-protodune:pdhd_prod_beam__242589_0_1_20251113T023228Z_gen_g4_IonScintPDExt.root_248005_39_1_20251118T082600Z_PDInt.root
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Max RSS bytes0 (0 MiB)
Outputting started 
Output files
Finished2025-12-12 05:18:47
Saved logsjustin-logs:261203.66-dunegpschedd02.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

3rd record. run: 20250627 subRun: 1 event: 482 at 12-Dec-2025 03:31:24 GMT
%MSG-w PhotonBackTrackerService:  ProcessEvent 12-Dec-2025 03:31:56 GMT  run: 20250627 subRun: 1 event: 482
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 897938 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 897938 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 897938 depos spanning: [-3297.02, 3091.95]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 4th record. run: 20250627 subRun: 1 event: 483 at 12-Dec-2025 03:39:50 GMT
%MSG-w PhotonBackTrackerService:  ProcessEvent 12-Dec-2025 03:40:17 GMT  run: 20250627 subRun: 1 event: 483
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 840005 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 840005 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 840005 depos spanning: [-3264.83, 3107.4]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 5th record. run: 20250627 subRun: 1 event: 484 at 12-Dec-2025 03:50:55 GMT
%MSG-w PhotonBackTrackerService:  ProcessEvent 12-Dec-2025 03:52:20 GMT  run: 20250627 subRun: 1 event: 484
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 763746 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 763746 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 763746 depos spanning: [-3302.51, 3099.72]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 6th record. run: 20250627 subRun: 1 event: 485 at 12-Dec-2025 04:02:48 GMT
%MSG-w PhotonBackTrackerService:  ProcessEvent 12-Dec-2025 04:03:41 GMT  run: 20250627 subRun: 1 event: 485
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 1453818 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 1453818 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 1453818 depos spanning: [-3231.43, 3035.41]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 7th record. run: 20250627 subRun: 1 event: 486 at 12-Dec-2025 04:24:50 GMT
%MSG-w PhotonBackTrackerService:  ProcessEvent 12-Dec-2025 04:26:37 GMT  run: 20250627 subRun: 1 event: 486
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 939532 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 939532 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 939532 depos spanning: [-3180.72, 3091.35]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 8th record. run: 20250627 subRun: 1 event: 487 at 12-Dec-2025 04:42:40 GMT
%MSG-w PhotonBackTrackerService:  ProcessEvent 12-Dec-2025 04:44:28 GMT  run: 20250627 subRun: 1 event: 487
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 1216675 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 1216675 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 1216675 depos spanning: [-3302.4, 3092.56]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 9th record. run: 20250627 subRun: 1 event: 488 at 12-Dec-2025 04:59:56 GMT
R__unzip: error -3 in inflate (zlib)
%MSG-w ParticleInventory:  ProcessEvent 12-Dec-2025 05:01:36 GMT  run: 20250627 subRun: 1 event: 488
Rebuild failed to get the MCParticles. This is expected when running on a generation or simulation step.
%MSG
R__unzip: error -3 in inflate (zlib)
%MSG-w ParticleInventory:  ProcessEvent 12-Dec-2025 05:03:04 GMT  run: 20250627 subRun: 1 event: 488
Rebuild failed to get the MCParticles. This is expected when running on a generation or simulation step.
%MSG
R__unzip: error -3 in inflate (zlib)
%MSG-w ParticleInventory:  ProcessEvent 12-Dec-2025 05:04:26 GMT  run: 20250627 subRun: 1 event: 488
Rebuild failed to get the MCParticles. This is expected when running on a generation or simulation step.
%MSG
%MSG-w PhotonBackTrackerService:  ProcessEvent 12-Dec-2025 05:04:26 GMT  run: 20250627 subRun: 1 event: 488
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 1126030 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 1126030 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 1126030 depos spanning: [-3281.81, 3063.22]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
R__unzip: error -3 in inflate (zlib)
12-Dec-2025 05:18:22 GMT  Closed input file "root://fndcadoor.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/hd-protodune/28/4c/pdhd_prod_beam__242589_0_1_20251113T023228Z_gen_g4_IonScintPDExt.root_248005_39_1_20251118T082600Z_PDInt.root"

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                    0.00616109      715.799       1268.65       677.234       336.772         9     
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                        0.00470005     0.0281008     0.111057     0.00515676     0.0429668        9     
simulate:rns:RandomNumberSaver                3.9629e-05    6.51891e-05   0.000206871   4.6621e-05    5.07464e-05       9     
simulate:tpcrawdecoder:WireCellToolkit          290.288       398.064       529.449       411.928       67.676          9     
simulate:opdigi:OpDetDigitizerProtoDUNEHD       139.284       400.332       736.186       353.216       162.098         9     
simulate:crt:CRTSimRefac                       0.0416714     0.0743181     0.0948505     0.0813967     0.0190711        8     
[art]:TriggerResults:TriggerResultInserter     1.849e-05     3.116e-05     7.031e-05     2.713e-05    1.52545e-05       8     
end_path:out1:RootOutput                       3.58e-06      7.739e-06     1.521e-05     6.825e-06    3.15552e-06       8     
end_path:out1:RootOutput(write)                 2.2518        2.50386       2.79299       2.52782      0.198869         8     
================================================================================================================================
%MSG-i NuRandomService:  RootOutput:out1@EndJob 12-Dec-2025 05:18:22 GMT  ModuleEndJob

Summary of seeds computed by the NuRandomService
Random policy: 'random'
  master seed: 701053834
  seed within: [ 1 ; 900000000 ]

%MSG

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 9005.03 MB
  Peak resident set size usage (VmHWM): 7287.32 MB
  Details saved in: 'mem.db'
====================================================================================================

TrigReport ---------- Event summary -------------
TrigReport Events total = 9 passed = 8 failed = 1

TrigReport ---------- Modules in End-path ----------
TrigReport        Run    Success      Error Name
TrigReport          8          8          0 out1

TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 2573.072733 Real = 12404.375400

MemReport  ---------- Memory summary [base-10 MB] ------
MemReport  VmPeak = 9005.03 VmHWM = 7287.32

%MSG-s ArtException:  PostEndJob 12-Dec-2025 05:18:23 GMT ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- ScheduleExecutionFailure BEGIN
    Path: ProcessingStopped.
    ---- FileReadError BEGIN
      ---- FatalRootError BEGIN
        Fatal Root Error: TBasket::ReadBasketBuffers
        fNbytes = 85273870, fKeylen = 109, fObjlen = 357253787, noutot = 218103795, nout=0, nin=5632512, nbuf=16777215
        ROOT severity: 3000
      ---- FatalRootError END
      
      The above exception was thrown while processing module CRTSimRefac/crt run: 20250627 subRun: 1 event: 488
    ---- FileReadError END
    Exception going through path simulate
  ---- ScheduleExecutionFailure END
---- EventProcessorFailure END
---- FatalRootError BEGIN
  Fatal Root Error: TTree::SetEntries
  Tree branches have different numbers of entries, eg EventAuxiliary has 8 entries while art::RNGsnapshots_rns__IonScintPDExt. has 10 entries.
  ROOT severity: 2000
---- FatalRootError END
%MSG
Art has completed and will exit with status 1.
Detsim returns 1
justIN time: 2025-12-18 23:23:50 UTC       justIN version: 01.05.03