justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 258660.1@dunegpschedd02.fnal.gov

Jobsub ID258660.1@dunegpschedd02.fnal.gov
Workflow ID11042
Stage ID1
User namehiguera@fnal.gov
HTCondor Groupgroup_dune.prod_mcsim
RequestedProcessors1
GPUNo
RSS bytes9437184000 (9000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2025-12-06 23:54:34
SiteNL_SURFsara
EntryDUNE_SurfSARA_arc02
Last heartbeat2025-12-07 01:31:37
From worker nodeHostnamewn-db-07.gina.surf.nl
cpuinfoAMD EPYC 7702P 64-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes10485760000 (10000 MiB)
Wall seconds limit129600 (36 hours)
GPU
Inner Apptainer?True
Job statejobscript_error
Started2025-12-06 23:55:21
Input fileshd-protodune:pdhd_prod_beam__248356_112_1_20251113T031152Z_gen_g4_IonScintPDExt.root_247897_127_1_20251118T032516Z_PDInt.root
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Max RSS bytes0 (0 MiB)
Outputting started 
Output files
Finished2025-12-07 01:31:37
Saved logsjustin-logs:258660.1-dunegpschedd02.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

ng the 4th record. run: 20250623 subRun: 1 event: 33 at 07-Dec-2025 01:24:25 CET
%MSG-w PhotonBackTrackerService:  ProcessEvent 07-Dec-2025 01:24:34 CET  run: 20250623 subRun: 1 event: 33
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 895453 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 895453 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 895453 depos spanning: [-3186.37, 3073.91]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 5th record. run: 20250623 subRun: 1 event: 34 at 07-Dec-2025 01:31:47 CET
%MSG-w PhotonBackTrackerService:  ProcessEvent 07-Dec-2025 01:31:55 CET  run: 20250623 subRun: 1 event: 34
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 908470 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 908470 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 908470 depos spanning: [-3292.38, 3094.42]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 6th record. run: 20250623 subRun: 1 event: 35 at 07-Dec-2025 01:39:44 CET
%MSG-w PhotonBackTrackerService:  ProcessEvent 07-Dec-2025 01:39:53 CET  run: 20250623 subRun: 1 event: 35
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 1582203 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 1582203 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 1582203 depos spanning: [-3261.99, 3107.16]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 7th record. run: 20250623 subRun: 1 event: 36 at 07-Dec-2025 01:52:32 CET
%MSG-w PhotonBackTrackerService:  ProcessEvent 07-Dec-2025 01:52:44 CET  run: 20250623 subRun: 1 event: 36
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 1183676 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 1183676 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 1183676 depos spanning: [-3267.65, 3116.91]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 8th record. run: 20250623 subRun: 1 event: 37 at 07-Dec-2025 02:01:59 CET
%MSG-w PhotonBackTrackerService:  ProcessEvent 07-Dec-2025 02:02:09 CET  run: 20250623 subRun: 1 event: 37
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 1018291 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 1018291 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 1018291 depos spanning: [-3286.5, 3031.78]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 9th record. run: 20250623 subRun: 1 event: 38 at 07-Dec-2025 02:10:23 CET
%MSG-w PhotonBackTrackerService:  ProcessEvent 07-Dec-2025 02:10:34 CET  run: 20250623 subRun: 1 event: 38
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 1201147 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 1201147 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 1201147 depos spanning: [-3288.04, 2997.08]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 10th record. run: 20250623 subRun: 1 event: 39 at 07-Dec-2025 02:19:51 CET
R__unzip: error -3 in inflate (zlib)
%MSG-w ParticleInventory:  ProcessEvent 07-Dec-2025 02:19:56 CET  run: 20250623 subRun: 1 event: 39
Rebuild failed to get the MCParticles. This is expected when running on a generation or simulation step.
%MSG
R__unzip: error -3 in inflate (zlib)
%MSG-w ParticleInventory:  ProcessEvent 07-Dec-2025 02:20:00 CET  run: 20250623 subRun: 1 event: 39
Rebuild failed to get the MCParticles. This is expected when running on a generation or simulation step.
%MSG
R__unzip: error -3 in inflate (zlib)
%MSG-w ParticleInventory:  ProcessEvent 07-Dec-2025 02:20:03 CET  run: 20250623 subRun: 1 event: 39
Rebuild failed to get the MCParticles. This is expected when running on a generation or simulation step.
%MSG
%MSG-w PhotonBackTrackerService:  ProcessEvent 07-Dec-2025 02:20:03 CET  run: 20250623 subRun: 1 event: 39
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 1480759 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 1480759 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 1480759 depos spanning: [-3320.13, 3101.08]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
R__unzip: error -3 in inflate (zlib)
07-Dec-2025 02:31:03 CET  Closed input file "root://fndcadoor.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/hd-protodune/5c/52/pdhd_prod_beam__248356_112_1_20251113T031152Z_gen_g4_IonScintPDExt.root_247897_127_1_20251118T032516Z_PDInt.root"

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                    0.00916499      471.514       757.794       480.358       182.528        10     
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                        0.00715992     0.0302774     0.118847      0.0096816     0.0411716       10     
simulate:rns:RandomNumberSaver                4.9064e-05    0.000104269   0.000416454   6.1097e-05    0.000105855      10     
simulate:tpcrawdecoder:WireCellToolkit          383.935       498.639       709.725       487.035       97.7824        10     
simulate:opdigi:OpDetDigitizerProtoDUNEHD       26.9284       34.5877       44.9151       33.4789       6.25169        10     
simulate:crt:CRTSimRefac                       0.0569445     0.0784069     0.115864      0.0719047     0.0197915        9     
[art]:TriggerResults:TriggerResultInserter    2.6681e-05    4.34306e-05   0.000103738    3.711e-05    2.21637e-05       9     
end_path:out1:RootOutput                       5.691e-06     9.547e-06    2.9285e-05     6.633e-06    7.11564e-06       9     
end_path:out1:RootOutput(write)                 3.61097       3.91244       4.58657       3.82312      0.283307         9     
================================================================================================================================
%MSG-i NuRandomService:  RootOutput:out1@EndJob 07-Dec-2025 02:31:03 CET  ModuleEndJob

Summary of seeds computed by the NuRandomService
Random policy: 'random'
  master seed: 590738663
  seed within: [ 1 ; 900000000 ]

%MSG

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 10168.3 MB
  Peak resident set size usage (VmHWM): 8462.02 MB
  Details saved in: 'mem.db'
====================================================================================================

TrigReport ---------- Event summary -------------
TrigReport Events total = 10 passed = 9 failed = 1

TrigReport ---------- Modules in End-path ----------
TrigReport        Run    Success      Error Name
TrigReport          9          9          0 out1

TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 5217.301833 Real = 5684.249696

MemReport  ---------- Memory summary [base-10 MB] ------
MemReport  VmPeak = 10168.3 VmHWM = 8462.02

%MSG-s ArtException:  PostEndJob 07-Dec-2025 02:31:05 CET ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- ScheduleExecutionFailure BEGIN
    Path: ProcessingStopped.
    ---- FileReadError BEGIN
      ---- FatalRootError BEGIN
        Fatal Root Error: TBasket::ReadBasketBuffers
        fNbytes = 94403549, fKeylen = 109, fObjlen = 401334326, noutot = 201326580, nout=0, nin=5617716, nbuf=16777215
        ROOT severity: 3000
      ---- FatalRootError END
      
      The above exception was thrown while processing module CRTSimRefac/crt run: 20250623 subRun: 1 event: 39
    ---- FileReadError END
    Exception going through path simulate
  ---- ScheduleExecutionFailure END
---- EventProcessorFailure END
---- FatalRootError BEGIN
  Fatal Root Error: TTree::SetEntries
  Tree branches have different numbers of entries, eg EventAuxiliary has 9 entries while art::RNGsnapshots_rns__IonScintPDExt. has 10 entries.
  ROOT severity: 2000
---- FatalRootError END
%MSG
Art has completed and will exit with status 1.
Detsim returns 1
justIN time: 2025-12-19 02:01:11 UTC       justIN version: 01.05.03