justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 266996.74@dunegpschedd01.fnal.gov

Jobsub ID266996.74@dunegpschedd01.fnal.gov
Workflow ID11160
Stage ID1
User namehiguera@fnal.gov
HTCondor Groupgroup_dune.prod_mcsim
RequestedProcessors1
GPUNo
RSS bytes9437184000 (9000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2025-12-11 20:42:39
SiteUS_Wisconsin
EntryHCCHTPC_US_Wisconsin_osg01_rhel7
Last heartbeat2025-12-12 07:11:03
From worker nodeHostnamee4040
cpuinfoAMD EPYC 7763 64-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes10485760000 (10000 MiB)
Wall seconds limit82800 (23 hours)
GPU
Inner Apptainer?True
Job statejobscript_error
Started2025-12-12 06:27:00
Input fileshd-protodune:pdhd_prod_beam__242589_0_1_20251113T023228Z_gen_g4_IonScintPDExt.root_248005_39_1_20251118T082600Z_PDInt.root
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Max RSS bytes0 (0 MiB)
Outputting started 
Output files
Finished2025-12-12 07:11:03
Saved logsjustin-logs:266996.74-dunegpschedd01.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

3rd record. run: 20250627 subRun: 1 event: 482 at 12-Dec-2025 00:37:09 CST
%MSG-w PhotonBackTrackerService:  ProcessEvent 12-Dec-2025 00:37:13 CST  run: 20250627 subRun: 1 event: 482
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 897938 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 897938 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 897938 depos spanning: [-3297.02, 3091.95]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 4th record. run: 20250627 subRun: 1 event: 483 at 12-Dec-2025 00:41:52 CST
%MSG-w PhotonBackTrackerService:  ProcessEvent 12-Dec-2025 00:41:56 CST  run: 20250627 subRun: 1 event: 483
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 840005 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 840005 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 840005 depos spanning: [-3264.83, 3107.4]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 5th record. run: 20250627 subRun: 1 event: 484 at 12-Dec-2025 00:46:02 CST
%MSG-w PhotonBackTrackerService:  ProcessEvent 12-Dec-2025 00:46:05 CST  run: 20250627 subRun: 1 event: 484
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 763746 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 763746 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 763746 depos spanning: [-3302.51, 3099.72]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 6th record. run: 20250627 subRun: 1 event: 485 at 12-Dec-2025 00:49:59 CST
%MSG-w PhotonBackTrackerService:  ProcessEvent 12-Dec-2025 00:50:03 CST  run: 20250627 subRun: 1 event: 485
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 1453818 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 1453818 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 1453818 depos spanning: [-3231.43, 3035.41]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 7th record. run: 20250627 subRun: 1 event: 486 at 12-Dec-2025 00:56:46 CST
%MSG-w PhotonBackTrackerService:  ProcessEvent 12-Dec-2025 00:56:49 CST  run: 20250627 subRun: 1 event: 486
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 939532 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 939532 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 939532 depos spanning: [-3180.72, 3091.35]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 8th record. run: 20250627 subRun: 1 event: 487 at 12-Dec-2025 01:01:06 CST
%MSG-w PhotonBackTrackerService:  ProcessEvent 12-Dec-2025 01:01:11 CST  run: 20250627 subRun: 1 event: 487
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 1216675 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 1216675 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 1216675 depos spanning: [-3302.4, 3092.56]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
Begin processing the 9th record. run: 20250627 subRun: 1 event: 488 at 12-Dec-2025 01:06:07 CST
R__unzip: error -3 in inflate (zlib)
%MSG-w ParticleInventory:  ProcessEvent 12-Dec-2025 01:06:09 CST  run: 20250627 subRun: 1 event: 488
Rebuild failed to get the MCParticles. This is expected when running on a generation or simulation step.
%MSG
R__unzip: error -3 in inflate (zlib)
%MSG-w ParticleInventory:  ProcessEvent 12-Dec-2025 01:06:10 CST  run: 20250627 subRun: 1 event: 488
Rebuild failed to get the MCParticles. This is expected when running on a generation or simulation step.
%MSG
R__unzip: error -3 in inflate (zlib)
%MSG-w ParticleInventory:  ProcessEvent 12-Dec-2025 01:06:11 CST  run: 20250627 subRun: 1 event: 488
Rebuild failed to get the MCParticles. This is expected when running on a generation or simulation step.
%MSG
%MSG-w PhotonBackTrackerService:  ProcessEvent 12-Dec-2025 01:06:11 CST  run: 20250627 subRun: 1 event: 488
Rebuild failed to get the OpDetBTRs. This is expected when running on a generation or simulation step.
%MSG
SimDepoSource got 1126030 depos from art tag "InputTag: label = 'IonAndScint', instance = ''" returns: okay
Larwirecell::SimDepoSource got 1126030 associated depos from InputTag: label = 'IonAndScint', instance = 'priorSCE'
SimDepoSource: ready with 1126030 depos spanning: [-3281.81, 3063.22]us
Retagger: tagging trace set: daq with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "daq"
R__unzip: error -3 in inflate (zlib)
12-Dec-2025 01:10:49 CST  Closed input file "root://fndcadoor.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/hd-protodune/28/4c/pdhd_prod_beam__242589_0_1_20251113T023228Z_gen_g4_IonScintPDExt.root_248005_39_1_20251118T082600Z_PDInt.root"

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                    0.00493721      246.999       402.043       256.275       99.3994         9     
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                        0.00428533     0.0094306     0.0258058    0.00493721     0.0075196        9     
simulate:rns:RandomNumberSaver                5.1035e-05    9.65324e-05   0.000371795   6.0914e-05    9.77604e-05       9     
simulate:tpcrawdecoder:WireCellToolkit          219.021       260.068       379.114       244.531       45.2506         9     
simulate:opdigi:OpDetDigitizerProtoDUNEHD       10.911        14.2158       18.7814       13.7275       2.49589         9     
simulate:crt:CRTSimRefac                       0.0685299     0.0838995     0.0997914     0.082784      0.0109422        8     
[art]:TriggerResults:TriggerResultInserter    2.0949e-05    3.37905e-05   9.6531e-05    2.5578e-05    2.37872e-05       8     
end_path:out1:RootOutput                       7.334e-06    1.07035e-05   2.8523e-05     8.265e-06    6.77464e-06       8     
end_path:out1:RootOutput(write)                 3.25085       3.6494        4.34815       3.49704      0.352662         8     
================================================================================================================================
%MSG-i NuRandomService:  RootOutput:out1@EndJob 12-Dec-2025 01:10:49 CST  ModuleEndJob

Summary of seeds computed by the NuRandomService
Random policy: 'random'
  master seed: 802689196
  seed within: [ 1 ; 900000000 ]

%MSG

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 9195.59 MB
  Peak resident set size usage (VmHWM): 7407.42 MB
  Details saved in: 'mem.db'
====================================================================================================

TrigReport ---------- Event summary -------------
TrigReport Events total = 9 passed = 8 failed = 1

TrigReport ---------- Modules in End-path ----------
TrigReport        Run    Success      Error Name
TrigReport          8          8          0 out1

TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 2466.326788 Real = 2597.704715

MemReport  ---------- Memory summary [base-10 MB] ------
MemReport  VmPeak = 9195.59 VmHWM = 7407.42

%MSG-s ArtException:  PostEndJob 12-Dec-2025 01:10:50 CST ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- ScheduleExecutionFailure BEGIN
    Path: ProcessingStopped.
    ---- FileReadError BEGIN
      ---- FatalRootError BEGIN
        Fatal Root Error: TBasket::ReadBasketBuffers
        fNbytes = 85273870, fKeylen = 109, fObjlen = 357253787, noutot = 218103795, nout=0, nin=5632512, nbuf=16777215
        ROOT severity: 3000
      ---- FatalRootError END
      
      The above exception was thrown while processing module CRTSimRefac/crt run: 20250627 subRun: 1 event: 488
    ---- FileReadError END
    Exception going through path simulate
  ---- ScheduleExecutionFailure END
---- EventProcessorFailure END
---- FatalRootError BEGIN
  Fatal Root Error: TTree::SetEntries
  Tree branches have different numbers of entries, eg EventAuxiliary has 8 entries while art::RNGsnapshots_rns__IonScintPDExt. has 10 entries.
  ROOT severity: 2000
---- FatalRootError END
%MSG
Art has completed and will exit with status 1.
Detsim returns 1
justIN time: 2025-12-19 09:19:34 UTC       justIN version: 01.05.03