
Jobsub ID 297341.13@dunegpschedd01.fnal.gov

Jobsub ID: 297341.13@dunegpschedd01.fnal.gov
Workflow ID: 12134
Stage ID: 1
User name: galli@fnal.gov
Requested:
  Processors: 1
  GPU: No
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 80000 (22 hours)
Submitted time: 2026-01-21 01:34:12
Site: UK_Sheffield
Entry: DUNE_UK_Sheffield_lcgce2
Last heartbeat: 2026-01-21 03:25:03
From worker node:
  Hostname: wn004.hep
  cpuinfo: Intel(R) Core(TM) i7-5960X CPU @ 3.00GHz
  OS release: Scientific Linux release 7.9 (Nitrogen)
  Processors: 1
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 257400 (71 hours)
  GPU:
  Inner Apptainer?: True
Job state: jobscript_error
Started: 2026-01-21 01:35:33
Input files: fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50258586_491_20231122T083046Z_gen_g4_detsim_hitreco__20240503T055541Z_reco2.root
Jobscript:
  Exit code: 1
  Real time: 0m (0s)
  CPU time: 0m (0s = 0%)
  Max RSS bytes: 0 (0 MiB)
Outputting started:
Output files:
Finished: 2026-01-21 03:25:03
Saved logs: justin-logs:297341.13-dunegpschedd01.fnal.gov.logs.tgz

Jobscript log (last 10,000 characters)

Input PFN = root://xrootd1.esc.qmul.ac.uk:1094//dune/RSE/fardet-hd/b9/d3/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50258586_491_20231122T083046Z_gen_g4_detsim_hitreco__20240503T055541Z_reco2.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
Using custom sources from /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2

MRB_PROJECT=larsoft
MRB_PROJECT_VERSION=v09_91_04
MRB_QUALS=e26:prof
MRB_TOP=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft
MRB_SOURCE=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/srcs
MRB_BUILDDIR=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/build_slf7.x86_64
MRB_INSTALL=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof

PRODUCTS=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof:/cvmfs/dune.opensciencegrid.org/products/dune/testproducts:/cvmfs/dune.opensciencegrid.org/products/dune:/cvmfs/larsoft.opensciencegrid.org/products:/cvmfs/larsoft.opensciencegrid.org/packages:/cvmfs/fermilab.opensciencegrid.org/products/common/db/
CETPKG_INSTALL=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof

local product directory is /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof
----------- this block should be empty ------------------
---------------------------------------------------------
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_5_5a/Linux64bit+3.10-2.17-e26-p3915-prof/lib/libXrdPosixPreload.so
../justin-jobscript: line 77:  1261 Segmentation fault      (core dumped) lar -c $FCL_FILE $events_option $OUTPUT_CMD "$pfn" > ${fname}_reco_${now}.log 2>&1
lar exit code 139
=== Start last 100 lines of lar log file ===
No muon's daughter PDG code: 1000180400
No muon's daughter Track ID: 615
No muon's daughter PDG code: 1000180400
No muon's daughter Track ID: 616
No muon's daughter PDG code: 1000180400
No muon's daughter Track ID: 617
No muon's daughter PDG code: 1000180400
No muon's daughter Track ID: 618
No muon's daughter PDG code: 1000180400
No muon's daughter Track ID: 619
No muon's daughter PDG code: 2112
No muon's daughter Track ID: 620
No muon's daughter PDG code: 22
No muon's daughter Track ID: 621
No muon's daughter PDG code: 22
No muon's daughter Track ID: 622
No muon's daughter PDG code: 1000180400
No muon's daughter Track ID: 623
No muon's daughter PDG code: 1000180400
No muon's daughter Track ID: 637
No muon's daughter PDG code: 1000180400
No muon's daughter Track ID: 638
No muon's daughter PDG code: 1000180400
No muon's daughter Track ID: 639
No muon's daughter PDG code: 1000180400
No muon's daughter Track ID: 640
No muon's daughter PDG code: 1000180400
No muon's daughter Track ID: 641
No muon's daughter PDG code: 1000180400
No muon's daughter Track ID: 642
No muon's daughter PDG code: 1000180400
No muon's daughter Track ID: 643
No muon's daughter PDG code: 22
No muon's daughter Track ID: 644
No muon's daughter PDG code: 2112
No muon's daughter Track ID: 645
No muon's daughter PDG code: 1000180400
No muon's daughter Track ID: 646
%MSG-e ana:  AtmSelection:atmselection@BeginModule 21-Jan-2026 03:21:46 GMT  run: 50258586 subRun: 1 event: 98298
... Filling Reco Tree ... 
%MSG
N PFParticles: 4
%MSG-e ana:  AtmSelection:atmselection@BeginModule 21-Jan-2026 03:21:47 GMT  run: 50258586 subRun: 1 event: 98298
------------------------ RECO ---------------------------------------
---> Module_Label: pandora;		 Instance_Name: ;	 Product_Type: recob::PFParticle; Output_Name: Reco
---> Module_Label: pandoraTrack;	 Instance_Name: ;	 Product_Type: recob::Track;	 Output_Name: Reco
---> Module_Label: pandora;		 Instance_Name: ;	 Product_Type: recob::Cluster;	 Output_Name: Reco
Number of PFParticles: 4
---------------------------------------------------------------------
%MSG
number: 0

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                      1.80341       41.718        363.591       16.7612       61.6727        98     
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                        0.00997686     0.861619       39.7047      0.107418       4.07185        98     
prod:emtrkmichelid:EmTrackMichelId              1.58756       35.0003       342.897       10.3783       58.2291        98     
[art]:TriggerResults:TriggerResultInserter    1.2336e-05    2.00548e-05   5.0965e-05    1.9221e-05    4.88485e-06      98     
end_path:atmselection:AtmSelection             0.0564036      6.84802       171.75        1.04239       18.8091        97     
================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 4977.38 MB
  Peak resident set size usage (VmHWM): 1463.88 MB
====================================================================================================
%MSG-s ArtException:  PostEndJob 21-Jan-2026 03:24:42 GMT ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- EventProcessorFailure BEGIN
    EndPathExecutor: an exception occurred during current event processing
    ---- ScheduleExecutionFailure BEGIN
      Path: ProcessingStopped.
      ---- FileReadError BEGIN
        ---- FatalRootError BEGIN
          Fatal Root Error: TNetXNGFile::ReadBuffer
          [FATAL] Hand shake failed
          ROOT severity: 3000
        ---- FatalRootError END
        
        The above exception was thrown while processing module AtmSelection/atmselection run: 50258586 subRun: 1 event: 98298
      ---- FileReadError END
      Exception going through path end_path
    ---- ScheduleExecutionFailure END
  ---- EventProcessorFailure END
---- EventProcessorFailure END
---- FatalRootError BEGIN
  Fatal Root Error: TNetXNGFile::Close
  [FATAL] Hand shake failed
  ROOT severity: 3000
---- FatalRootError END
---- FatalRootError BEGIN
  Fatal Root Error: TNetXNGFile::Close
  [FATAL] Hand shake failed
  ROOT severity: 3000
---- FatalRootError END
%MSG
=== End last 100 lines of lar log file ===
.:
total 3224
-rw-r--r--. 1 dune004 dune 3159497 Jan 21 03:24 atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50258586_491_20231122T083046Z_gen_g4_detsim_hitreco__20240503T055541Z_reco2_ana_2026-01-21T_013537Z.root
-rw-r--r--. 1 dune004 dune  125594 Jan 21 03:24 atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50258586_491_20231122T083046Z_gen_g4_detsim_hitreco__20240503T055541Z_reco2_reco_2026-01-21T_013537Z.log
-rw-r--r--. 1 dune004 dune    7135 Jan 21 03:24 jobscript.log
-rw-r--r--. 1 dune004 dune     138 Jan 21 01:35 all-input-dids.txt
-rw-r--r--. 1 dune004 dune       0 Jan 21 01:35 debugprod.log
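
The jobscript log above reports "lar exit code 139" next to a "Segmentation fault (core dumped)" message. Under the usual POSIX shell convention, an exit status above 128 means the process was terminated by signal number (status - 128), so 139 corresponds to signal 11. The following is a minimal sketch of that decoding, assuming the jobscript's shell follows this convention; the function name describe_exit_code is illustrative and is not part of justIN or LArSoft.

```python
import signal


def describe_exit_code(code: int) -> str:
    """Interpret a shell-reported exit status.

    Assumption: the shell reports 128 + N for a child killed by signal N,
    which is the common POSIX shell behaviour.
    """
    if code > 128:
        signum = code - 128
        try:
            # Look up the symbolic name of the signal on this platform.
            name = signal.Signals(signum).name
        except ValueError:
            name = "unknown signal"
        return f"terminated by signal {signum} ({name})"
    return f"exited with status {code}"


if __name__ == "__main__":
    # 139 is the lar exit code from the log; 1 is the jobscript exit code.
    for status in (139, 1):
        print(status, "->", describe_exit_code(status))
```

Read this way, the lar process itself was killed by a segmentation fault, while the jobscript exit code of 1 in the summary table is the wrapper script's own status after that failure.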
justIN time: 2026-02-04 04:48:38 UTC       justIN version: 01.06.00