justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 291609.124@dunegpschedd02.fnal.gov

Jobsub ID291609.124@dunegpschedd02.fnal.gov
Workflow ID12134
Stage ID1
User namegalli@fnal.gov
RequestedProcessors1
GPUNo
RSS bytes4194304000 (4000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2026-01-21 01:40:12
SiteUK_Sheffield
EntryDUNE_UK_Sheffield_lcgce2
Last heartbeat2026-01-21 02:05:37
From worker nodeHostnamewn097.hep
cpuinfoIntel(R) Xeon(R) CPU E5-2698 v4 @ 2.20GHz
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit257400 (71 hours)
GPU
Inner Apptainer?True
Job statejobscript_error
Started2026-01-21 01:43:14
Input filesfardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_66009532_292_20231203T232041Z_gen_g4_detsim_hitreco__20240509T205144Z_reco2.root
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Max RSS bytes0 (0 MiB)
Outputting started 
Output files
Finished2026-01-21 02:05:37
Saved logsjustin-logs:291609.124-dunegpschedd02.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

Input PFN = root://xrootd1.esc.qmul.ac.uk:1094//dune/RSE/fardet-hd/83/bc/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_66009532_292_20231203T232041Z_gen_g4_detsim_hitreco__20240509T205144Z_reco2.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
Using custom sources from /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2

MRB_PROJECT=larsoft
MRB_PROJECT_VERSION=v09_91_04
MRB_QUALS=e26:prof
MRB_TOP=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft
MRB_SOURCE=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/srcs
MRB_BUILDDIR=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/build_slf7.x86_64
MRB_INSTALL=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof

PRODUCTS=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof:/cvmfs/dune.opensciencegrid.org/products/dune/testproducts:/cvmfs/dune.opensciencegrid.org/products/dune:/cvmfs/larsoft.opensciencegrid.org/products:/cvmfs/larsoft.opensciencegrid.org/packages:/cvmfs/fermilab.opensciencegrid.org/products/common/db/
CETPKG_INSTALL=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof

local product directory is /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof
----------- this block should be empty ------------------
---------------------------------------------------------
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_5_5a/Linux64bit+3.10-2.17-e26-p3915-prof/lib/libXrdPosixPreload.so
../justin-jobscript: line 77:  1261 Segmentation fault      (core dumped) lar -c $FCL_FILE $events_option $OUTPUT_CMD "$pfn" > ${fname}_reco_${now}.log 2>&1
lar exit code 139
=== Start last 100 lines of lar log file ===
Muon's daughter PDG code: 22
Muon's daughter Track ID: 286
Muon's daughter PDG code: 22
Muon's daughter Track ID: 287
Muon's daughter PDG code: 22
Muon's daughter Track ID: 288
Muon's daughter PDG code: 22
Muon's daughter Track ID: 289
Muon's daughter PDG code: 22
Muon's daughter Track ID: 290
Muon's daughter PDG code: 14
Muon's daughter Track ID: 291
Muon's daughter PDG code: 22
Muon's daughter Track ID: 292
Muon's daughter PDG code: 22
Muon's daughter Track ID: 293
Muon's daughter PDG code: 1000170400
Muon's daughter Track ID: 294
%MSG-e ana:  AtmSelection:atmselection@BeginModule 21-Jan-2026 02:02:18 GMT  run: 66009532 subRun: 1 event: 29214
... Filling Reco Tree ... 
%MSG
N PFParticles: 4
%MSG-e ana:  AtmSelection:atmselection@BeginModule 21-Jan-2026 02:02:20 GMT  run: 66009532 subRun: 1 event: 29214
------------------------ RECO ---------------------------------------
---> Module_Label: pandora;		 Instance_Name: ;	 Product_Type: recob::PFParticle; Output_Name: Reco
---> Module_Label: pandoraTrack;	 Instance_Name: ;	 Product_Type: recob::Track;	 Output_Name: Reco
---> Module_Label: pandora;		 Instance_Name: ;	 Product_Type: recob::Cluster;	 Output_Name: Reco
Number of PFParticles: 4
---------------------------------------------------------------------
%MSG
number: 0
EndHit of sim_muon: 20
EndHit of sim_muon: 0
EndHit of sim_muon: 0
Vertex vector size = 4
Begin processing the 15th record. run: 66009532 subRun: 1 event: 29215 at 21-Jan-2026 02:02:31 GMT
/home

CCNC: 1
PDG: -14
N GENIE particles: 19
...............................................................

Begin processing the 16th record. run: 66009532 subRun: 1 event: 29216 at 21-Jan-2026 02:03:02 GMT
/home

CCNC: 1
PDG: 14
N GENIE particles: 29
...............................................................


================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                     0.0392715      42.2511       234.537       18.141        66.0803        16     
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                         0.0392715     0.220263      0.912918      0.135255      0.218738        16     
prod:emtrkmichelid:EmTrackMichelId              4.91007       38.1302       218.607       14.6404       61.4399        16     
[art]:TriggerResults:TriggerResultInserter     2.103e-05    3.98967e-05   8.6215e-05    3.99375e-05   1.51852e-05      16     
end_path:atmselection:AtmSelection             0.172332       5.68901       15.7503       1.04862       6.20869        15     
================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 13179.8 MB
  Peak resident set size usage (VmHWM): 1067.5 MB
====================================================================================================
%MSG-s ArtException:  PostEndJob 21-Jan-2026 02:05:11 GMT ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- EventProcessorFailure BEGIN
    EndPathExecutor: an exception occurred during current event processing
    ---- ScheduleExecutionFailure BEGIN
      Path: ProcessingStopped.
      ---- FileReadError BEGIN
        ---- FatalRootError BEGIN
          Fatal Root Error: TNetXNGFile::ReadBuffer
          [FATAL] Hand shake failed
          ROOT severity: 3000
        ---- FatalRootError END
        
        The above exception was thrown while processing module AtmSelection/atmselection run: 66009532 subRun: 1 event: 29216
      ---- FileReadError END
      Exception going through path end_path
    ---- ScheduleExecutionFailure END
  ---- EventProcessorFailure END
---- EventProcessorFailure END
---- FatalRootError BEGIN
  Fatal Root Error: TNetXNGFile::Close
  [FATAL] Hand shake failed
  ROOT severity: 3000
---- FatalRootError END
---- FatalRootError BEGIN
  Fatal Root Error: TNetXNGFile::Close
  [FATAL] Hand shake failed
  ROOT severity: 3000
---- FatalRootError END
%MSG
=== End last 100 lines of lar log file ===
.:
total 1220
-rw-r--r--. 1 dune004 dune 1205316 Jan 21 02:05 atmnu_max_weighted_randompolicy_dune10kt_1x2x6_66009532_292_20231203T232041Z_gen_g4_detsim_hitreco__20240509T205144Z_reco2_ana_2026-01-21T_014320Z.root
-rw-r--r--. 1 dune004 dune   28169 Jan 21 02:05 atmnu_max_weighted_randompolicy_dune10kt_1x2x6_66009532_292_20231203T232041Z_gen_g4_detsim_hitreco__20240509T205144Z_reco2_reco_2026-01-21T_014320Z.log
-rw-r--r--. 1 dune004 dune    6831 Jan 21 02:05 jobscript.log
-rw-r--r--. 1 dune004 dune     138 Jan 21 01:43 all-input-dids.txt
-rw-r--r--. 1 dune004 dune       0 Jan 21 01:43 debugprod.log
justIN time: 2026-02-04 04:48:40 UTC       justIN version: 01.06.00