justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 297369.34@dunegpschedd01.fnal.gov

Jobsub ID297369.34@dunegpschedd01.fnal.gov
Workflow ID12138
Stage ID1
User namegalli@fnal.gov
RequestedProcessors1
GPUNo
RSS bytes4194304000 (4000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2026-01-21 01:40:14
SiteUK_Brunel
EntryCMSHTPC_T2_UK_London_Brunel_dc2_26
Last heartbeat2026-01-21 02:13:16
From worker nodeHostnamewn-a4-03
cpuinfoAMD EPYC 7452 32-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit171000 (47 hours)
GPU
Inner Apptainer?True
Job statejobscript_error
Started2026-01-21 01:42:10
Input filesfardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6405486_421_20231202T113536Z_gen_g4_detsim_hitreco__20240507T214811Z_reco2.root
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Max RSS bytes0 (0 MiB)
Outputting started 
Output files
Finished2026-01-21 02:13:16
Saved logsjustin-logs:297369.34-dunegpschedd01.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

Input PFN = root://xrootd1.esc.qmul.ac.uk:1094//dune/RSE/fardet-hd/cd/ac/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6405486_421_20231202T113536Z_gen_g4_detsim_hitreco__20240507T214811Z_reco2.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
Using custom sources from /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2

MRB_PROJECT=larsoft
MRB_PROJECT_VERSION=v09_91_04
MRB_QUALS=e26:prof
MRB_TOP=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft
MRB_SOURCE=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/srcs
MRB_BUILDDIR=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/build_slf7.x86_64
MRB_INSTALL=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof

PRODUCTS=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof:/cvmfs/dune.opensciencegrid.org/products/dune/testproducts:/cvmfs/dune.opensciencegrid.org/products/dune:/cvmfs/larsoft.opensciencegrid.org/products:/cvmfs/larsoft.opensciencegrid.org/packages:/cvmfs/fermilab.opensciencegrid.org/products/common/db/
CETPKG_INSTALL=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof

local product directory is /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof
----------- this block should be empty ------------------
---------------------------------------------------------
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_5_5a/Linux64bit+3.10-2.17-e26-p3915-prof/lib/libXrdPosixPreload.so
../justin-jobscript: line 77:  1261 Segmentation fault      (core dumped) lar -c $FCL_FILE $events_option $OUTPUT_CMD "$pfn" > ${fname}_reco_${now}.log 2>&1
lar exit code 139
=== Start last 100 lines of lar log file ===
/home

Begin processing the 3rd record. run: 6405486 subRun: 1 event: 42103 at 21-Jan-2026 01:44:24 GMT
/home

Begin processing the 4th record. run: 6405486 subRun: 1 event: 42104 at 21-Jan-2026 01:46:32 GMT
/home

Begin processing the 5th record. run: 6405486 subRun: 1 event: 42105 at 21-Jan-2026 01:46:53 GMT
/home

Begin processing the 6th record. run: 6405486 subRun: 1 event: 42106 at 21-Jan-2026 01:47:02 GMT
/home

Begin processing the 7th record. run: 6405486 subRun: 1 event: 42107 at 21-Jan-2026 01:47:16 GMT
/home

Begin processing the 8th record. run: 6405486 subRun: 1 event: 42108 at 21-Jan-2026 01:48:31 GMT
/home

Begin processing the 9th record. run: 6405486 subRun: 1 event: 42109 at 21-Jan-2026 01:48:47 GMT
/home

Begin processing the 10th record. run: 6405486 subRun: 1 event: 42110 at 21-Jan-2026 01:48:52 GMT
/home

Begin processing the 11th record. run: 6405486 subRun: 1 event: 42111 at 21-Jan-2026 01:49:02 GMT
/home

Begin processing the 12th record. run: 6405486 subRun: 1 event: 42112 at 21-Jan-2026 01:50:51 GMT
/home

Begin processing the 13th record. run: 6405486 subRun: 1 event: 42113 at 21-Jan-2026 01:51:51 GMT
/home

Begin processing the 14th record. run: 6405486 subRun: 1 event: 42114 at 21-Jan-2026 01:53:04 GMT
/home

Begin processing the 15th record. run: 6405486 subRun: 1 event: 42115 at 21-Jan-2026 02:01:05 GMT
/home

Begin processing the 16th record. run: 6405486 subRun: 1 event: 42116 at 21-Jan-2026 02:02:51 GMT
/home

Begin processing the 17th record. run: 6405486 subRun: 1 event: 42117 at 21-Jan-2026 02:03:58 GMT
/home

Begin processing the 18th record. run: 6405486 subRun: 1 event: 42118 at 21-Jan-2026 02:05:16 GMT
/home

Begin processing the 19th record. run: 6405486 subRun: 1 event: 42119 at 21-Jan-2026 02:08:41 GMT
/home

Begin processing the 20th record. run: 6405486 subRun: 1 event: 42120 at 21-Jan-2026 02:09:50 GMT

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                      2.28804       37.497        262.001       16.7469       57.8997        20     
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                         0.0395385     0.674004       6.37875      0.225806       1.37014        20     
prod:emtrkmichelid:EmTrackMichelId              1.92562       38.3188       261.934       16.3079       59.1214        19     
[art]:TriggerResults:TriggerResultInserter    1.6881e-05    2.91036e-05   8.1802e-05    2.6971e-05    1.58814e-05      19     
end_path:myanalysis:MyAnalysis                0.000560333    0.441798       5.79388     0.00443898      1.28392        19     
================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 19688.6 MB
  Peak resident set size usage (VmHWM): 1117.69 MB
====================================================================================================
%MSG-s ArtException:  PostEndJob 21-Jan-2026 02:12:49 GMT ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- ScheduleExecutionFailure BEGIN
    Path: ProcessingStopped.
    ---- FileReadError BEGIN
      ---- FatalRootError BEGIN
        Fatal Root Error: TNetXNGFile::TNetXNGFile
        The remote file is not open
        ROOT severity: 3000
      ---- FatalRootError END
      
      The above exception was thrown while processing module EmTrackMichelId/emtrkmichelid run: 6405486 subRun: 1 event: 42120
    ---- FileReadError END
    Exception going through path prod
  ---- ScheduleExecutionFailure END
---- EventProcessorFailure END
---- FatalRootError BEGIN
  Fatal Root Error: TNetXNGFile::Close
  [FATAL] Hand shake failed
  ROOT severity: 3000
---- FatalRootError END
---- FatalRootError BEGIN
  Fatal Root Error: TNetXNGFile::Close
  [FATAL] Hand shake failed
  ROOT severity: 3000
---- FatalRootError END
%MSG
=== End last 100 lines of lar log file ===
.:
total 68
-rw-r--r--. 1 dune000 dune 48418 Jan 21 02:12 atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6405486_421_20231202T113536Z_gen_g4_detsim_hitreco__20240507T214811Z_reco2_ana_2026-01-21T_014218Z.root
-rw-r--r--. 1 dune000 dune  7321 Jan 21 02:12 atmnu_max_weighted_randompolicy_dune10kt_1x2x6_6405486_421_20231202T113536Z_gen_g4_detsim_hitreco__20240507T214811Z_reco2_reco_2026-01-21T_014218Z.log
-rw-r--r--. 1 dune000 dune  6729 Jan 21 02:12 jobscript.log
-rw-r--r--. 1 dune000 dune   137 Jan 21 01:42 all-input-dids.txt
-rw-r--r--. 1 dune000 dune     0 Jan 21 01:42 debugprod.log
justIN time: 2026-02-04 04:29:29 UTC       justIN version: 01.06.00