Jobsub ID 291772.43@dunegpschedd02.fnal.gov

Jobsub ID: 291772.43@dunegpschedd02.fnal.gov
Workflow ID: 12140
Stage ID: 1
User name: galli@fnal.gov

Requested
  Processors: 1
  GPU: No
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 80000 (22 hours)

Submitted time: 2026-01-21 02:30:14
Site: UK_Durham
Entry: DUNE_UK_SGridDurham_ce3
Last heartbeat: 2026-01-21 02:35:41

From worker node
  Hostname: n163.dur.scotgrid.ac.uk
  cpuinfo: Intel(R) Xeon(R) Gold 5220 CPU @ 2.20GHz
  OS release: Scientific Linux release 7.9 (Nitrogen)
  Processors: 1
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 171000 (47 hours)
  GPU:
  Inner Apptainer? True

Job state: jobscript_error
Started: 2026-01-21 02:31:58
Input files: fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50642276_576_20231208T032038Z_gen_g4_detsim_hitreco__20240510T064031Z_reco2.root

Jobscript
  Exit code: 1
  Real time: 0m (0s)
  CPU time: 0m (0s = 0%)
  Max RSS bytes: 0 (0 MiB)

Outputting started:
Output files:
Finished: 2026-01-21 02:35:41
Saved logs: justin-logs:291772.43-dunegpschedd02.fnal.gov.logs.tgz

Jobscript log (last 10,000 characters)

Input PFN = root://xrootd1.esc.qmul.ac.uk:1094//dune/RSE/fardet-hd/33/75/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50642276_576_20231208T032038Z_gen_g4_detsim_hitreco__20240510T064031Z_reco2.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
Using custom sources from /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2

MRB_PROJECT=larsoft
MRB_PROJECT_VERSION=v09_91_04
MRB_QUALS=e26:prof
MRB_TOP=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft
MRB_SOURCE=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/srcs
MRB_BUILDDIR=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/build_slf7.x86_64
MRB_INSTALL=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof

PRODUCTS=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof:/cvmfs/dune.opensciencegrid.org/products/dune/testproducts:/cvmfs/dune.opensciencegrid.org/products/dune:/cvmfs/larsoft.opensciencegrid.org/products:/cvmfs/larsoft.opensciencegrid.org/packages:/cvmfs/fermilab.opensciencegrid.org/products/common/db/
CETPKG_INSTALL=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof

local product directory is /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/f2aa0933d6b0302a87f8b92969147394234f3bf2/larsoft/localProducts_larsoft_v09_91_04_e26_prof
----------- this block should be empty ------------------
---------------------------------------------------------
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_5_5a/Linux64bit+3.10-2.17-e26-p3915-prof/lib/libXrdPosixPreload.so
../justin-jobscript: line 77:  1261 Segmentation fault      (core dumped) lar -c $FCL_FILE $events_option $OUTPUT_CMD "$pfn" > ${fname}_reco_${now}.log 2>&1
lar exit code 139
=== Start last 100 lines of lar log file ===
Info in <TGeoManager::Import>: Reading geometry from file: /cvmfs/dune.opensciencegrid.org/products/dune/dunecore/v09_91_04d00/gdml/dune10kt_v5_refactored_1x2x6.gdml
Info in <TGeoManager::TGeoManager>: Geometry GDMLImport, Geometry imported from GDML created
Error: Unsupported GDML Tag Used :gdml_simple_extension. Please Check Geometry/Schema.
Error: Unsupported GDML Tag Used :extension. Please Check Geometry/Schema.
Info in <TGeoManager::SetTopVolume>: Top volume is volWorld. Master volume is volWorld
Info in <TGeoNavigator::BuildCache>: --- Maximum geometry depth set to 100
Info in <TGeoManager::CheckGeometry>: Fixing runtime shapes...
Info in <TGeoManager::CheckGeometry>: ...Nothing to fix
Info in <TGeoManager::CloseGeometry>: Counting nodes...
Info in <TGeoManager::Voxelize>: Voxelizing...
Info in <TGeoManager::CloseGeometry>: Building cache...
Info in <TGeoManager::CountLevels>: max level = 5, max placements = 1149
Info in <TGeoManager::CloseGeometry>: 67545 nodes/ 3831 volume UID's in Geometry imported from GDML
Info in <TGeoManager::CloseGeometry>: ----------------modeler ready----------------
registering to  primaryGeneratorActionsMap_
registering to   eventActionsMap_
registering to   trackingActionsMap_
registering to  steppingActionsMap_
Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_U (Potential memory leak).
Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_V (Potential memory leak).
Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_Y (Potential memory leak).
Reading model from /cvmfs/dune.opensciencegrid.org/products/dune/dune_pardata/v01_84_00/CnnModels/cnn_ndkemtrk_pitch_5_wire_44_drift_48_down_6_mean_notes_AtmAndNdk.nnet
Layers 12
Layer 0 Convolution2D
LayerConv2D 48x1x5x5 border_mode valid
Layer 1 Activation
Activation type relu
Layer 2 Dropout
Layer 3 Flatten
Layer 4 Dense
weights 84480
bias 128
Layer 5 Activation
Activation type tanh
Layer 6 Dropout
Layer 7 Dense
weights 128
bias 32
Layer 8 Activation
Activation type tanh
Layer 9 Dropout
Layer 10 Dense
weights 32
bias 4
Layer 11 Activation
Activation type sigmoid
21-Jan-2026 02:32:24 GMT  Initiating request to open input file "root://xrootd1.esc.qmul.ac.uk:1094//dune/RSE/fardet-hd/33/75/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50642276_576_20231208T032038Z_gen_g4_detsim_hitreco__20240510T064031Z_reco2.root"
21-Jan-2026 02:32:39 GMT  Opened input file "root://xrootd1.esc.qmul.ac.uk:1094//dune/RSE/fardet-hd/33/75/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50642276_576_20231208T032038Z_gen_g4_detsim_hitreco__20240510T064031Z_reco2.root"
Begin processing the 1st record. run: 50642276 subRun: 1 event: 57601 at 21-Jan-2026 02:33:39 GMT

====================================================================================================================
TimeTracker printout (sec)            Min           Avg           Max         Median          RMS         nEvts   
====================================================================================================================
[ No processed events ]
====================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 11840 MB
  Peak resident set size usage (VmHWM): 801.055 MB
====================================================================================================
%MSG-s ArtException:  PostEndJob 21-Jan-2026 02:35:23 GMT ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- ScheduleExecutionFailure BEGIN
    Path: ProcessingStopped.
    ---- FileReadError BEGIN
      ---- FatalRootError BEGIN
        Fatal Root Error: TNetXNGFile::TNetXNGFile
        The remote file is not open
        ROOT severity: 3000
      ---- FatalRootError END
      
      The above exception was thrown while processing module EmTrackMichelId/emtrkmichelid run: 50642276 subRun: 1 event: 57601
    ---- FileReadError END
    Exception going through path prod
  ---- ScheduleExecutionFailure END
---- EventProcessorFailure END
---- FatalRootError BEGIN
  Fatal Root Error: TNetXNGFile::Close
  [FATAL] Hand shake failed
  ROOT severity: 3000
---- FatalRootError END
---- FatalRootError BEGIN
  Fatal Root Error: TNetXNGFile::Close
  [FATAL] Hand shake failed
  ROOT severity: 3000
---- FatalRootError END
%MSG
=== End last 100 lines of lar log file ===
.:
total 36
-rw-r--r--. 1 dune004 dune 14789 Jan 21 02:35 atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50642276_576_20231208T032038Z_gen_g4_detsim_hitreco__20240510T064031Z_reco2_ana_2026-01-21T_023202Z.root
-rw-r--r--. 1 dune004 dune  6766 Jan 21 02:35 jobscript.log
-rw-r--r--. 1 dune004 dune  4548 Jan 21 02:35 atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50642276_576_20231208T032038Z_gen_g4_detsim_hitreco__20240510T064031Z_reco2_reco_2026-01-21T_023202Z.log
-rw-r--r--. 1 dune004 dune   138 Jan 21 02:31 all-input-dids.txt
-rw-r--r--. 1 dune004 dune     0 Jan 21 02:32 debugprod.log
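
The jobscript log above reports "lar exit code 139" next to a "Segmentation fault" message. By the usual shell convention, a status above 128 means the process was killed by signal number (status - 128), so 139 corresponds to signal 11. A minimal sketch of that arithmetic follows; the helper name describe_exit_code is hypothetical and is not part of justIN or lar.

```python
import signal


def describe_exit_code(code: int) -> str:
    """Describe a shell exit status (hypothetical helper, for illustration only).

    Shells conventionally report 128 + N when a command is killed by signal N.
    """
    if code > 128:
        signum = code - 128
        try:
            # Map the signal number to its symbolic name, e.g. 11 -> SIGSEGV.
            name = signal.Signals(signum).name
        except ValueError:
            # Signal number not known on this platform.
            name = f"signal {signum}"
        return f"terminated by {name}"
    return f"exited with status {code}"


# The value reported in the log above.
print(describe_exit_code(139))
```

On a Linux host this maps 139 to SIGSEGV, consistent with the segmentation fault recorded at line 77 of the jobscript. The jobscript's own exit code of 1 is a normal exit status, not a signal.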
justIN time: 2026-02-04 04:32:45 UTC       justIN version: 01.06.00