justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 19874.143@dunegpschedd01.fnal.gov

Jobsub ID19874.143@dunegpschedd01.fnal.gov
Workflow ID166
Stage ID1
User namehiguera@fnal.gov
HTCondor Groupgroup_dune.prod.mcsim
RequestedProcessors1
GPUNo
RSS bytes4194304000 (4000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2025-07-31 15:34:08
SiteUK_RAL-Tier1
EntryLIGO_UK_RAL_arc_ce05
Last heartbeat2025-07-31 16:27:51
From worker nodeHostnamedune001-7064962.0-lcg2669.gridpp.rl.ac.uk
cpuinfoAMD EPYC 7763 64-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit216000 (60 hours)
GPU
Inner Apptainer?True
Job statestalled
Started2025-07-31 16:27:40
Input files
Outputting started2025-07-31 16:27:51
Output files
Finished2025-07-31 16:57:20
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
Justin processors: 1
Will use justin-get-file
get_file receives:
DB connection lost and cannot reconnect: 'NoneType' object has no attribute 'ping'get-file fails with HTTP code 500 from allocator!
Could not get file
justIN time: 2025-08-04 16:30:37 UTC       justIN version: 01.04.00