Jobsub ID 19336.28@dunegpschedd01.fnal.gov
Jobsub ID | 19336.28@dunegpschedd01.fnal.gov |
Workflow ID | 150 |
Stage ID | 1 |
User name | lwhite86@fnal.gov |
HTCondor Group | group_dune |
Requested | Processors | 1 |
GPU | No |
RSS bytes | 4194304000 (4000 MiB) |
Wall seconds limit | 3600 (1 hour) |
Submitted time | 2025-07-30 12:10:35 |
Site | UK_Edinburgh |
Entry | DUNE_UK_SGridECDF_ce1_multicore |
Last heartbeat | 2025-07-30 12:11:59 |
From worker node | Hostname | node2b23.ecdf.ed.ac.uk |
cpuinfo | Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz |
OS release | Scientific Linux release 7.9 (Nitrogen) |
Processors | 1 |
RSS bytes | 4194304000 (4000 MiB) |
Wall seconds limit | 171000 (47 hours) |
GPU | |
Inner Apptainer? | True |
Job state | jobscript_error |
Started | 2025-07-30 12:11:13 |
Input files | fardet-hd:nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root |
Jobscript | Exit code | 20 |
Real time | 0m (0s) |
CPU time | 0m (0s = 0%) |
Max RSS bytes | 0 (0 MiB) |
Outputting started | |
Output files | |
Finished | 2025-07-30 12:11:59 |
Saved logs | justin-logs:19336.28-dunegpschedd01.fnal.gov.logs.tgz |
Jobscript log (last 10,000 characters)
syncSock ][ 1147] [xrootd.pic.es:1094.0] Socket error while handshaking: [FATAL] TLS error: resource temporarily unavailable
[2025-07-30 13:11:44.047531 +0100][Debug ][AsyncSock ][ 1147] [xrootd.pic.es:1094.0] Closing the socket
[2025-07-30 13:11:44.047537 +0100][Debug ][Poller ][ 1147] <[::ffff:192.41.105.56]:50948><--><[::ffff:193.109.172.155]:1094> Removing socket from the poller
[2025-07-30 13:11:44.047571 +0100][Error ][PostMaster ][ 1147] [xrootd.pic.es:1094] elapsed = 1, pConnectionWindow = 30 seconds.
[2025-07-30 13:11:44.047592 +0100][Error ][PostMaster ][ 1147] [xrootd.pic.es:1094] Unable to recover: [FATAL] TLS error: resource temporarily unavailable.
[2025-07-30 13:11:44.047617 +0100][Error ][XRootD ][ 1147] [xrootd.pic.es:1094] Impossible to send message kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ). Trying to recover.
[2025-07-30 13:11:44.047631 +0100][Debug ][XRootD ][ 1147] [xrootd.pic.es:1094] Handling error while processing kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ): [FATAL] TLS error: resource temporarily unavailable.
[2025-07-30 13:11:44.047655 +0100][Info ][XRootD ][ 1147] [xrootd.pic.es:1094] Retrying request: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ).
[2025-07-30 13:11:44.047775 +0100][Debug ][ExDbgMsg ][ 1147] [xrootd.pic.es:1094] Retry at server MsgHandler: 0xa5d7620 (message: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root?tried=xrootd.pic.es&triedrc=srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ) ).
[2025-07-30 13:11:44.047787 +0100][Debug ][XRootD ][ 1147] [xrootd.pic.es:1094] Handling error while processing kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root?tried=xrootd.pic.es&triedrc=srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ): [FATAL] TLS error: resource temporarily unavailable.
[2025-07-30 13:11:44.047792 +0100][Info ][XRootD ][ 1147] [xrootd.pic.es:1094] Retrying request: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root?tried=xrootd.pic.es&triedrc=srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ).
[2025-07-30 13:11:44.047836 +0100][Debug ][ExDbgMsg ][ 1147] [xrootd.pic.es:1094] Retry at server MsgHandler: 0xa5d7620 (message: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root?tried=xrootd.pic.es,xrootd.pic.es&triedrc=srverr,srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ) ).
[2025-07-30 13:11:44.047843 +0100][Debug ][XRootD ][ 1147] [xrootd.pic.es:1094] Handling error while processing kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root?tried=xrootd.pic.es,xrootd.pic.es&triedrc=srverr,srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ): [FATAL] TLS error: resource temporarily unavailable.
[2025-07-30 13:11:44.047848 +0100][Info ][XRootD ][ 1147] [xrootd.pic.es:1094] Retrying request: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root?tried=xrootd.pic.es,xrootd.pic.es&triedrc=srverr,srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ).
[2025-07-30 13:11:44.047889 +0100][Debug ][ExDbgMsg ][ 1147] [xrootd.pic.es:1094] Retry at server MsgHandler: 0xa5d7620 (message: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root?tried=xrootd.pic.es,xrootd.pic.es,xrootd.pic.es&triedrc=srverr,srverr,srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ) ).
[2025-07-30 13:11:44.047901 +0100][Debug ][XRootD ][ 1147] [xrootd.pic.es:1094] Handling error while processing kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root?tried=xrootd.pic.es,xrootd.pic.es,xrootd.pic.es&triedrc=srverr,srverr,srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ): [FATAL] TLS error: resource temporarily unavailable.
[2025-07-30 13:11:44.047925 +0100][Debug ][ExDbgMsg ][ 1147] [xrootd.pic.es:1094] Passing to the thread-pool MsgHandler: 0xa5d7620 (message: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root?tried=xrootd.pic.es,xrootd.pic.es,xrootd.pic.es&triedrc=srverr,srverr,srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ) ).
[2025-07-30 13:11:44.048084 +0100][Debug ][ExDbgMsg ][ 1147] [xrootd.pic.es:1094] Calling MsgHandler: 0xa5d7620 (message: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root?tried=xrootd.pic.es,xrootd.pic.es,xrootd.pic.es&triedrc=srverr,srverr,srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ) ) with status: [FATAL] TLS error: resource temporarily unavailable.
[2025-07-30 13:11:44.048164 +0100][Debug ][File ][ 1147] [0xcd6c3b0@root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root?xrdcl.requuid=1b5368e7-56c9-4040-b507-5181ae9cce9c] Open has returned with status [FATAL] TLS error: resource temporarily unavailable
[2025-07-30 13:11:44.048197 +0100][Debug ][File ][ 1147] [0xcd6c3b0@root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root?xrdcl.requuid=1b5368e7-56c9-4040-b507-5181ae9cce9c] Error while opening at xrootd.pic.es:1094: [FATAL] TLS error: resource temporarily unavailable
[2025-07-30 13:11:44.048225 +0100][Debug ][Utility ][ 1147] Monitor library name not set. No monitoring
[2025-07-30 13:11:44.048275 +0100][Debug ][XRootD ][ 1147] Redirect trace-back:
[2025-07-30 13:11:44.048275 +0100][Debug ][XRootD ][ 1147] 0. Retrying: root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root
[2025-07-30 13:11:44.048275 +0100][Debug ][XRootD ][ 1147] 1. Retrying: root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root
[2025-07-30 13:11:44.048275 +0100][Debug ][XRootD ][ 1147] 2. Retrying: root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root
[2025-07-30 13:11:44.048283 +0100][Debug ][ExDbgMsg ][ 1147] [xrootd.pic.es:1094] Destroying MsgHandler: 0xa5d7620.
MultiPandoraApiImpl::DeletePandoraInstances - unable to find daughter instances associated with primary 0
%MSG-s ArtException: TriggerResultInserter:TriggerResults@Construction 30-Jul-2025 13:11:44 BST ModuleConstruction
cet::exception caught in art
---- FileOpenError BEGIN
---- FatalRootError BEGIN
Fatal Root Error: TNetXNGFile::Open
[FATAL] TLS error: resource temporarily unavailable
ROOT severity: 3000
---- FatalRootError END
RootInputFileSequence::initFile(): Input file root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/fardet-hd/6f/82/nutau_dune10kt_1x2x6_1437_293_20230828T143335Z_gen_g4_detsim_hitreco__20240219T183401Z_reco2.root was not found or could not be opened.
---- FileOpenError END
%MSG
Art has completed and will exit with status 20.
[2025-07-30 13:11:44.266490 +0100][Debug ][JobMgr ][ 1147] Stopping the job manager...
[2025-07-30 13:11:44.267505 +0100][Debug ][JobMgr ][ 1147] Job manager stopped
[2025-07-30 13:11:44.267957 +0100][Debug ][TaskMgr ][ 1147] Stopping the task manager...
[2025-07-30 13:11:44.268042 +0100][Debug ][TaskMgr ][ 1147] Task manager stopped
[2025-07-30 13:11:44.268266 +0100][Debug ][Poller ][ 1147] Stopping the poller...
[2025-07-30 13:11:44.268454 +0100][Debug ][AsyncSock ][ 1147] [xrootd.pic.es:1094.0] Closing the socket
[2025-07-30 13:11:44.268577 +0100][Debug ][PostMaster ][ 1147] [xrootd.pic.es:1094] Destroying stream
[2025-07-30 13:11:44.268693 +0100][Debug ][AsyncSock ][ 1147] [xrootd.pic.es:1094.0] Closing the socket
lar exit code 20
mv: cannot stat 'eventClassificationTraining.root': No such file or directory
total 88
-rw-r--r-- 1 gl05pi6 eddie_users 216 Jul 30 13:11 all-input-dids.txt
-rw-r--r-- 1 gl05pi6 eddie_users 0 Jul 30 13:11 debugprod.log
-rw-r--r-- 1 gl05pi6 eddie_users 70899 Jul 30 13:11 jobscript.log
-rw-r--r-- 1 gl05pi6 eddie_users 166 Jul 30 13:11 justin-processed-pfns.txt
drwxr-xr-x 4 gl05pi6 eddie_users 4096 Jul 30 13:11 larpandoracontent
-rw-r--r-- 1 gl05pi6 eddie_users 519 Jul 30 13:11 reco2_hist.root