justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 13383.51@dunegpschedd02.fnal.gov

Jobsub ID13383.51@dunegpschedd02.fnal.gov
Workflow ID176
Stage ID1
User namelwhite86@fnal.gov
HTCondor Groupgroup_dune
RequestedProcessors1
GPUNo
RSS bytes4194304000 (4000 MiB)
Wall seconds limit3600 (1 hours)
Submitted time2025-07-31 16:38:05
SiteUK_Edinburgh
EntryDUNE_UK_SGridECDF_ce1_multicore
Last heartbeat2025-07-31 16:39:43
From worker nodeHostnamenode2b08.ecdf.ed.ac.uk
cpuinfoIntel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit171000 (47 hours)
GPU
Inner Apptainer?True
Job statejobscript_error
Started2025-07-31 16:39:10
Input filesfardet-hd:anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root
JobscriptExit code20
Real time0m (0s)
CPU time0m (0s = 0%)
Max RSS bytes0 (0 MiB)
Outputting started 
Output files
Finished2025-07-31 16:39:43
Saved logsjustin-logs:13383.51-dunegpschedd02.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

1147] [xrootd.pic.es:1094.0] Socket error while handshaking: [FATAL] TLS error: resource temporarily unavailable
[2025-07-31 17:39:41.770099 +0100][Debug  ][AsyncSock         ][ 1147] [xrootd.pic.es:1094.0] Closing the socket
[2025-07-31 17:39:41.770107 +0100][Debug  ][Poller            ][ 1147] <[::ffff:192.41.105.41]:51032><--><[::ffff:193.109.172.130]:1094> Removing socket from the poller
[2025-07-31 17:39:41.770160 +0100][Error  ][PostMaster        ][ 1147] [xrootd.pic.es:1094] elapsed = 0, pConnectionWindow = 30 seconds.
[2025-07-31 17:39:41.770181 +0100][Error  ][PostMaster        ][ 1147] [xrootd.pic.es:1094] Unable to recover: [FATAL] TLS error: resource temporarily unavailable.
[2025-07-31 17:39:41.770200 +0100][Error  ][XRootD            ][ 1147] [xrootd.pic.es:1094] Impossible to send message kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ). Trying to recover.
[2025-07-31 17:39:41.770213 +0100][Debug  ][XRootD            ][ 1147] [xrootd.pic.es:1094] Handling error while processing kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ): [FATAL] TLS error: resource temporarily unavailable.
[2025-07-31 17:39:41.770235 +0100][Info   ][XRootD            ][ 1147] [xrootd.pic.es:1094] Retrying request: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ).
[2025-07-31 17:39:41.770324 +0100][Debug  ][ExDbgMsg          ][ 1147] [xrootd.pic.es:1094] Retry at server MsgHandler: 0xaa80f90 (message: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root?tried=xrootd.pic.es&triedrc=srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ) ).
[2025-07-31 17:39:41.770334 +0100][Debug  ][XRootD            ][ 1147] [xrootd.pic.es:1094] Handling error while processing kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root?tried=xrootd.pic.es&triedrc=srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ): [FATAL] TLS error: resource temporarily unavailable.
[2025-07-31 17:39:41.770339 +0100][Info   ][XRootD            ][ 1147] [xrootd.pic.es:1094] Retrying request: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root?tried=xrootd.pic.es&triedrc=srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ).
[2025-07-31 17:39:41.770383 +0100][Debug  ][ExDbgMsg          ][ 1147] [xrootd.pic.es:1094] Retry at server MsgHandler: 0xaa80f90 (message: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root?tried=xrootd.pic.es,xrootd.pic.es&triedrc=srverr,srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ) ).
[2025-07-31 17:39:41.770390 +0100][Debug  ][XRootD            ][ 1147] [xrootd.pic.es:1094] Handling error while processing kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root?tried=xrootd.pic.es,xrootd.pic.es&triedrc=srverr,srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ): [FATAL] TLS error: resource temporarily unavailable.
[2025-07-31 17:39:41.770396 +0100][Info   ][XRootD            ][ 1147] [xrootd.pic.es:1094] Retrying request: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root?tried=xrootd.pic.es,xrootd.pic.es&triedrc=srverr,srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ).
[2025-07-31 17:39:41.770447 +0100][Debug  ][ExDbgMsg          ][ 1147] [xrootd.pic.es:1094] Retry at server MsgHandler: 0xaa80f90 (message: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root?tried=xrootd.pic.es,xrootd.pic.es,xrootd.pic.es&triedrc=srverr,srverr,srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ) ).
[2025-07-31 17:39:41.770461 +0100][Debug  ][XRootD            ][ 1147] [xrootd.pic.es:1094] Handling error while processing kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root?tried=xrootd.pic.es,xrootd.pic.es,xrootd.pic.es&triedrc=srverr,srverr,srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ): [FATAL] TLS error: resource temporarily unavailable.
[2025-07-31 17:39:41.770483 +0100][Debug  ][ExDbgMsg          ][ 1147] [xrootd.pic.es:1094] Passing to the thread-pool MsgHandler: 0xaa80f90 (message: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root?tried=xrootd.pic.es,xrootd.pic.es,xrootd.pic.es&triedrc=srverr,srverr,srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ) ).
[2025-07-31 17:39:41.770520 +0100][Debug  ][ExDbgMsg          ][ 1147] [xrootd.pic.es:1094] Calling MsgHandler: 0xaa80f90 (message: kXR_open (file: pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root?tried=xrootd.pic.es,xrootd.pic.es,xrootd.pic.es&triedrc=srverr,srverr,srverr, mode: 00, flags: kXR_open_read kXR_async kXR_retstat ) ) with status: [FATAL] TLS error: resource temporarily unavailable.
[2025-07-31 17:39:41.770563 +0100][Debug  ][File              ][ 1147] [0xc177c90@root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root?xrdcl.requuid=148634c2-5721-471f-b76a-1a09d41ac558] Open has returned with status [FATAL] TLS error: resource temporarily unavailable
[2025-07-31 17:39:41.770587 +0100][Debug  ][File              ][ 1147] [0xc177c90@root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root?xrdcl.requuid=148634c2-5721-471f-b76a-1a09d41ac558] Error while opening at xrootd.pic.es:1094: [FATAL] TLS error: resource temporarily unavailable
[2025-07-31 17:39:41.770608 +0100][Debug  ][Utility           ][ 1147] Monitor library name not set. No monitoring
[2025-07-31 17:39:41.770650 +0100][Debug  ][XRootD            ][ 1147] Redirect trace-back:
[2025-07-31 17:39:41.770650 +0100][Debug  ][XRootD            ][ 1147] 	0. Retrying: root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root
[2025-07-31 17:39:41.770650 +0100][Debug  ][XRootD            ][ 1147] 	1. Retrying: root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root
[2025-07-31 17:39:41.770650 +0100][Debug  ][XRootD            ][ 1147] 	2. Retrying: root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root
[2025-07-31 17:39:41.770658 +0100][Debug  ][ExDbgMsg          ][ 1147] [xrootd.pic.es:1094] Destroying MsgHandler: 0xaa80f90.
MultiPandoraApiImpl::DeletePandoraInstances - unable to find daughter instances associated with primary 0
%MSG-s ArtException:  TriggerResultInserter:TriggerResults@Construction  31-Jul-2025 17:39:41 BST ModuleConstruction
cet::exception caught in art
---- FileOpenError BEGIN
  ---- FatalRootError BEGIN
    Fatal Root Error: TNetXNGFile::Open
    [FATAL] TLS error: resource temporarily unavailable
    ROOT severity: 3000
  ---- FatalRootError END
  
  RootInputFileSequence::initFile(): Input file root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/fardet-hd/c5/a9/anutau_dune10kt_1x2x6_1131_107_20230828T210301Z_gen_g4_detsim_hitreco__20240220T190510Z_reco2.root was not found or could not be opened.
---- FileOpenError END
%MSG
Art has completed and will exit with status 20.
[2025-07-31 17:39:41.975347 +0100][Debug  ][JobMgr            ][ 1147] Stopping the job manager...
[2025-07-31 17:39:41.976471 +0100][Debug  ][JobMgr            ][ 1147] Job manager stopped
[2025-07-31 17:39:41.977202 +0100][Debug  ][TaskMgr           ][ 1147] Stopping the task manager...
[2025-07-31 17:39:41.977464 +0100][Debug  ][TaskMgr           ][ 1147] Task manager stopped
[2025-07-31 17:39:41.977473 +0100][Debug  ][Poller            ][ 1147] Stopping the poller...
[2025-07-31 17:39:41.977714 +0100][Debug  ][AsyncSock         ][ 1147] [xrootd.pic.es:1094.0] Closing the socket
[2025-07-31 17:39:41.977920 +0100][Debug  ][PostMaster        ][ 1147] [xrootd.pic.es:1094] Destroying stream
[2025-07-31 17:39:41.978054 +0100][Debug  ][AsyncSock         ][ 1147] [xrootd.pic.es:1094.0] Closing the socket
lar exit code 20
mv: cannot stat 'eventClassificationTraining.root': No such file or directory
total 88
-rw-r--r-- 1 gl05pi6 eddie_users   218 Jul 31 17:39 all-input-dids.txt
-rw-r--r-- 1 gl05pi6 eddie_users     0 Jul 31 17:39 debugprod.log
-rw-r--r-- 1 gl05pi6 eddie_users 70923 Jul 31 17:39 jobscript.log
-rw-r--r-- 1 gl05pi6 eddie_users   167 Jul 31 17:39 justin-processed-pfns.txt
drwxr-xr-x 4 gl05pi6 eddie_users  4096 Jul 31 17:39 larpandoracontent
-rw-r--r-- 1 gl05pi6 eddie_users   519 Jul 31 17:39 reco2_hist.root
justIN time: 2025-08-04 14:16:46 UTC       justIN version: 01.04.00