justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 301997.0@dunegpschedd01.fnal.gov

Jobsub ID301997.0@dunegpschedd01.fnal.gov
Workflow ID12500
Stage ID1
User nameamoor@fnal.gov
RequestedProcessors1
GPUNo
RSS bytes6291456000 (6000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2026-01-28 17:10:54
SiteUK_Edinburgh
EntryDUNE_UK_SGridECDF_ce1_multicore
Last heartbeat2026-01-28 17:13:04
From worker nodeHostnamenode2b07.ecdf.ed.ac.uk
cpuinfoIntel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes7864320000 (7500 MiB)
Wall seconds limit171000 (47 hours)
GPU
Inner Apptainer?True
Job statejobscript_error
Started2026-01-28 17:12:02
Input filesmonte-carlo-012500-000002
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Max RSS bytes0 (0 MiB)
Outputting started 
Output files
Finished2026-01-28 17:13:04
Saved logsjustin-logs:301997.0-dunegpschedd01.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

ing the socket
[2026-01-28 17:12:05.856244 +0000][Debug  ][Poller            ] <[::ffff:192.41.105.40]:52022><--><[::ffff:131.225.69.39]:1094> Removing socket from the poller
[2026-01-28 17:12:05.856389 +0000][Debug  ][PostMaster        ] [fndcadoor.fnal.gov:1094] Destroying stream
[2026-01-28 17:12:05.856399 +0000][Debug  ][AsyncSock         ] [fndcadoor.fnal.gov:1094.0] Closing the socket
[2026-01-28 17:12:05.856413 +0000][Debug  ][AsyncSock         ] [stkendca2224.fnal.gov:20256.0] Closing the socket
[2026-01-28 17:12:05.856419 +0000][Debug  ][Poller            ] <[::ffff:192.41.105.40]:37614><--><[::ffff:131.225.69.189]:20256> Removing socket from the poller
[2026-01-28 17:12:05.856433 +0000][Debug  ][PostMaster        ] [stkendca2224.fnal.gov:20256] Destroying stream
[2026-01-28 17:12:05.856438 +0000][Debug  ][AsyncSock         ] [stkendca2224.fnal.gov:20256.0] Closing the socket
input_pfn file = root://fndca1.fnal.gov:1094//pnfs/fnal.gov/usr/dune/scratch/users/amoor/fnal/10987/1/001/000817_reco_data_2025-12-01T_121242Z_reco_data_2025-12-03T_155731Z_reco_data_2025-12-03T_173658Z_reco_data_2025-12-03T_194705Z_reco_data_2025-12-04T_103151Z.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_5_5a/Linux64bit+3.10-2.17-e26-p3915-prof/lib/libXrdPosixPreload.so
=== Start last 50 lines of lar log file ===
[2026-01-28 17:12:46.050254 +0000][Debug  ][ExDbgMsg          ][ 1161] [stkendca2004.fnal.gov:23244] Destroying MsgHandler: 0xc2a02d0.
[2026-01-28 17:12:46.050532 +0000][Debug  ][File              ][ 1161] [0xcfb11c0@root://fndca1.fnal.gov:1094//pnfs/fnal.gov/usr/dune/scratch/users/amoor/fnal/10987/1/001/000817_reco_data_2025-12-01T_121242Z_reco_data_2025-12-03T_155731Z_reco_data_2025-12-03T_173658Z_reco_data_2025-12-03T_194705Z_reco_data_2025-12-04T_103151Z.root?xrdcl.requuid=dca543dc-e0b0-4fea-98df-1d7312cae73d] Sending a read command for handle 0x0 to stkendca2004.fnal.gov:23244
[2026-01-28 17:12:46.050568 +0000][Debug  ][ExDbgMsg          ][ 1161] [stkendca2004.fnal.gov:23244] MsgHandler created: 0xbd64260 (message: kXR_read (handle: 0x00000000, offset: 18579, size: 122) ).
[2026-01-28 17:12:46.050631 +0000][Debug  ][ExDbgMsg          ][ 1161] [stkendca2004.fnal.gov:23244] Moving MsgHandler: 0xbd64260 (message: kXR_read (handle: 0x00000000, offset: 18579, size: 122) ) from out-queu to in-queue.
[2026-01-28 17:12:46.175022 +0000][Debug  ][ExDbgMsg          ][ 1161] [msg: 0xb0964d0] Assigned MsgHandler: 0xbd64260.
[2026-01-28 17:12:46.175060 +0000][Debug  ][ExDbgMsg          ][ 1161] [handler: 0xbd64260] Removed MsgHandler: 0xbd64260 from the in-queue.
[2026-01-28 17:12:46.175096 +0000][Debug  ][ExDbgMsg          ][ 1161] [stkendca2004.fnal.gov:23244] Calling MsgHandler: 0xbd64260 (message: kXR_read (handle: 0x00000000, offset: 18579, size: 122) ) with status: [SUCCESS] .
[2026-01-28 17:12:46.175115 +0000][Debug  ][ExDbgMsg          ][ 1161] [stkendca2004.fnal.gov:23244] Destroying MsgHandler: 0xbd64260.
28-Jan-2026 17:12:46 GMT  Opened output file with pattern "000002_reco_data_2026-01-28T_171207Z.root"
[2026-01-28 17:12:46.724036 +0000][Debug  ][File              ][ 1161] [0xcfb11c0@root://fndca1.fnal.gov:1094//pnfs/fnal.gov/usr/dune/scratch/users/amoor/fnal/10987/1/001/000817_reco_data_2025-12-01T_121242Z_reco_data_2025-12-03T_155731Z_reco_data_2025-12-03T_173658Z_reco_data_2025-12-03T_194705Z_reco_data_2025-12-04T_103151Z.root?xrdcl.requuid=dca543dc-e0b0-4fea-98df-1d7312cae73d] Sending a close command for handle 0x0 to stkendca2004.fnal.gov:23244
[2026-01-28 17:12:46.724093 +0000][Debug  ][ExDbgMsg          ][ 1161] [stkendca2004.fnal.gov:23244] MsgHandler created: 0xb594a40 (message: kXR_close (handle: 0x00000000) ).
[2026-01-28 17:12:46.724188 +0000][Debug  ][ExDbgMsg          ][ 1161] [stkendca2004.fnal.gov:23244] Moving MsgHandler: 0xb594a40 (message: kXR_close (handle: 0x00000000) ) from out-queu to in-queue.
[2026-01-28 17:12:46.848906 +0000][Debug  ][ExDbgMsg          ][ 1161] [msg: 0xc353d00] Assigned MsgHandler: 0xb594a40.
[2026-01-28 17:12:46.848943 +0000][Debug  ][ExDbgMsg          ][ 1161] [handler: 0xb594a40] Removed MsgHandler: 0xb594a40 from the in-queue.
[2026-01-28 17:12:46.848980 +0000][Debug  ][ExDbgMsg          ][ 1161] [stkendca2004.fnal.gov:23244] Calling MsgHandler: 0xb594a40 (message: kXR_close (handle: 0x00000000) ) with status: [SUCCESS] .
[2026-01-28 17:12:46.849009 +0000][Debug  ][File              ][ 1161] [0xcfb11c0@root://fndca1.fnal.gov:1094//pnfs/fnal.gov/usr/dune/scratch/users/amoor/fnal/10987/1/001/000817_reco_data_2025-12-01T_121242Z_reco_data_2025-12-03T_155731Z_reco_data_2025-12-03T_173658Z_reco_data_2025-12-03T_194705Z_reco_data_2025-12-04T_103151Z.root?xrdcl.requuid=dca543dc-e0b0-4fea-98df-1d7312cae73d] Close returned from stkendca2004.fnal.gov:23244 with: [SUCCESS] 
[2026-01-28 17:12:46.849032 +0000][Debug  ][ExDbgMsg          ][ 1161] [stkendca2004.fnal.gov:23244] Destroying MsgHandler: 0xb594a40.
28-Jan-2026 17:12:46 GMT  Closed input file "root://fndca1.fnal.gov:1094//pnfs/fnal.gov/usr/dune/scratch/users/amoor/fnal/10987/1/001/000817_reco_data_2025-12-01T_121242Z_reco_data_2025-12-03T_155731Z_reco_data_2025-12-03T_173658Z_reco_data_2025-12-03T_194705Z_reco_data_2025-12-04T_103151Z.root"

====================================================================================================================
TimeTracker printout (sec)            Min           Avg           Max         Median          RMS         nEvts   
====================================================================================================================
[ No processed events ]
====================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 2149.47 MB
  Peak resident set size usage (VmHWM): 1036.22 MB
====================================================================================================
%MSG-s ArtException:  PostEndJob 28-Jan-2026 17:12:46 GMT ModuleEndJob
---- CatalogServiceError BEGIN
  Input file not found: 000002.
---- CatalogServiceError END
%MSG
Art has completed and will exit with status 1.
[2026-01-28 17:12:47.154714 +0000][Debug  ][JobMgr            ][ 1161] Stopping the job manager...
[2026-01-28 17:12:47.155666 +0000][Debug  ][JobMgr            ][ 1161] Job manager stopped
[2026-01-28 17:12:47.156123 +0000][Debug  ][TaskMgr           ][ 1161] Stopping the task manager...
[2026-01-28 17:12:47.156186 +0000][Debug  ][TaskMgr           ][ 1161] Task manager stopped
[2026-01-28 17:12:47.156315 +0000][Debug  ][Poller            ][ 1161] Stopping the poller...
[2026-01-28 17:12:47.156507 +0000][Debug  ][AsyncSock         ][ 1161] [fndca1.fnal.gov:1094.0] Closing the socket
[2026-01-28 17:12:47.156536 +0000][Debug  ][Poller            ][ 1161] <[::ffff:192.41.105.40]:41022><--><[::ffff:131.225.69.121]:1094> Removing socket from the poller
[2026-01-28 17:12:47.156846 +0000][Debug  ][PostMaster        ][ 1161] [fndca1.fnal.gov:1094] Destroying stream
[2026-01-28 17:12:47.156868 +0000][Debug  ][AsyncSock         ][ 1161] [fndca1.fnal.gov:1094.0] Closing the socket
[2026-01-28 17:12:47.156907 +0000][Debug  ][AsyncSock         ][ 1161] [stkendca2004.fnal.gov:23244.0] Closing the socket
[2026-01-28 17:12:47.156924 +0000][Debug  ][Poller            ][ 1161] <[::ffff:192.41.105.40]:42788><--><[::ffff:131.225.69.128]:23244> Removing socket from the poller
[2026-01-28 17:12:47.156957 +0000][Debug  ][PostMaster        ][ 1161] [stkendca2004.fnal.gov:23244] Destroying stream
[2026-01-28 17:12:47.156973 +0000][Debug  ][AsyncSock         ][ 1161] [stkendca2004.fnal.gov:23244.0] Closing the socket
=== End last 50 lines of lar log file ===
lar exit code 0
Traceback (most recent call last):
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v10_11_00d01/bin/extractor_prod.py", line 434, in <module>
    main()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v10_11_00d01/bin/extractor_prod.py", line 373, in main
    mddict = expSpecificMetadata.getmetadata()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v10_11_00d01/bin/extractor_prod.py", line 344, in getmetadata
    jobt = self.get_job(proc)
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v10_11_00d01/bin/extractor_prod.py", line 69, in get_job
    raise RuntimeError('sam_metadata_dumper returned nonzero exit status {}.'.format(rc))
RuntimeError: sam_metadata_dumper returned nonzero exit status 1.
extractor_prod.py exit code 1
Error reading metadata from file: Expecting value: line 1 column 1 (char 0)
pdjson2metadata exit code 1
.:
total 520
-rw-r--r-- 1 gl05pi8 eddie_users 251000 Jan 28 17:12 xrootdfiles.list
-rw-r--r-- 1 gl05pi8 eddie_users 123453 Jan 28 17:12 000002_reco_2026-01-28T_171207Z.log
-rw-r--r-- 1 gl05pi8 eddie_users  82651 Jan 28 17:12 RootOutput-93c4-dd01-ff3f-36ba.root
-rw-r--r-- 1 gl05pi8 eddie_users  26079 Jan 28 17:12 jobscript.log
-rw-r--r-- 1 gl05pi8 eddie_users  11549 Jan 28 17:12 PandoraSettings_Cheat.xml
-rw-r--r-- 1 gl05pi8 eddie_users   7560 Jan 28 17:12 reco2_hist.root
-rw-r--r-- 1 gl05pi8 eddie_users   2086 Jan 28 17:12 PandoraSettings_Cheating_Master_DUNEFD.xml
-rw-r--r-- 1 gl05pi8 eddie_users   1935 Jan 28 17:12 PandoraSettings_Slicing_Cheat.xml
-rw-r--r-- 1 gl05pi8 eddie_users    237 Jan 28 17:12 PandoraSettings_Cosmic_Cheat.xml
-rw-r--r-- 1 gl05pi8 eddie_users     52 Jan 28 17:12 all-input-dids.txt
-rw-r--r-- 1 gl05pi8 eddie_users      0 Jan 28 17:12 000002_reco_data_2026-01-28T_171207Z.root.ext.json
-rw-r--r-- 1 gl05pi8 eddie_users      0 Jan 28 17:12 000002_reco_data_2026-01-28T_171207Z.root.json
-rw-r--r-- 1 gl05pi8 eddie_users      0 Jan 28 17:12 debugprod.log
justIN time: 2026-02-04 04:28:18 UTC       justIN version: 01.06.00