justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 301997.2@dunegpschedd01.fnal.gov

Jobsub ID301997.2@dunegpschedd01.fnal.gov
Workflow ID12500
Stage ID1
User nameamoor@fnal.gov
RequestedProcessors1
GPUNo
RSS bytes6291456000 (6000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2026-01-28 17:10:54
SiteUK_Edinburgh
EntryDUNE_UK_SGridECDF_ce1_multicore
Last heartbeat2026-01-28 17:12:57
From worker nodeHostnamenode2b07.ecdf.ed.ac.uk
cpuinfoIntel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes7864320000 (7500 MiB)
Wall seconds limit171000 (47 hours)
GPU
Inner Apptainer?True
Job statejobscript_error
Started2026-01-28 17:11:44
Input filesmonte-carlo-012500-000001
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Max RSS bytes0 (0 MiB)
Outputting started 
Output files
Finished2026-01-28 17:12:57
Saved logsjustin-logs:301997.2-dunegpschedd01.fnal.gov.logs.tgz
List job events     Cached HTCondor job logs

Jobscript log (last 10,000 characters)

ing the socket
[2026-01-28 17:11:48.102047 +0000][Debug  ][Poller            ] <[::ffff:192.41.105.40]:55120><--><[::ffff:131.225.69.39]:1094> Removing socket from the poller
[2026-01-28 17:11:48.102163 +0000][Debug  ][PostMaster        ] [fndcadoor.fnal.gov:1094] Destroying stream
[2026-01-28 17:11:48.102171 +0000][Debug  ][AsyncSock         ] [fndcadoor.fnal.gov:1094.0] Closing the socket
[2026-01-28 17:11:48.102185 +0000][Debug  ][AsyncSock         ] [stkendca2224.fnal.gov:20256.0] Closing the socket
[2026-01-28 17:11:48.102191 +0000][Debug  ][Poller            ] <[::ffff:192.41.105.40]:47798><--><[::ffff:131.225.69.189]:20256> Removing socket from the poller
[2026-01-28 17:11:48.102203 +0000][Debug  ][PostMaster        ] [stkendca2224.fnal.gov:20256] Destroying stream
[2026-01-28 17:11:48.102208 +0000][Debug  ][AsyncSock         ] [stkendca2224.fnal.gov:20256.0] Closing the socket
input_pfn file = root://fndca1.fnal.gov:1094//pnfs/fnal.gov/usr/dune/scratch/users/amoor/fnal/10987/1/001/000681_reco_data_2025-12-01T_121113Z_reco_data_2025-12-03T_155338Z_reco_data_2025-12-03T_174157Z_reco_data_2025-12-03T_195711Z_reco_data_2025-12-04T_103152Z.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_5_5a/Linux64bit+3.10-2.17-e26-p3915-prof/lib/libXrdPosixPreload.so
=== Start last 50 lines of lar log file ===
[2026-01-28 17:12:38.616588 +0000][Debug  ][ExDbgMsg          ][ 1160] [stkendca2014.fnal.gov:23148] Destroying MsgHandler: 0xb6a7080.
[2026-01-28 17:12:38.616920 +0000][Debug  ][File              ][ 1160] [0xc82fcd0@root://fndca1.fnal.gov:1094//pnfs/fnal.gov/usr/dune/scratch/users/amoor/fnal/10987/1/001/000681_reco_data_2025-12-01T_121113Z_reco_data_2025-12-03T_155338Z_reco_data_2025-12-03T_174157Z_reco_data_2025-12-03T_195711Z_reco_data_2025-12-04T_103152Z.root?xrdcl.requuid=3b44b836-f2eb-4f23-85ad-24c6436701f4] Sending a read command for handle 0x0 to stkendca2014.fnal.gov:23148
[2026-01-28 17:12:38.616950 +0000][Debug  ][ExDbgMsg          ][ 1160] [stkendca2014.fnal.gov:23148] MsgHandler created: 0xa1a7a60 (message: kXR_read (handle: 0x00000000, offset: 18618, size: 122) ).
[2026-01-28 17:12:38.617011 +0000][Debug  ][ExDbgMsg          ][ 1160] [stkendca2014.fnal.gov:23148] Moving MsgHandler: 0xa1a7a60 (message: kXR_read (handle: 0x00000000, offset: 18618, size: 122) ) from out-queu to in-queue.
[2026-01-28 17:12:38.738177 +0000][Debug  ][ExDbgMsg          ][ 1160] [msg: 0x95772d0] Assigned MsgHandler: 0xa1a7a60.
[2026-01-28 17:12:38.738230 +0000][Debug  ][ExDbgMsg          ][ 1160] [handler: 0xa1a7a60] Removed MsgHandler: 0xa1a7a60 from the in-queue.
[2026-01-28 17:12:38.738283 +0000][Debug  ][ExDbgMsg          ][ 1160] [stkendca2014.fnal.gov:23148] Calling MsgHandler: 0xa1a7a60 (message: kXR_read (handle: 0x00000000, offset: 18618, size: 122) ) with status: [SUCCESS] .
[2026-01-28 17:12:38.738318 +0000][Debug  ][ExDbgMsg          ][ 1160] [stkendca2014.fnal.gov:23148] Destroying MsgHandler: 0xa1a7a60.
28-Jan-2026 17:12:38 GMT  Opened output file with pattern "000001_reco_data_2026-01-28T_171153Z.root"
[2026-01-28 17:12:39.612641 +0000][Debug  ][File              ][ 1160] [0xc82fcd0@root://fndca1.fnal.gov:1094//pnfs/fnal.gov/usr/dune/scratch/users/amoor/fnal/10987/1/001/000681_reco_data_2025-12-01T_121113Z_reco_data_2025-12-03T_155338Z_reco_data_2025-12-03T_174157Z_reco_data_2025-12-03T_195711Z_reco_data_2025-12-04T_103152Z.root?xrdcl.requuid=3b44b836-f2eb-4f23-85ad-24c6436701f4] Sending a close command for handle 0x0 to stkendca2014.fnal.gov:23148
[2026-01-28 17:12:39.612693 +0000][Debug  ][ExDbgMsg          ][ 1160] [stkendca2014.fnal.gov:23148] MsgHandler created: 0xb66af80 (message: kXR_close (handle: 0x00000000) ).
[2026-01-28 17:12:39.612778 +0000][Debug  ][ExDbgMsg          ][ 1160] [stkendca2014.fnal.gov:23148] Moving MsgHandler: 0xb66af80 (message: kXR_close (handle: 0x00000000) ) from out-queu to in-queue.
[2026-01-28 17:12:39.734172 +0000][Debug  ][ExDbgMsg          ][ 1160] [msg: 0xc82e110] Assigned MsgHandler: 0xb66af80.
[2026-01-28 17:12:39.734222 +0000][Debug  ][ExDbgMsg          ][ 1160] [handler: 0xb66af80] Removed MsgHandler: 0xb66af80 from the in-queue.
[2026-01-28 17:12:39.734260 +0000][Debug  ][ExDbgMsg          ][ 1160] [stkendca2014.fnal.gov:23148] Calling MsgHandler: 0xb66af80 (message: kXR_close (handle: 0x00000000) ) with status: [SUCCESS] .
[2026-01-28 17:12:39.734287 +0000][Debug  ][File              ][ 1160] [0xc82fcd0@root://fndca1.fnal.gov:1094//pnfs/fnal.gov/usr/dune/scratch/users/amoor/fnal/10987/1/001/000681_reco_data_2025-12-01T_121113Z_reco_data_2025-12-03T_155338Z_reco_data_2025-12-03T_174157Z_reco_data_2025-12-03T_195711Z_reco_data_2025-12-04T_103152Z.root?xrdcl.requuid=3b44b836-f2eb-4f23-85ad-24c6436701f4] Close returned from stkendca2014.fnal.gov:23148 with: [SUCCESS] 
[2026-01-28 17:12:39.734309 +0000][Debug  ][ExDbgMsg          ][ 1160] [stkendca2014.fnal.gov:23148] Destroying MsgHandler: 0xb66af80.
28-Jan-2026 17:12:39 GMT  Closed input file "root://fndca1.fnal.gov:1094//pnfs/fnal.gov/usr/dune/scratch/users/amoor/fnal/10987/1/001/000681_reco_data_2025-12-01T_121113Z_reco_data_2025-12-03T_155338Z_reco_data_2025-12-03T_174157Z_reco_data_2025-12-03T_195711Z_reco_data_2025-12-04T_103152Z.root"

====================================================================================================================
TimeTracker printout (sec)            Min           Avg           Max         Median          RMS         nEvts   
====================================================================================================================
[ No processed events ]
====================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 2149.47 MB
  Peak resident set size usage (VmHWM): 1033.06 MB
====================================================================================================
%MSG-s ArtException:  PostEndJob 28-Jan-2026 17:12:39 GMT ModuleEndJob
---- CatalogServiceError BEGIN
  Input file not found: 000001.
---- CatalogServiceError END
%MSG
Art has completed and will exit with status 1.
[2026-01-28 17:12:40.050369 +0000][Debug  ][JobMgr            ][ 1160] Stopping the job manager...
[2026-01-28 17:12:40.051479 +0000][Debug  ][JobMgr            ][ 1160] Job manager stopped
[2026-01-28 17:12:40.052208 +0000][Debug  ][TaskMgr           ][ 1160] Stopping the task manager...
[2026-01-28 17:12:40.052771 +0000][Debug  ][TaskMgr           ][ 1160] Task manager stopped
[2026-01-28 17:12:40.052814 +0000][Debug  ][Poller            ][ 1160] Stopping the poller...
[2026-01-28 17:12:40.052976 +0000][Debug  ][AsyncSock         ][ 1160] [fndca1.fnal.gov:1094.0] Closing the socket
[2026-01-28 17:12:40.053026 +0000][Debug  ][Poller            ][ 1160] <[::ffff:192.41.105.40]:41014><--><[::ffff:131.225.69.121]:1094> Removing socket from the poller
[2026-01-28 17:12:40.053291 +0000][Debug  ][PostMaster        ][ 1160] [fndca1.fnal.gov:1094] Destroying stream
[2026-01-28 17:12:40.053328 +0000][Debug  ][AsyncSock         ][ 1160] [fndca1.fnal.gov:1094.0] Closing the socket
[2026-01-28 17:12:40.053365 +0000][Debug  ][AsyncSock         ][ 1160] [stkendca2014.fnal.gov:23148.0] Closing the socket
[2026-01-28 17:12:40.053373 +0000][Debug  ][Poller            ][ 1160] <[::ffff:192.41.105.40]:44382><--><[::ffff:131.225.69.138]:23148> Removing socket from the poller
[2026-01-28 17:12:40.053400 +0000][Debug  ][PostMaster        ][ 1160] [stkendca2014.fnal.gov:23148] Destroying stream
[2026-01-28 17:12:40.053470 +0000][Debug  ][AsyncSock         ][ 1160] [stkendca2014.fnal.gov:23148.0] Closing the socket
=== End last 50 lines of lar log file ===
lar exit code 0
Traceback (most recent call last):
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v10_11_00d01/bin/extractor_prod.py", line 434, in <module>
    main()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v10_11_00d01/bin/extractor_prod.py", line 373, in main
    mddict = expSpecificMetadata.getmetadata()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v10_11_00d01/bin/extractor_prod.py", line 344, in getmetadata
    jobt = self.get_job(proc)
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v10_11_00d01/bin/extractor_prod.py", line 69, in get_job
    raise RuntimeError('sam_metadata_dumper returned nonzero exit status {}.'.format(rc))
RuntimeError: sam_metadata_dumper returned nonzero exit status 1.
extractor_prod.py exit code 1
Error reading metadata from file: Expecting value: line 1 column 1 (char 0)
pdjson2metadata exit code 1
.:
total 520
-rw-r--r-- 1 gl05pi8 eddie_users 251000 Jan 28 17:11 xrootdfiles.list
-rw-r--r-- 1 gl05pi8 eddie_users 123453 Jan 28 17:12 000001_reco_2026-01-28T_171153Z.log
-rw-r--r-- 1 gl05pi8 eddie_users  82651 Jan 28 17:12 RootOutput-7eb9-abad-428a-d4ea.root
-rw-r--r-- 1 gl05pi8 eddie_users  26121 Jan 28 17:12 jobscript.log
-rw-r--r-- 1 gl05pi8 eddie_users  11549 Jan 28 17:11 PandoraSettings_Cheat.xml
-rw-r--r-- 1 gl05pi8 eddie_users   7560 Jan 28 17:12 reco2_hist.root
-rw-r--r-- 1 gl05pi8 eddie_users   2086 Jan 28 17:11 PandoraSettings_Cheating_Master_DUNEFD.xml
-rw-r--r-- 1 gl05pi8 eddie_users   1935 Jan 28 17:11 PandoraSettings_Slicing_Cheat.xml
-rw-r--r-- 1 gl05pi8 eddie_users    237 Jan 28 17:11 PandoraSettings_Cosmic_Cheat.xml
-rw-r--r-- 1 gl05pi8 eddie_users     52 Jan 28 17:11 all-input-dids.txt
-rw-r--r-- 1 gl05pi8 eddie_users      0 Jan 28 17:12 000001_reco_data_2026-01-28T_171153Z.root.ext.json
-rw-r--r-- 1 gl05pi8 eddie_users      0 Jan 28 17:12 000001_reco_data_2026-01-28T_171153Z.root.json
-rw-r--r-- 1 gl05pi8 eddie_users      0 Jan 28 17:12 debugprod.log
justIN time: 2026-02-04 08:44:44 UTC       justIN version: 01.06.00