Jobsub ID 253648.178@dunegpschedd01.fnal.gov
| Jobsub ID | 253648.178@dunegpschedd01.fnal.gov |
| Workflow ID | 10329 |
| Stage ID | 1 |
| User name | higuera@fnal.gov |
| HTCondor Group | group_dune.prod_mcsim |
| Requested | Processors | 1 |
| | GPU | No |
| | RSS bytes | 4194304000 (4000 MiB) |
| | Wall seconds limit | 80000 (22 hours) |
| Submitted time | 2025-11-18 08:25:52 |
| Site | UK_Edinburgh |
| Entry | DUNE_UK_SGridECDF_ce1_multicore |
| Last heartbeat | 2025-11-18 08:44:30 |
| From worker node | Hostname | node2b08.ecdf.ed.ac.uk |
| | cpuinfo | Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz |
| | OS release | Scientific Linux release 7.9 (Nitrogen) |
| | Processors | 1 |
| | RSS bytes | 4194304000 (4000 MiB) |
| | Wall seconds limit | 171000 (47 hours) |
| | GPU | |
| | Inner Apptainer? | True |
| Job state | jobscript_error |
| Started | 2025-11-18 08:27:49 |
| Input files | hd-protodune:pdhd_prod_beam__242390_48_1_20251112T221046Z_gen_g4_IonScintPDExt.root |
| Jobscript | Exit code | 1 |
| | Real time | 0m (0s) |
| | CPU time | 0m (0s = 0%) |
| | Max RSS bytes | 0 (0 MiB) |
| Outputting started | |
| Output files | |
| Finished | 2025-11-18 08:44:30 |
| Saved logs | justin-logs:253648.178-dunegpschedd01.fnal.gov.logs.tgz |
Jobscript log (last 10,000 characters)
0220.
[2025-11-18 08:32:02.484575 +0000][Debug ][ExDbgMsg ] [msg: 0x10a5c110] Assigned MsgHandler: 0x28490220.
[2025-11-18 08:32:02.484619 +0000][Debug ][ExDbgMsg ] [handler: 0x28490220] Removed MsgHandler: 0x28490220 from the in-queue.
[2025-11-18 08:32:02.511748 +0000][Debug ][ExDbgMsg ] [stkendca2220.fnal.gov:22861] Calling MsgHandler: 0x28490220 (message: kXR_read (handle: 0x00000000, offset: 1292467756, size: 78459457) ) with status: [SUCCESS] .
[2025-11-18 08:32:02.511793 +0000][Debug ][ExDbgMsg ] [stkendca2220.fnal.gov:22861] Destroying MsgHandler: 0x28490220.
Begin processing the 4th record. run: 20250627 subRun: 1 event: 493 at 18-Nov-2025 08:32:41 GMT
[2025-11-18 08:32:41.037886 +0000][Debug ][File ] [0x95add20@root://fndcadoor.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/hd-protodune/dc/9f/pdhd_prod_beam__242390_48_1_20251112T221046Z_gen_g4_IonScintPDExt.root?xrdcl.requuid=d5182f07-747d-4595-a969-5b40ba6baabc] Sending a read command for handle 0x0 to stkendca2220.fnal.gov:22861
[2025-11-18 08:32:41.037927 +0000][Debug ][ExDbgMsg ] [stkendca2220.fnal.gov:22861] MsgHandler created: 0x28490220 (message: kXR_read (handle: 0x00000000, offset: 1438981484, size: 65895624) ).
[2025-11-18 08:32:41.038101 +0000][Debug ][ExDbgMsg ] [stkendca2220.fnal.gov:22861] Moving MsgHandler: 0x28490220 (message: kXR_read (handle: 0x00000000, offset: 1438981484, size: 65895624) ) from out-queu to in-queue.
[2025-11-18 08:32:41.141904 +0000][Debug ][ExDbgMsg ] [msg: 0x2bd3cb20] Assigned MsgHandler: 0x28490220.
[2025-11-18 08:32:42.065690 +0000][Debug ][ExDbgMsg ] [msg: 0x1076acf0] Assigned MsgHandler: 0x28490220.
[2025-11-18 08:32:42.169496 +0000][Debug ][ExDbgMsg ] [msg: 0x36607fc0] Assigned MsgHandler: 0x28490220.
[2025-11-18 08:32:42.269718 +0000][Debug ][ExDbgMsg ] [msg: 0x10e65560] Assigned MsgHandler: 0x28490220.
[2025-11-18 08:32:42.365788 +0000][Debug ][ExDbgMsg ] [msg: 0x10a9dc70] Assigned MsgHandler: 0x28490220.
[2025-11-18 08:32:42.475149 +0000][Debug ][ExDbgMsg ] [msg: 0x31b10750] Assigned MsgHandler: 0x28490220.
[2025-11-18 08:32:42.573838 +0000][Debug ][ExDbgMsg ] [msg: 0x109fa990] Assigned MsgHandler: 0x28490220.
[2025-11-18 08:32:42.619080 +0000][Debug ][ExDbgMsg ] [msg: 0x31dfc210] Assigned MsgHandler: 0x28490220.
[2025-11-18 08:32:42.619119 +0000][Debug ][ExDbgMsg ] [handler: 0x28490220] Removed MsgHandler: 0x28490220 from the in-queue.
[2025-11-18 08:32:42.776362 +0000][Debug ][ExDbgMsg ] [stkendca2220.fnal.gov:22861] Calling MsgHandler: 0x28490220 (message: kXR_read (handle: 0x00000000, offset: 1438981484, size: 65895624) ) with status: [SUCCESS] .
[2025-11-18 08:32:42.776419 +0000][Debug ][ExDbgMsg ] [stkendca2220.fnal.gov:22861] Destroying MsgHandler: 0x28490220.
R__unzip: error -5 in inflate (zlib)
[2025-11-18 08:32:42.913631 +0000][Debug ][File ] [0x95add20@root://fndcadoor.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/hd-protodune/dc/9f/pdhd_prod_beam__242390_48_1_20251112T221046Z_gen_g4_IonScintPDExt.root?xrdcl.requuid=d5182f07-747d-4595-a969-5b40ba6baabc] Sending a close command for handle 0x0 to stkendca2220.fnal.gov:22861
[2025-11-18 08:32:42.914057 +0000][Debug ][ExDbgMsg ] [stkendca2220.fnal.gov:22861] MsgHandler created: 0x28490220 (message: kXR_close (handle: 0x00000000) ).
[2025-11-18 08:32:42.914157 +0000][Debug ][ExDbgMsg ] [stkendca2220.fnal.gov:22861] Moving MsgHandler: 0x28490220 (message: kXR_close (handle: 0x00000000) ) from out-queu to in-queue.
[2025-11-18 08:32:43.017357 +0000][Debug ][ExDbgMsg ] [msg: 0x10946690] Assigned MsgHandler: 0x28490220.
[2025-11-18 08:32:43.017410 +0000][Debug ][ExDbgMsg ] [handler: 0x28490220] Removed MsgHandler: 0x28490220 from the in-queue.
[2025-11-18 08:32:43.017551 +0000][Debug ][ExDbgMsg ] [stkendca2220.fnal.gov:22861] Calling MsgHandler: 0x28490220 (message: kXR_close (handle: 0x00000000) ) with status: [SUCCESS] .
[2025-11-18 08:32:43.017650 +0000][Debug ][File ] [0x95add20@root://fndcadoor.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/hd-protodune/dc/9f/pdhd_prod_beam__242390_48_1_20251112T221046Z_gen_g4_IonScintPDExt.root?xrdcl.requuid=d5182f07-747d-4595-a969-5b40ba6baabc] Close returned from stkendca2220.fnal.gov:22861 with: [SUCCESS]
[2025-11-18 08:32:43.017698 +0000][Debug ][ExDbgMsg ] [stkendca2220.fnal.gov:22861] Destroying MsgHandler: 0x28490220.
18-Nov-2025 08:32:43 GMT Closed input file "root://fndcadoor.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/hd-protodune/dc/9f/pdhd_prod_beam__242390_48_1_20251112T221046Z_gen_g4_IonScintPDExt.root"
================================================================================================================================
TimeTracker printout (sec) Min Avg Max Median RMS nEvts
================================================================================================================================
Full event 0.0110023 31.7785 46.3921 40.3555 18.5418 4
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read) 0.0110023 0.263461 0.909566 0.0666389 0.375722 4
simulate:PDFastSimTPC1:PDFastSimPAR 30.4057 33.0099 36.9148 31.7092 2.812 3
[art]:TriggerResults:TriggerResultInserter 2.0943e-05 3.3834e-05 5.9088e-05 2.1471e-05 1.78586e-05 3
end_path:out1:RootOutput 3.114e-06 6.62633e-06 1.3455e-05 3.31e-06 4.82926e-06 3
end_path:out1:RootOutput(write) 8.30408 9.00983 9.37015 9.35525 0.499077 3
================================================================================================================================
%MSG-i NuRandomService: RootOutput:out1@EndJob 18-Nov-2025 08:32:43 GMT ModuleEndJob
Summary of seeds computed by the NuRandomService
Random policy: 'random'
master seed: 838217056
seed within: [ 1 ; 900000000 ]
Configured value Last value ModuleLabel.InstanceName
210667936 (same) PDFastSimTPC1.photon
569208273 (same) PDFastSimTPC1.scinttime
%MSG
====================================================================================================
MemoryTracker summary (base-10 MB units used)
Peak virtual memory usage (VmPeak) : 4004.05 MB
Peak resident set size usage (VmHWM): 3060.03 MB
====================================================================================================
TrigReport ---------- Event summary -------------
TrigReport Events total = 4 passed = 3 failed = 1
TrigReport ---------- Modules in End-path ----------
TrigReport Run Success Error Name
TrigReport 3 3 0 out1
TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 129.347347 Real = 196.651630
MemReport ---------- Memory summary [base-10 MB] ------
MemReport VmPeak = 4004.05 VmHWM = 3060.03
%MSG-s ArtException: PostEndJob 18-Nov-2025 08:33:14 GMT ModuleEndJob
---- EventProcessorFailure BEGIN
EventProcessor: an exception occurred during current event processing
---- ScheduleExecutionFailure BEGIN
Path: ProcessingStopped.
---- FileReadError BEGIN
---- FatalRootError BEGIN
Fatal Root Error: TBasket::ReadBasketBuffers
fNbytes = 65895624, fKeylen = 130, fObjlen = 140756244, noutot = 16777215, nout=0, nin=8446107, nbuf=16777215
ROOT severity: 3000
---- FatalRootError END
The above exception was thrown while processing module PDFastSimPAR/PDFastSimTPC1 run: 20250627 subRun: 1 event: 493
---- FileReadError END
Exception going through path simulate
---- ScheduleExecutionFailure END
---- EventProcessorFailure END
---- FatalRootError BEGIN
Fatal Root Error: TTree::SetEntries
Tree branches have different numbers of entries, eg EventAuxiliary has 3 entries while art::RNGsnapshots_rns__IonScintPDExt. has 10 entries.
ROOT severity: 2000
---- FatalRootError END
%MSG
[2025-11-18 08:33:14.143440 +0000][Debug ][JobMgr ] Stopping the job manager...
[2025-11-18 08:33:14.144017 +0000][Debug ][JobMgr ] Job manager stopped
[2025-11-18 08:33:14.144078 +0000][Debug ][TaskMgr ] Stopping the task manager...
[2025-11-18 08:33:14.144193 +0000][Debug ][TaskMgr ] Task manager stopped
[2025-11-18 08:33:14.144198 +0000][Debug ][Poller ] Stopping the poller...
[2025-11-18 08:33:14.144496 +0000][Debug ][AsyncSock ] [fndcadoor.fnal.gov:1094.0] Closing the socket
[2025-11-18 08:33:14.144590 +0000][Debug ][Poller ] <[::ffff:192.41.105.41]:55946><--><[::ffff:131.225.69.39]:1094> Removing socket from the poller
[2025-11-18 08:33:14.144865 +0000][Debug ][PostMaster ] [fndcadoor.fnal.gov:1094] Destroying stream
[2025-11-18 08:33:14.144941 +0000][Debug ][AsyncSock ] [fndcadoor.fnal.gov:1094.0] Closing the socket
[2025-11-18 08:33:14.145051 +0000][Debug ][AsyncSock ] [stkendca2220.fnal.gov:22861.0] Closing the socket
[2025-11-18 08:33:14.145056 +0000][Debug ][Poller ] <[::ffff:192.41.105.41]:58964><--><[::ffff:131.225.69.185]:22861> Removing socket from the poller
[2025-11-18 08:33:14.145069 +0000][Debug ][PostMaster ] [stkendca2220.fnal.gov:22861] Destroying stream
[2025-11-18 08:33:14.145073 +0000][Debug ][AsyncSock ] [stkendca2220.fnal.gov:22861.0] Closing the socket
Art has completed and will exit with status 1.
PD TPC1 returns 1