
Jobsub ID 20160.73@dunegpschedd01.fnal.gov

Jobsub ID: 20160.73@dunegpschedd01.fnal.gov
Workflow ID: 202
Stage ID: 1
User name: lwhite86@fnal.gov
HTCondor Group: group_dune
Requested:
    Processors: 1
    GPU: No
    RSS bytes: 4194304000 (4000 MiB)
    Wall seconds limit: 3600 (1 hours)
Submitted time: 2025-08-01 10:20:03
Site: NL_SURFsara
Entry: DUNE_SurfSARA_arc03
Last heartbeat: 2025-08-01 10:37:54
From worker node:
    Hostname: wn-lb-12.gina.surf.nl
    cpuinfo: AMD EPYC 9754 128-Core Processor
    OS release: Scientific Linux release 7.9 (Nitrogen)
    Processors: 1
    RSS bytes: 4194304000 (4000 MiB)
    Wall seconds limit: 129600 (36 hours)
    GPU:
    Inner Apptainer? True
Job state: finished
Started: 2025-08-01 10:21:15
Input files: fardet-hd:anutau_dune10kt_1x2x6_1075_564_20230824T164356Z_gen_g4_detsim_hitreco__20240220T174547Z_reco2.root
Jobscript:
    Exit code: 0
    Real time: 16m (964s)
    CPU time: 13m (810s = 84%)
    Max RSS bytes: 1328115712 (1266 MiB)
Outputting started: 2025-08-01 10:37:20
Output files: https://fndcadoor.fnal.gov:2880/dune/scratch/users/lwhite86/fnal/00202/1/001/trainingFile_anutau_dune10kt_1x2x6_1075_564_20230824T164356Z_gen_g4_detsim_hitreco__20240220T174547Z_reco2.root
Finished: 2025-08-01 10:37:54
Saved logs: justin-logs:20160.73-dunegpschedd01.fnal.gov.logs.tgz
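
The derived figures in the job summary (84% CPU efficiency, 4000 MiB requested, 1266 MiB peak) can be cross-checked against the raw second and byte counts. The following is a minimal sketch, not justIN's own code: the function names are hypothetical, and the rounding choices (nearest integer for the percentage, truncation for MiB) are assumptions chosen because they reproduce the numbers shown on this page. The last line also reproduces the 1328.12 MB that the MemoryTracker summary in the jobscript log reports for VmHWM in base-10 units.

```python
MIB = 1024 * 1024  # bytes per mebibyte (base 2), as used in the job summary
MB = 1000 * 1000   # bytes per megabyte (base 10), as used by MemoryTracker


def cpu_efficiency_percent(cpu_seconds, real_seconds):
    """CPU time as a percentage of wall-clock time (rounding is an assumption)."""
    return round(100 * cpu_seconds / real_seconds)


def bytes_to_mib(n_bytes):
    """Whole mebibytes, discarding any fraction (truncation is an assumption)."""
    return n_bytes // MIB


def bytes_to_mb(n_bytes):
    """Base-10 megabytes, rounded to two decimal places."""
    return round(n_bytes / MB, 2)


print(cpu_efficiency_percent(810, 964))  # CPU time 810s over real time 964s
print(bytes_to_mib(4194304000))          # requested RSS
print(bytes_to_mib(1328115712))          # max RSS, base-2 units
print(bytes_to_mb(1328115712))           # max RSS, base-10 units
```

With the values from this job the four lines print 84, 4000, 1266 and 1328.12, so the summary's Max RSS and the MemoryTracker's VmHWM describe the same peak in different units.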

Jobscript log (last 10,000 characters)

woViewTransverseTracksAlgorithm: failed to calculate correlation coefficient p-value for these numbers
----view 0: 0.00187492 0.00213575 0.00224066 0.00867319 0.00817013 0.00195885 0.00604618 0.0037967 0.00806856 0.00358152 0.000848472 
----view 1: 3.96371e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 
TwoViewTransverseTracksAlgorithm: failed to calculate correlation coefficient p-value for these numbers
----view 0: 0.00213575 0.00224066 0.00867319 0.00817013 0.00195885 0.00604618 0.0037967 0.00806856 0.00358152 0.000848472 0.00271946 
----view 1: 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 
TwoViewTransverseTracksAlgorithm: failed to calculate correlation coefficient p-value for these numbers
----view 0: 0.00224066 0.00867319 0.00817013 0.00195885 0.00604618 0.0037967 0.00806856 0.00358152 0.000848472 0.00271946 0.00490928 
----view 1: 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.96371e-05 
TwoViewTransverseTracksAlgorithm: failed to calculate correlation coefficient p-value for these numbers
----view 0: 0.00867319 0.00817013 0.00195885 0.00604618 0.0037967 0.00806856 0.00358152 0.000848472 0.00271946 0.00490928 0.00318527 
----view 1: 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.96371e-05 3.95775e-05 
TwoViewTransverseTracksAlgorithm: failed to calculate correlation coefficient p-value for these numbers
----view 0: 0.00817013 0.00195885 0.00604618 0.0037967 0.00806856 0.00358152 0.000848472 0.00271946 0.00490928 0.00318527 0.00276041 
----view 1: 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.96371e-05 3.95775e-05 3.96371e-05 
TwoViewTransverseTracksAlgorithm: failed to calculate correlation coefficient p-value for these numbers
----view 0: 0.00195885 0.00604618 0.0037967 0.00806856 0.00358152 0.000848472 0.00271946 0.00490928 0.00318527 0.00276041 0.00679636 
----view 1: 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 
TwoViewTransverseTracksAlgorithm: failed to calculate correlation coefficient p-value for these numbers
----view 0: 0.00604618 0.0037967 0.00806856 0.00358152 0.000848472 0.00271946 0.00490928 0.00318527 0.00276041 0.00679636 0.00229758 
----view 1: 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 
TwoViewTransverseTracksAlgorithm: failed to calculate correlation coefficient p-value for these numbers
----view 0: 0.0037967 0.00806856 0.00358152 0.000848472 0.00271946 0.00490928 0.00318527 0.00276041 0.00679636 0.00229758 0.00291514 
----view 1: 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 
TwoViewTransverseTracksAlgorithm: failed to calculate correlation coefficient p-value for these numbers
----view 0: 0.00806856 0.00358152 0.000848472 0.00271946 0.00490928 0.00318527 0.00276041 0.00679636 0.00229758 0.00291514 0.00280631 
----view 1: 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 
TwoViewTransverseTracksAlgorithm: failed to calculate correlation coefficient p-value for these numbers
----view 0: 0.00358152 0.000848472 0.00271946 0.00490928 0.00318527 0.00276041 0.00679636 0.00229758 0.00291514 0.00280631 0.00293654 
----view 1: 3.96371e-05 3.95775e-05 3.96371e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 
TwoViewTransverseTracksAlgorithm: failed to calculate correlation coefficient p-value for these numbers
----view 0: 0.000848472 0.00271946 0.00490928 0.00318527 0.00276041 0.00679636 0.00229758 0.00291514 0.00280631 0.00293654 0.00237823 
----view 1: 3.95775e-05 3.96371e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 3.95775e-05 3.96371e-05 
Begin processing the 76th record. run: 1075 subRun: 1 event: 56476 at 01-Aug-2025 12:35:13 CEST
Begin processing the 77th record. run: 1075 subRun: 1 event: 56477 at 01-Aug-2025 12:35:35 CEST
Begin processing the 78th record. run: 1075 subRun: 1 event: 56478 at 01-Aug-2025 12:35:38 CEST
Skipping event as it does not have enough hits or associated primary particles to make a training sample
iter->second->Run() throw STATUS_CODE_FAILURE
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, STATUS_CODE_FAILURE
Begin processing the 79th record. run: 1075 subRun: 1 event: 56479 at 01-Aug-2025 12:35:41 CEST
Begin processing the 80th record. run: 1075 subRun: 1 event: 56480 at 01-Aug-2025 12:35:46 CEST
Begin processing the 81st record. run: 1075 subRun: 1 event: 56481 at 01-Aug-2025 12:35:50 CEST
Begin processing the 82nd record. run: 1075 subRun: 1 event: 56482 at 01-Aug-2025 12:35:54 CEST
Begin processing the 83rd record. run: 1075 subRun: 1 event: 56483 at 01-Aug-2025 12:36:01 CEST
Begin processing the 84th record. run: 1075 subRun: 1 event: 56484 at 01-Aug-2025 12:36:05 CEST
Begin processing the 85th record. run: 1075 subRun: 1 event: 56485 at 01-Aug-2025 12:36:09 CEST
Begin processing the 86th record. run: 1075 subRun: 1 event: 56486 at 01-Aug-2025 12:36:13 CEST
Begin processing the 87th record. run: 1075 subRun: 1 event: 56487 at 01-Aug-2025 12:36:16 CEST
Begin processing the 88th record. run: 1075 subRun: 1 event: 56488 at 01-Aug-2025 12:36:19 CEST
Begin processing the 89th record. run: 1075 subRun: 1 event: 56489 at 01-Aug-2025 12:36:23 CEST
Begin processing the 90th record. run: 1075 subRun: 1 event: 56490 at 01-Aug-2025 12:36:27 CEST
Begin processing the 91st record. run: 1075 subRun: 1 event: 56491 at 01-Aug-2025 12:36:33 CEST
Begin processing the 92nd record. run: 1075 subRun: 1 event: 56492 at 01-Aug-2025 12:36:38 CEST
Begin processing the 93rd record. run: 1075 subRun: 1 event: 56493 at 01-Aug-2025 12:36:41 CEST
Begin processing the 94th record. run: 1075 subRun: 1 event: 56494 at 01-Aug-2025 12:36:44 CEST
Begin processing the 95th record. run: 1075 subRun: 1 event: 56495 at 01-Aug-2025 12:36:59 CEST
Begin processing the 96th record. run: 1075 subRun: 1 event: 56496 at 01-Aug-2025 12:37:01 CEST
Begin processing the 97th record. run: 1075 subRun: 1 event: 56497 at 01-Aug-2025 12:37:05 CEST
Begin processing the 98th record. run: 1075 subRun: 1 event: 56498 at 01-Aug-2025 12:37:07 CEST
Begin processing the 99th record. run: 1075 subRun: 1 event: 56499 at 01-Aug-2025 12:37:10 CEST
Begin processing the 100th record. run: 1075 subRun: 1 event: 56500 at 01-Aug-2025 12:37:16 CEST
01-Aug-2025 12:37:18 CEST  Closed output file "anutau_dune10kt_1x2x6_1075_564_20230824T164356Z_gen_g4_detsim_hitreco__20240220T174547Z_reco2_reco2.root"
01-Aug-2025 12:37:18 CEST  Closed input file "root://xrootd.echo.stfc.ac.uk:1094/dune:/protodune/RSE/fardet-hd/9c/d1/anutau_dune10kt_1x2x6_1075_564_20230824T164356Z_gen_g4_detsim_hitreco__20240220T174547Z_reco2.root"

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                     0.484388       8.04996       174.08        2.32996       21.5637        100    
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                         0.0540613     0.201184      0.495958      0.181057      0.0747583       100    
reco:pandora2:StandardPandora                  0.243816       7.84785       173.771       2.07073       21.5558        100    
[art]:TriggerResults:TriggerResultInserter    1.5452e-05    2.46531e-05   0.000114813   1.9229e-05    1.47846e-05      100    
end_path:out1:RootOutput                       4.716e-06    5.55732e-06   1.6936e-05    5.4235e-06    1.17418e-06      100    
end_path:out1:RootOutput(write)               0.000209294   0.000657385   0.00253104     0.0004435    0.000496601      100    
================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 2310.69 MB
  Peak resident set size usage (VmHWM): 1328.12 MB
====================================================================================================
Art has completed and will exit with status 0.
lar exit code 0
total 2292
-rw-r--r--. 1 dune003 dune     218 Aug  1 12:21 all-input-dids.txt
-rw-r--r--. 1 dune003 dune  350650 Aug  1 12:37 anutau_dune10kt_1x2x6_1075_564_20230824T164356Z_gen_g4_detsim_hitreco__20240220T174547Z_reco2_reco2.root
-rw-r--r--. 1 dune003 dune       0 Aug  1 12:21 debugprod.log
-rw-r--r--. 1 dune003 dune   67530 Aug  1 12:37 jobscript.log
-rw-r--r--. 1 dune003 dune     170 Aug  1 12:37 justin-processed-pfns.txt
drwxr-xr-x. 4 dune003 dune      60 Aug  1 12:21 larpandoracontent
-rw-r--r--. 1 dune003 dune     519 Aug  1 12:37 reco2_hist.root
-rw-r--r--. 1 dune003 dune 1848725 Aug  1 12:37 trainingFile_anutau_dune10kt_1x2x6_1075_564_20230824T164356Z_gen_g4_detsim_hitreco__20240220T174547Z_reco2.root
justIN time: 2025-08-04 15:55:59 UTC       justIN version: 01.04.00