justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Workflow 2661, Stage 1

Priority50
Processors1
Wall seconds43200
Image/cvmfs/singularity.opensciencegrid.org/fermilab/fnal-wn-sl7:latest
RSS bytes4194304000 (4000 MiB)
Max distance for inputs30.0
Enabled input RSEs CERN_PDUNE_EOS, DUNE_CA_SFU, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_GLASGOW, DUNE_UK_LANCASTER_CEPH, DUNE_UK_MANCHESTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, MONTECARLO, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC
Enabled output RSEs CERN_PDUNE_EOS, DUNE_CA_SFU, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_GLASGOW, DUNE_UK_LANCASTER_CEPH, DUNE_UK_MANCHESTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC
Enabled sites BR_CBPF, CA_SFU, CERN, CH_UNIBE-LHEP, CZ_FZU, ES_CIEMAT, ES_PIC, FR_CCIN2P3, IT_CNAF, NL_NIKHEF, NL_SURFsara, UK_Bristol, UK_Brunel, UK_Durham, UK_Edinburgh, UK_Glasgow, UK_Lancaster, UK_Liverpool, UK_Manchester, UK_Oxford, UK_QMUL, UK_RAL-PPD, UK_RAL-Tier1, UK_Sheffield, US_Colorado, US_FNAL-FermiGrid, US_FNAL-T1, US_Michigan, US_PuertoRico, US_SU-ITS, US_Swan, US_UChicago, US_UConn-HPC, US_UCSD, US_Wisconsin
Scopeusertests
Events for this stage

Output patterns

 DestinationPatternLifetimeFor next stageRSE expression
1https://fndcadoor.fnal.gov:2880/dune/scratch/users/imawby/splitting_nu_3/fnal/02661/1*CheatingKalmanSplittingU*.root
2https://fndcadoor.fnal.gov:2880/dune/scratch/users/imawby/splitting_nu_3/fnal/02661/1*CheatingKalmanSplittingV*.root
3https://fndcadoor.fnal.gov:2880/dune/scratch/users/imawby/splitting_nu_3/fnal/02661/1*CheatingKalmanSplittingW*.root

Environment variables

NameValue
INPUT_TAR_DIR_LOCAL/cvmfs/fifeuser1.opensciencegrid.org/sw/dune/432ff2a09171fb45cd52c6bcbb8cee015838b67a
NUM_EVENTS100

File states

Total filesFindingUnallocatedAllocatedOutputtingProcessedNot foundFailed
10000000968032

Job states

TotalSubmittedStartedProcessingOutputtingFinishedNotusedAbortedStalledJobscript errorOutputting failedNone processed
26220000241100021100
Files processed00100100200200300300400400500500600600700700800800Sep-16 15:00Sep-16 16:00Sep-16 17:00Sep-16 18:00Files processedBin start timesNumber per binUK_BristolNL_SURFsaraUK_OxfordUK_ManchesterUK_RAL-Tier1NL_NIKHEFUK_RAL-PPDFR_CCIN2P3US_UChicagoES_PICUS_WisconsinCZ_FZUUS_FNAL-T1US_FNAL-FermiG…US_FNAL-FermiGridCERNIT_CNAFUS_UCSD
Replicas per RSE1000478.61544523168413203.743366006409221000296.6058201267621279.720471256207556314.137381683453147.4904865593204655326.44077032460876138.3722163945360853339.60613884718026131.3931782864769446352.4607186469468126.7688333185349436363.5032548677674124.2871579630834334373.10057842485963123.16192842729894378.34291231890774122.938652130069514379.4476152227206122.92651693384667Replicas per RSEDUNE_US_FNAL_DISK_S…DUNE_US_FNAL_DISK_STAGE (43%)FNAL_DCACHE (43%)NIKHEF (2%)PRAGUE (2%)RAL-PP (2%)SURFSARA (2%)RAL_ECHO (1%)QMUL (1%)DUNE_ES_PIC (0%)DUNE_FR_CCIN2P3_DIS…DUNE_FR_CCIN2P3_DISK (0%)

RSEs used

NameInputsOutputs
DUNE_US_FNAL_DISK_STAGE7160
QMUL2000
PRAGUE640
NIKHEF560
RAL-PP530
SURFSARA460
RAL_ECHO360
DUNE_ES_PIC40
DUNE_FR_CCIN2P3_DISK40
None02904

Stats of processed input files as CSV or JSON, and of uploaded output files as CSV or JSON (up to 10000 files included)

Jobscript

#!/bin/bash

# FCL file and DUNE software version/qualifier to be used
FCL_FILE=${FCL_FILE:-$INPUT_TAR_DIR_LOCAL/kalmanTraining.fcl}
DUNE_VERSION=${DUNE_VERSION:-v10_08_00d00}
DUNE_QUALIFIER=${DUNE_QUALIFIER:-e26:prof}

# Make sure that we're not beeing greedy with resources...
if [ -z ${JUSTIN_PROCESSORS} ]; then
 JUSTIN_PROCESSORS=1
fi

echo "Justin processors: ${JUSTIN_PROCESSORS}"

export TF_NUM_THREADS=${JUSTIN_PROCESSORS}
export OPENBLAS_NUM_THREADS=${JUSTIN_PROCESSORS}
export JULIA_NUM_THREADS=${JUSTIN_PROCESSORS}
export MKL_NUM_THREADS=${JUSTIN_PROCESSORS}
export NUMEXPR_NUM_THREADS=${JUSTIN_PROCESSORS}
export OMP_NUM_THREADS=${JUSTIN_PROCESSORS}

# setup the DUNE environment
source /cvmfs/dune.opensciencegrid.org/products/dune/setup_dune.sh
source $INPUT_TAR_DIR_LOCAL/setup-grid
setup dunesw "$DUNE_VERSION" -q "$DUNE_QUALIFIER"
mrbslp

export FW_SEARCH_PATH=.:${INPUT_TAR_DIR_LOCAL}:$FW_SEARCH_PATH
export FHICL_FILE_PATH=.:${INPUT_TAR_DIR_LOCAL}:$FHICL_FILE_PATH

# number of events to process from the input file
if [ "$NUM_EVENTS" != "" ] ; then
 events_option="-n $NUM_EVENTS"
fi

# First get an unprocessed file from this stage
did_pfn_rse=`$JUSTIN_PATH/justin-get-file`

if [ "$did_pfn_rse" = "" ] ; then
  echo "Nothing to process - exit jobscript"
  exit 0
fi

# Keep a record of all input DIDs, for pdjson2meta file -> DID mapping
echo "$did_pfn_rse" | cut -f1 -d' ' >>all-input-dids.txt

# pfn is also needed when creating justin-processed-pfns.txt
pfn=`echo $did_pfn_rse | cut -f2 -d' '`
echo "Input PFN = $pfn"

# Construct outFile from input $pfn 
now=$(date -u +"%Y-%m-%dT_%H%M%SZ")
Ffname=`echo $pfn | awk -F/ '{print $NF}'`
fname=`echo $Ffname | awk -F. '{print $1}'`

campaign="justIN.w${JUSTIN_WORKFLOW_ID}s${JUSTIN_STAGE_ID}"

# Here is where the LArSoft command is call it 
(
# Do the scary preload stuff in a subshell!
export LD_PRELOAD=${XROOTD_LIB}/libXrdPosixPreload.so

lar -c $FCL_FILE $events_option "$pfn" > ${fname}_reco_${now}.log 2>&1
)

# Subshell exits with exit code of last command
larExit=$?
echo "lar exit code $larExit"

echo '=== Start last 100 lines of lar log file ==='
tail -100 ${fname}_reco_${now}.log
echo '=== End last 100 lines of lar log file ==='

mv CheatingKalmanSplittingU.root CheatingKalmanSplittingU_${fname}.root
mv CheatingKalmanSplittingV.root CheatingKalmanSplittingV_${fname}.root
mv CheatingKalmanSplittingW.root CheatingKalmanSplittingW_${fname}.root

if [ $larExit -eq 0 ] ; then
  # Success !
  echo "$pfn" > justin-processed-pfns.txt
  jobscriptExit=0
else
  # Oh :(
  jobscriptExit=1
fi

# Create compressed tar file with all log files 
tar zcf `echo "$JUSTIN_JOBSUB_ID.logs.tgz" | sed 's/@/_/g'` *.log
exit $jobscriptExit
justIN time: 2025-09-18 16:03:41 UTC       justIN version: 01.05.00