justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Workflow 10938, Stage 1

Priority50
Processors1
Wall seconds80000
Image/cvmfs/singularity.opensciencegrid.org/fermilab/fnal-wn-sl7:latest
RSS bytes4194304000 (4000 MiB)
Max distance for inputs3000.0
Enabled input RSEs CERN_PDUNE_EOS, DUNE_CA_SFU, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_GLASGOW, DUNE_UK_LANCASTER_CEPH, DUNE_UK_MANCHESTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, MONTECARLO, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC
Enabled output RSEs CERN_PDUNE_EOS, DUNE_CA_SFU, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_GLASGOW, DUNE_UK_LANCASTER_CEPH, DUNE_UK_MANCHESTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC
Enabled sites BR_CBPF, CA_SFU, CA_Victoria, CERN, CH_UNIBE-LHEP, CZ_FZU, ES_CIEMAT, ES_PIC, FR_CCIN2P3, IT_CNAF, NL_NIKHEF, NL_SURFsara, UK_Bristol, UK_Brunel, UK_Durham, UK_Edinburgh, UK_Lancaster, UK_Liverpool, UK_Manchester, UK_Oxford, UK_QMUL, UK_RAL-PPD, UK_RAL-Tier1, UK_Sheffield, US_Colorado, US_FNAL-FermiGrid, US_FNAL-T1, US_Michigan, US_PuertoRico, US_SU-ITS, US_Swan, US_UChicago, US_UConn-HPC, US_UCSD, US_Wisconsin
Scopeusertests
Events for this stage

Output patterns

 DestinationPatternLifetimeFor next stageRSE expression
1https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/10938/1graph_output*

Environment variables

NameValue
CODE_TAR_DIR_LOCAL/cvmfs/fifeuser1.opensciencegrid.org/sw/dune/c976ce192292f2f9835b2b96cb18eba74fad6274
DUNE_QUALIFIERe26:prof
DUNE_VERSIONv10_04_06d00
FCL_FILE/cvmfs/fifeuser4.opensciencegrid.org/sw/dune/796bc68541be1b664a9943f6d9c55c88be5b8089/atm-training-extract.fcl
FCL_SECONDARY/cvmfs/fifeuser4.opensciencegrid.org/sw/dune/796bc68541be1b664a9943f6d9c55c88be5b8089/eid.fcl
FCL_THIRD/cvmfs/fifeuser4.opensciencegrid.org/sw/dune/796bc68541be1b664a9943f6d9c55c88be5b8089/anatree_new_prod.fcl
NUM_EVENTS200
XML_MASTER/cvmfs/fifeuser4.opensciencegrid.org/sw/dune/796bc68541be1b664a9943f6d9c55c88be5b8089/PandoraSettings_Master_Training_CSV.xml
XML_NEUTRINO/cvmfs/fifeuser4.opensciencegrid.org/sw/dune/796bc68541be1b664a9943f6d9c55c88be5b8089/PandoraSettings_Neutrino_Training_CSV.xml

File states

Total filesFindingUnallocatedAllocatedOutputtingProcessedNot foundFailed
1000000099307

Job states

TotalSubmittedStartedProcessingOutputtingFinishedNotusedAbortedStalledJobscript errorOutputting failedNone processed
1209000010730421745311
Files processed00100100200200300300400400500500600600700700800800900900Dec-02 12:00Dec-02 15:00Dec-02 18:00Dec-02 21:00Dec-03 00:00Dec-03 03:00Dec-03 06:00Dec-03 09:00Dec-03 12:00Dec-03 15:00Dec-03 18:00Dec-03 21:00Dec-04 00:00Dec-04 03:00Dec-04 06:00Dec-04 09:00Dec-04 12:00Files processedBin start timesNumber per binNL_SURFsaraNL_NIKHEFES_PICUK_BristolUK_DurhamUK_QMULUK_OxfordUK_ManchesterUK_RAL-Tier1UK_LancasterUK_BrunelUK_RAL-PPDFR_CCIN2P3CZ_FZUUS_UCSDBR_CBPF
Replicas per RSE1000461.1421299615057166.887819694856321000430.6994348441155326.291155082636641000279.83658678754176266.48662268735166129298.09564684787074167.8201923942711111313.78417105870983152.12476566373812106330.5228434350572141.0478565161182698347.9101659683255133.708761340685595365.3185713478017129.728500285406832377.0359808535869128.74172436098104Replicas per RSEDUNE_ES_PIC (28%)FNAL_DCACHE (28%)PRAGUE (28%)RAL_ECHO (3%)NIKHEF (3%)SURFSARA (2%)QMUL (2%)RAL-PP (2%)DUNE_FR_CCIN2P3_DISK (0%)

RSEs used

NameInputsOutputs
PRAGUE4520
RAL_ECHO1320
QMUL1210
RAL-PP980
NIKHEF970
SURFSARA960
DUNE_ES_PIC940
DUNE_FR_CCIN2P3_DISK290
None07944

Stats of processed input files as CSV or JSON, and of uploaded output files as CSV or JSON (up to 10000 files included)

File reset events, by site

SiteAllocatedOutputting
UK_Manchester201
UK_QMUL1423
CA_Victoria50
NL_SURFsara30
NL_NIKHEF33
UK_Edinburgh30
UK_RAL-PPD20
UK_RAL-Tier110
ES_PIC10
CZ_FZU01
BR_CBPF01

Jobscript

#!/bin/bash
:<<'EOF'
This jobscript generates CaloHitList-based graph data 
from input reco2 ROOT files using your custom LArSoft setup.

Required environment variables:
  - FCL_FILE
  - CODE_TAR_DIR_LOCAL
  - DUNE_VERSION
  - DUNE_QUALIFIER
  - XML_MASTER
  - XML_NEUTRINO
  - NUM_EVENTS (optional)
  - FCL_SECONDARY (optional)
  - FCL_THIRD (optional)
EOF

# === Setup FCL and version info ===
FCL_FILE=${FCL_FILE:-atm-training-extract.fcl}
DUNE_VERSION=${DUNE_VERSION:-v10_04_06d00}
DUNE_QUALIFIER=${DUNE_QUALIFIER:-e26:prof}

# === Number of events option ===
if [ -n "$NUM_EVENTS" ]; then
  events_option="-n $NUM_EVENTS"
fi

# === Get a file from justIN ===
did_pfn_rse=$($JUSTIN_PATH/justin-get-file)
if [ -z "$did_pfn_rse" ]; then
  echo "No file assigned. Exiting jobscript."
  exit 0
fi

# === Track input DID for MetaCat ===
echo "$did_pfn_rse" | cut -f1 -d' ' >> all-input-dids.txt

# === Parse PFN from DID ===
pfn=$(echo "$did_pfn_rse" | cut -d' ' -f2)
echo "Input PFN = $pfn"

# === Setup DUNE software ===
source /cvmfs/dune.opensciencegrid.org/products/dune/setup_dune.sh
setup dunesw "$DUNE_VERSION" -q "$DUNE_QUALIFIER"

# === Mirror CODE_TAR_DIR_LOCAL ===
INPUT_TAR_DIR_LOCAL="$CODE_TAR_DIR_LOCAL"
echo "INPUT_TAR_DIR_LOCAL = $INPUT_TAR_DIR_LOCAL"

# === Setup custom code ===
if [ -n "$CODE_TAR_DIR_LOCAL" ]; then
  echo "Using local products from $CODE_TAR_DIR_LOCAL"
  source "$CODE_TAR_DIR_LOCAL/larsoft_graph_V1_2025/localProducts_larsoft_v10_04_06_e26_prof/setup-grid"
  mrbslp
fi

# === Generate common timestamp and random suffix for output renaming ===
timestamp=$(date -u +"%Y-%m-%dT_%H%M%SZ")
rand_suffix=$((1 + RANDOM % 10))

# === Output file naming ===
fname=$(basename "$pfn" .root)
outFile="${fname}_graph_${timestamp}.root"
logFile="${fname}_graph_${timestamp}.log"

# === Set FW search path ===
XML_DIR_MASTER=$(dirname "$XML_MASTER")
XML_DIR_NEUTRINO=$(dirname "$XML_NEUTRINO")
export FW_SEARCH_PATH="$XML_DIR_MASTER:$XML_DIR_NEUTRINO:$FW_SEARCH_PATH"

# === Run lar (primary) ===
export LD_PRELOAD=${XROOTD_LIB}/libXrdPosixPreload.so
echo "Running LArSoft with FCL: $FCL_FILE"
lar -c "$FCL_FILE" $events_option -o "$outFile" "$pfn" > "$logFile" 2>&1
larExit=$?

# === Run lar (secondary) if needed ===
if [ -n "$FCL_SECONDARY" ]; then
  secondary_out="secondary_${outFile}"
  secondary_log="secondary_${logFile}"
  echo "Running LArSoft with secondary FCL: $FCL_SECONDARY"
  lar -c "$FCL_SECONDARY" $events_option "$pfn" > "$secondary_log" 2>&1
fi

if [ -f "$secondary_log" ]; then
  echo '=== Start last 100 lines of secondary lar log file ==='
  tail -100 "$secondary_log"
  echo '=== End last 100 lines of secondary lar log file ==='
fi

# === Run lar (third) if needed ===
if [ -n "$FCL_THIRD" ]; then
  third_out="third_${outFile}"
  third_log="third_${logFile}"
  echo "Running LArSoft with third FCL: $FCL_THIRD"
  lar -c "$FCL_THIRD" $events_option "$pfn" > "$third_log" 2>&1
fi

if [ -f "$third_log" ]; then
  echo '=== Start last 100 lines of third lar log file ==='
  tail -100 "$third_log"
  echo '=== End last 100 lines of third lar log file ==='
fi

# === Rename .data and .root files with timestamp and suffix ===
if [ $larExit -eq 0 ]; then
  # Rename .data files
  for f in *.data; do
    if [ -f "$f" ]; then
      newname="graph_output_${timestamp}_${rand_suffix}_$f"
      mv -f "$f" "$newname"
      echo "Renamed $f -> $newname"
    fi
  done

  # Rename *eid.root files
  for f in *eid.root; do
    if [ -f "$f" ]; then
      newname="graph_output_${timestamp}_${rand_suffix}_$f"
      mv -f "$f" "$newname"
      echo "Renamed $f -> $newname"
    fi
  done

  # Rename ana_tree_hd.root
  if [ -f ana_tree_hd.root ]; then
    newname="graph_output_${timestamp}_${rand_suffix}_ana_tree_hd.root"
    mv ana_tree_hd.root "$newname"
    echo "Renamed ana_tree_hd.root -> $newname"
  fi

  echo "$pfn" > justin-processed-pfns.txt
  jobscriptExit=0
else
  jobscriptExit=1
fi





# === Show lar log tail ===
echo '=== Start last 100 lines of lar log file ==='
tail -100 "$logFile"
echo '=== End last 100 lines of lar log file ==='

# === Mark processed ===
if [ $larExit -eq 0 ]; then
  echo "$pfn" > justin-processed-pfns.txt
  jobscriptExit=0
else
  jobscriptExit=1
fi

# === Package logs ===
tar zcf "${JUSTIN_JOBSUB_ID//[@]/_}.logs.tgz" *.log

# === Display output summary ===
echo "=== Generated output files ==="
ls -1 *.* 2>/dev/null | grep -v 'all-input-dids.txt' || echo "No output files found."

exit $jobscriptExit
justIN time: 2025-12-19 02:01:14 UTC       justIN version: 01.05.03