Workflow 273, Stage 1
Priority | 50 |
Processors | 1 |
Wall seconds | 80000 |
Image | /cvmfs/singularity.opensciencegrid.org/fermilab/fnal-wn-sl7:latest |
RSS bytes | 4194304000 (4000 MiB) |
Max distance for inputs | 3000.0 |
Enabled input RSEs |
CERN_PDUNE_EOS, DUNE_CA_SFU, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_GLASGOW, DUNE_UK_LANCASTER_CEPH, DUNE_UK_MANCHESTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, MONTECARLO, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC |
Enabled output RSEs |
CERN_PDUNE_EOS, DUNE_CA_SFU, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_GLASGOW, DUNE_UK_LANCASTER_CEPH, DUNE_UK_MANCHESTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC |
Enabled sites |
BR_CBPF, CA_SFU, CERN, CH_UNIBE-LHEP, CZ_FZU, ES_CIEMAT, ES_PIC, FR_CCIN2P3, IT_CNAF, NL_NIKHEF, NL_SURFsara, UK_Bristol, UK_Brunel, UK_Durham, UK_Edinburgh, UK_Glasgow, UK_Imperial, UK_Lancaster, UK_Liverpool, UK_Manchester, UK_Oxford, UK_QMUL, UK_RAL-PPD, UK_RAL-Tier1, UK_Sheffield, US_Colorado, US_FNAL-FermiGrid, US_FNAL-T1, US_Michigan, US_PuertoRico, US_SU-ITS, US_Swan, US_UChicago, US_UConn-HPC, US_UCSD, US_Wisconsin |
Scope | usertests |
Events for this stage |
Output patterns
| Destination | Pattern | Lifetime | For next stage | RSE expression |
---|
1 | https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00273/1 | graph_output* | | | |
2 | https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00273/1 | *.csv | | | |
Environment variables
Name | Value |
---|
CODE_TAR_DIR_LOCAL | /cvmfs/fifeuser2.opensciencegrid.org/sw/dune/c976ce192292f2f9835b2b96cb18eba74fad6274 |
DUNE_QUALIFIER | e26:prof |
DUNE_VERSION | v10_04_06d00 |
FCL_FILE | /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/aded37b8339c4dad9b3dde933bfdb47ff2c3e23e/atm-training-extract.fcl |
FCL_SECONDARY | /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/aded37b8339c4dad9b3dde933bfdb47ff2c3e23e/eid.fcl |
FCL_THIRD | /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/aded37b8339c4dad9b3dde933bfdb47ff2c3e23e/anatree_new_prod.fcl |
NUM_EVENTS | 200 |
XML_MASTER | /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/aded37b8339c4dad9b3dde933bfdb47ff2c3e23e/PandoraSettings_Master_Training_CSV.xml |
XML_NEUTRINO | /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/aded37b8339c4dad9b3dde933bfdb47ff2c3e23e/PandoraSettings_Neutrino_Training_CSV.xml |
File states
Total files | Finding | Unallocated | Allocated | Outputting | Processed | Not found | Failed |
---|
|
0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Job states
Total | Submitted | Started | Processing | Outputting | Finished | Notused | Aborted | Stalled | Jobscript error | Outputting failed | None processed |
---|
0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Stats of processed input files as CSV or JSON, and of uploaded output files as CSV or JSON (up to 10000 files included)
Jobscript
#!/bin/bash
:<<'EOF'
This jobscript generates CaloHitList-based graph data
from input reco2 ROOT files using your custom LArSoft setup.
Required environment variables:
- FCL_FILE
- CODE_TAR_DIR_LOCAL
- DUNE_VERSION
- DUNE_QUALIFIER
- XML_MASTER
- XML_NEUTRINO
- NUM_EVENTS (optional)
- FCL_SECONDARY (optional)
- FCL_THIRD (optional)
EOF
# === Setup FCL and version info ===
FCL_FILE=${FCL_FILE:-atm-training-extract.fcl}
DUNE_VERSION=${DUNE_VERSION:-v10_04_06d00}
DUNE_QUALIFIER=${DUNE_QUALIFIER:-e26:prof}
# === Number of events option ===
if [ -n "$NUM_EVENTS" ]; then
events_option="-n $NUM_EVENTS"
fi
# === Get a file from justIN ===
did_pfn_rse=$($JUSTIN_PATH/justin-get-file)
if [ -z "$did_pfn_rse" ]; then
echo "No file assigned. Exiting jobscript."
exit 0
fi
# === Track input DID for MetaCat ===
echo "$did_pfn_rse" | cut -f1 -d' ' >> all-input-dids.txt
# === Parse PFN from DID ===
pfn=$(echo "$did_pfn_rse" | cut -d' ' -f2)
echo "Input PFN = $pfn"
# === Setup DUNE software ===
source /cvmfs/dune.opensciencegrid.org/products/dune/setup_dune.sh
setup dunesw "$DUNE_VERSION" -q "$DUNE_QUALIFIER"
# === Mirror CODE_TAR_DIR_LOCAL ===
INPUT_TAR_DIR_LOCAL="$CODE_TAR_DIR_LOCAL"
echo "INPUT_TAR_DIR_LOCAL = $INPUT_TAR_DIR_LOCAL"
# === Setup custom code ===
if [ -n "$CODE_TAR_DIR_LOCAL" ]; then
echo "Using local products from $CODE_TAR_DIR_LOCAL"
source "$CODE_TAR_DIR_LOCAL/larsoft_graph_V1_2025/localProducts_larsoft_v10_04_06_e26_prof/setup-grid"
mrbslp
fi
# === Generate common timestamp and random suffix for output renaming ===
timestamp=$(date -u +"%Y-%m-%dT_%H%M%SZ")
rand_suffix=$((1 + RANDOM % 10))
# === Output file naming ===
fname=$(basename "$pfn" .root)
outFile="${fname}_graph_${timestamp}.root"
logFile="${fname}_graph_${timestamp}.log"
# === Set FW search path ===
XML_DIR_MASTER=$(dirname "$XML_MASTER")
XML_DIR_NEUTRINO=$(dirname "$XML_NEUTRINO")
export FW_SEARCH_PATH="$XML_DIR_MASTER:$XML_DIR_NEUTRINO:$FW_SEARCH_PATH"
# === Run lar (primary) ===
export LD_PRELOAD=${XROOTD_LIB}/libXrdPosixPreload.so
echo "Running LArSoft with FCL: $FCL_FILE"
lar -c "$FCL_FILE" $events_option -o "$outFile" "$pfn" > "$logFile" 2>&1
larExit=$?
# === Run lar (secondary) if needed ===
if [ -n "$FCL_SECONDARY" ]; then
secondary_out="secondary_${outFile}"
secondary_log="secondary_${logFile}"
echo "Running LArSoft with secondary FCL: $FCL_SECONDARY"
lar -c "$FCL_SECONDARY" $events_option "$pfn" > "$secondary_log" 2>&1
fi
if [ -f "$secondary_log" ]; then
echo '=== Start last 100 lines of secondary lar log file ==='
tail -100 "$secondary_log"
echo '=== End last 100 lines of secondary lar log file ==='
fi
# === Run lar (third) if needed ===
if [ -n "$FCL_THIRD" ]; then
third_out="third_${outFile}"
third_log="third_${logFile}"
echo "Running LArSoft with third FCL: $FCL_THIRD"
lar -c "$FCL_THIRD" $events_option "$pfn" > "$third_log" 2>&1
fi
if [ -f "$third_log" ]; then
echo '=== Start last 100 lines of third lar log file ==='
tail -100 "$third_log"
echo '=== End last 100 lines of third lar log file ==='
fi
# === Rename .data and .root files with timestamp and suffix ===
if [ $larExit -eq 0 ]; then
# Rename .data files
for f in *.data; do
if [ -f "$f" ]; then
newname="graph_output_${timestamp}_${rand_suffix}_$f"
mv -f "$f" "$newname"
echo "Renamed $f -> $newname"
fi
done
# Rename *eid.root files
for f in *eid.root; do
if [ -f "$f" ]; then
newname="graph_output_${timestamp}_${rand_suffix}_$f"
mv -f "$f" "$newname"
echo "Renamed $f -> $newname"
fi
done
# Rename ana_tree_hd.root
if [ -f ana_tree_hd.root ]; then
newname="graph_output_${timestamp}_${rand_suffix}_ana_tree_hd.root"
mv ana_tree_hd.root "$newname"
echo "Renamed ana_tree_hd.root -> $newname"
fi
echo "$pfn" > justin-processed-pfns.txt
jobscriptExit=0
else
jobscriptExit=1
fi
# === Show lar log tail ===
echo '=== Start last 100 lines of lar log file ==='
tail -100 "$logFile"
echo '=== End last 100 lines of lar log file ==='
# === Mark processed ===
if [ $larExit -eq 0 ]; then
echo "$pfn" > justin-processed-pfns.txt
jobscriptExit=0
else
jobscriptExit=1
fi
# === Package logs ===
tar zcf "${JUSTIN_JOBSUB_ID//[@]/_}.logs.tgz" *.log
# === Display output summary ===
echo "=== Generated output files ==="
ls -1 *.* 2>/dev/null | grep -v 'all-input-dids.txt' || echo "No output files found."
exit $jobscriptExit