justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Workflow 762, Stage 1

Priority50
Processors1
Wall seconds80000
Image/cvmfs/singularity.opensciencegrid.org/fermilab/fnal-wn-sl7:latest
RSS bytes4194304000 (4000 MiB)
Max distance for inputs3000.0
Enabled input RSEs CERN_PDUNE_EOS, DUNE_CA_SFU, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_GLASGOW, DUNE_UK_LANCASTER_CEPH, DUNE_UK_MANCHESTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, MONTECARLO, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC
Enabled output RSEs CERN_PDUNE_EOS, DUNE_CA_SFU, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_GLASGOW, DUNE_UK_LANCASTER_CEPH, DUNE_UK_MANCHESTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC
Enabled sites BR_CBPF, CA_SFU, CERN, CH_UNIBE-LHEP, CZ_FZU, ES_CIEMAT, ES_PIC, FR_CCIN2P3, IT_CNAF, NL_NIKHEF, NL_SURFsara, UK_Bristol, UK_Brunel, UK_Durham, UK_Edinburgh, UK_Glasgow, UK_Imperial, UK_Lancaster, UK_Liverpool, UK_Manchester, UK_Oxford, UK_QMUL, UK_RAL-PPD, UK_RAL-Tier1, UK_Sheffield, US_Colorado, US_FNAL-FermiGrid, US_FNAL-T1, US_Michigan, US_PuertoRico, US_SU-ITS, US_Swan, US_UChicago, US_UConn-HPC, US_UCSD, US_Wisconsin
Scopeusertests
Events for this stage

Output patterns

 DestinationPatternLifetimeFor next stageRSE expression
1https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/fnal/00762/1*_caf.root

Environment variables

NameValue
CODE_TAR_DIR_LOCAL/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/1f8f8aaded68cc7c3e873196b083ca84b933f518
DUNE_QUALIFIERe26:prof
DUNE_VERSIONv10_04_06d00
FCL_FILE/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/69307e9d0f729451cf854a7b9325462579bb3629/atm-reco_truth_vtx.fcl
FCL_SECONDARYcafmaker_atmos_dune10kt_1x2x6_runreco-nuenergy-nuangular_geov5.fcl
NUM_EVENTS200
XML_MASTER/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/69307e9d0f729451cf854a7b9325462579bb3629/PandoraSettings_Master_Atmos_Production.xml
XML_NEUTRINO/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/69307e9d0f729451cf854a7b9325462579bb3629/PandoraSettings_Neutrino_Atmos_Production.xml

File states

Total filesFindingUnallocatedAllocatedOutputtingProcessedNot foundFailed
100000205140484

Job states

TotalSubmittedStartedProcessingOutputtingFinishedNotusedAbortedStalledJobscript errorOutputting failedNone processed
476500201733004132326660
Files processed00100100200200300300400400Aug-10 17:00Aug-10 20:00Aug-10 23:00Aug-11 02:00Aug-11 05:00Aug-11 08:00Aug-11 11:00Aug-11 14:00Aug-11 17:00Aug-11 20:00Aug-11 23:00Aug-12 02:00Aug-12 05:00Aug-12 08:00Aug-12 11:00Aug-12 14:00Files processedBin start timesNumber per binNL_SURFsaraUK_DurhamUK_QMULUK_OxfordUK_ManchesterUK_RAL-Tier1UK_LancasterNL_NIKHEFUK_RAL-PPDFR_CCIN2P3ES_PICCZ_FZUUS_UChicagoUS_UCSDBR_CBPF
Replicas per RSE1000479.4491071786552199.388367831488351000323.5291001675289322.87703568476513116281.99711735697764195.48344520004434112296.5722534573339169.7512559903879108316.891326338979149.706552397314196339.91393367140586136.6285602124547765359.9518162049603130.626113904331231372.32868535264396128.9798070294038214378.1781649037108128.71576130834242Replicas per RSEFNAL_DCACHE (39%)PRAGUE (39%)SURFSARA (4%)RAL_ECHO (4%)NIKHEF (4%)QMUL (3%)RAL-PP (2%)DUNE_FR_CCIN2P3_DISK (1%)DUNE_ES_PIC (0%)

RSEs used

NameInputsOutputs
PRAGUE17540
RAL_ECHO4140
SURFSARA4090
RAL-PP2730
QMUL2690
NIKHEF2680
DUNE_FR_CCIN2P3_DISK1040
DUNE_ES_PIC550
None0514

Stats of processed input files as CSV or JSON, and of uploaded output files as CSV or JSON (up to 10000 files included)

File reset events, by site

SiteAllocatedOutputting
NL_NIKHEF2699
UK_QMUL3364
UK_Imperial20
ES_PIC1213
NL_SURFsara1216
UK_Manchester1453
CZ_FZU178
UK_Edinburgh10
UK_Lancaster0157
UK_RAL-Tier10131
UK_RAL-PPD0127
CERN0113
US_UChicago082
US_FNAL-FermiGrid073
US_UCSD062
FR_CCIN2P3020
UK_Oxford017
BR_CBPF04
US_Wisconsin03
UK_Durham01

Jobscript

#!/bin/bash
:<<'EOF'
This jobscript generates CaloHitList-based graph data 
from input reco2 ROOT files using your custom LArSoft setup.

Required environment variables:
  - FCL_FILE
  - CODE_TAR_DIR_LOCAL
  - DUNE_VERSION
  - DUNE_QUALIFIER
  - XML_MASTER
  - XML_NEUTRINO
  - NUM_EVENTS (optional)
  - FCL_SECONDARY (optional)
  - FCL_THIRD (optional)
EOF

# === Setup FCL and version info ===
FCL_FILE=${FCL_FILE:-atm-training-extract.fcl}
DUNE_VERSION=${DUNE_VERSION:-v10_04_06d00}
DUNE_QUALIFIER=${DUNE_QUALIFIER:-e26:prof}

# === Number of events option ===
if [ -n "$NUM_EVENTS" ]; then
  events_option="-n $NUM_EVENTS"
fi

# === Get a file from justIN ===
did_pfn_rse=$($JUSTIN_PATH/justin-get-file)
if [ -z "$did_pfn_rse" ]; then
  echo "No file assigned. Exiting jobscript."
  exit 0
fi

# === Track input DID for MetaCat ===
echo "$did_pfn_rse" | cut -f1 -d' ' >> all-input-dids.txt

# === Parse PFN from DID ===
pfn=$(echo "$did_pfn_rse" | cut -d' ' -f2)
echo "Input PFN = $pfn"

# === Setup DUNE software ===
source /cvmfs/dune.opensciencegrid.org/products/dune/setup_dune.sh
setup dunesw "$DUNE_VERSION" -q "$DUNE_QUALIFIER"

# === Mirror CODE_TAR_DIR_LOCAL ===
INPUT_TAR_DIR_LOCAL="$CODE_TAR_DIR_LOCAL"
echo "INPUT_TAR_DIR_LOCAL = $INPUT_TAR_DIR_LOCAL"

# === Setup custom code ===
if [ -n "$CODE_TAR_DIR_LOCAL" ]; then
  echo "Using local products from $CODE_TAR_DIR_LOCAL"
  source "$CODE_TAR_DIR_LOCAL/truth_vtx_larsoft/localProducts_larsoft_v10_04_06d00_e26_prof/setup-grid"
  mrbslp
fi

# === Generate common timestamp and random suffix for output renaming ===
timestamp=$(date -u +"%Y-%m-%dT_%H%M%SZ")
rand_suffix=$((1 + RANDOM % 10))

# === Output file naming ===
fname=$(basename "$pfn" .root)
outFile="${fname}_truthvtx_${timestamp}.root"
logFile="${fname}_truthvtx_${timestamp}.log"

# === Set FW search path ===
XML_DIR_MASTER=$(dirname "$XML_MASTER")
XML_DIR_NEUTRINO=$(dirname "$XML_NEUTRINO")
export FW_SEARCH_PATH="$XML_DIR_MASTER:$XML_DIR_NEUTRINO:$FW_SEARCH_PATH"

# === Run lar (primary) ===
export LD_PRELOAD=${XROOTD_LIB}/libXrdPosixPreload.so
echo "Running LArSoft with FCL: $FCL_FILE"
lar -c "$FCL_FILE" $events_option -o "$outFile" "$pfn" > "$logFile" 2>&1
larExit=$?


###################run caf ###################

if [ -n "$FCL_SECONDARY" ]; then
  caf_timestamp=$(date -u +"%Y-%m-%dT_%H%M%SZ")
  secondary_out="caf_${fname}_truthvtx_${caf_timestamp}.root"
  secondary_log="caf_${fname}_truthvtx_${caf_timestamp}.log"
  echo "Running CAF step with FCL_SECONDARY: $FCL_SECONDARY using reco output: $outFile"
  lar -c "$FCL_SECONDARY" $events_option -o "$secondary_out" "$outFile" > "$secondary_log" 2>&1

  # === Rename CAF output root file to include timestamp if needed ===
  new_caf_out="caf_${fname}_truthvtx_${caf_timestamp}_caf.root"
  mv caf.root "$new_caf_out"
  echo "Renamed caf.root -> $new_caf_out"

fi

# # === Run lar (third) if needed ===
# if [ -n "$FCL_THIRD" ]; then
#   third_out="third_${outFile}"
#   third_log="third_${logFile}"
#   echo "Running LArSoft with third FCL: $FCL_THIRD"
#   lar -c "$FCL_THIRD" $events_option "$pfn" > "$third_log" 2>&1
# fi

# if [ -f "$third_log" ]; then
#   echo '=== Start last 100 lines of third lar log file ==='
#   tail -100 "$third_log"
#   echo '=== End last 100 lines of third lar log file ==='
# fi

# === Rename .data and .root files with timestamp and suffix ===
# if [ $larExit -eq 0 ]; then
#   # Rename .data files
#   for f in *.data; do
#     if [ -f "$f" ]; then
#       newname="graph_output_${timestamp}_${rand_suffix}_$f"
#       mv -f "$f" "$newname"
#       echo "Renamed $f -> $newname"
#     fi
#   done

#   # Rename *eid.root files
#   for f in *eid.root; do
#     if [ -f "$f" ]; then
#       newname="graph_output_${timestamp}_${rand_suffix}_$f"
#       mv -f "$f" "$newname"
#       echo "Renamed $f -> $newname"
#     fi
#   done

#   # Rename ana_tree_hd.root
#   if [ -f ana_tree_hd.root ]; then
#     newname="graph_output_${timestamp}_${rand_suffix}_ana_tree_hd.root"
#     mv ana_tree_hd.root "$newname"
#     echo "Renamed ana_tree_hd.root -> $newname"
#   fi

#   echo "$pfn" > justin-processed-pfns.txt
#   jobscriptExit=0
# else
#   jobscriptExit=1
# fi





# === Show lar log tail ===
echo '=== Start last 100 lines of lar log file ==='
tail -100 "$logFile"
echo '=== End last 100 lines of lar log file ==='



# === Show lar log tail ===
echo '=== Start last 100 lines of lar log file ==='
tail -100 "$secondary_log"
echo '=== End last  100 lines of lar log file ==='



# === Mark processed ===
if [ $larExit -eq 0 ]; then
  echo "$pfn" > justin-processed-pfns.txt
  jobscriptExit=0
else
  jobscriptExit=1
fi

# === Package logs ===
tar zcf "${JUSTIN_JOBSUB_ID//[@]/_}.logs.tgz" *.log

# === Display output summary ===
echo "=== Generated output files ==="
ls -1 *.* 2>/dev/null | grep -v 'all-input-dids.txt' || echo "No output files found."

exit $jobscriptExit
# === Package logs ===
tar zcf "${JUSTIN_JOBSUB_ID//[@]/_}.logs.tgz" *.log

# === Display output summary ===
echo "=== Generated output files ==="
ls -1 *.* 2>/dev/null | grep -v 'all-input-dids.txt' || echo "No output files found."

exit $jobscriptExit
justIN time: 2025-08-13 09:41:05 UTC       justIN version: 01.04.00