Name
tau_exec — TAU execution wrapping script
Synopsis
tau_exec [options] [--] {exe} [exe options]
Description
Use this script to perform memory or IO tracking on either an instrumented or uninstrumented executable.
Options
- -v
  verbose mode
- -qsub
  BG/P qsub mode
- -io
  track I/O
- -memory
  track memory
- -cuda
  track GPU events via CUDA (must be configured with -cuda=<dir>; preferred for CUDA 4.0 or earlier)
- -cupti
  track GPU events via Nvidia's CUPTI interface (must be configured with -cupti=<dir>; preferred for CUDA 4.1 or later)
- -opencl
  track GPU events via OpenCL
- -armci
  track ARMCI events via PARMCI (must be configured with -armci=<dir>)
- -ebs
  enable event-based sampling (see README.sampling for more information)
- -ebs_period=<count>
  sampling period (default 1000)
- -ebs_source=<counter>
  set the sampling metric (default "itimer")
- -T <option>
  specify TAU option
- -loadlib=<file.so>
  specify additional load library
- -XrunTAU-<options>
  specify TAU library directly
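The event-based sampling options are usually passed together. The following is a minimal sketch, assuming a POSIX shell, of a helper that assembles them and falls back to the documented defaults (period 1000, source "itimer"). The function name ebs_opts is hypothetical and not part of TAU; it only prints the option string and does not run anything.

```shell
#!/bin/sh
# Hypothetical helper (not part of TAU): print the event-based sampling
# options for tau_exec, using the documented defaults when an argument
# is omitted or empty.
ebs_opts() {
    period=${1:-1000}           # documented default sampling period
    source_metric=${2:-itimer}  # documented default sampling metric
    printf '%s\n' "-ebs -ebs_period=$period -ebs_source=$source_metric"
}

# Print the options with defaults, then with explicit values.
ebs_opts
ebs_opts 1000000 PAPI_FP_INS
```

The output can be spliced into a command line, for example `mpirun -np 8 tau_exec $(ebs_opts 1000000 PAPI_FP_INS) ./ring`.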
Notes
Defaults if unspecified: -T MPI. MPI is assumed unless SERIAL is specified.
CUDA kernel tracking is included if a CUDA sync call is made after each kernel launch and cudaThreadExit() is called before the exit of each thread that uses CUDA.
OpenCL kernel tracking is included if an OpenCL sync call is made after each kernel launch and clReleaseContext() is called before the exit of each thread that uses OpenCL.
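The MPI default noted above affects how a run is launched. As a rough sketch, assuming a POSIX shell, the hypothetical helper tau_launch below prints a command line that uses mpirun unless the serial tag is present. The function name, its argument order, and the case-insensitive tag match are assumptions made for illustration; mpirun -np comes from the Examples section.

```shell
#!/bin/sh
# Hypothetical helper (not part of TAU): print, but do not run, a launch
# line. MPI is assumed unless the -T tags contain "serial".
tau_launch() {
    tags=$1      # comma-separated -T tags, e.g. "serial,cupti"; may be empty
    nprocs=$2    # process count, used only for MPI launches
    shift 2      # remaining arguments: tau_exec options, exe, exe options
    # Assumption: tags are compared case-insensitively (SERIAL == serial).
    lower=$(printf '%s' "$tags" | tr '[:upper:]' '[:lower:]')
    case ",$lower," in
        *,serial,*)
            printf 'tau_exec -T %s %s\n' "$tags" "$*" ;;
        *)
            if [ -n "$tags" ]; then
                printf 'mpirun -np %s tau_exec -T %s %s\n' "$nprocs" "$tags" "$*"
            else
                printf 'mpirun -np %s tau_exec %s\n' "$nprocs" "$*"
            fi ;;
    esac
}

tau_launch "serial" 1 -io ./a.out
tau_launch "" 2 -io ./ring
```

Here ./a.out is a placeholder executable; the second call reproduces the first command in the Examples section.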
Examples
mpirun -np 2 tau_exec -io ./ring
mpirun -np 8 tau_exec -ebs -ebs_period=1000000 -ebs_source=PAPI_FP_INS ./ring
tau_exec -T serial,cupti -cupti ./matmult (Preferred for CUDA 4.1 or later)
tau_exec -T serial -cuda ./matmult (Preferred for CUDA 4.0 or earlier)
tau_exec -T serial -opencl ./matmult (OpenCL)
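The choice between -cuda and -cupti in the examples above depends on the CUDA version. A minimal sketch of that rule, assuming a POSIX shell and a version string of the form major.minor, is shown below. The function name gpu_flag is hypothetical, and how the version string is obtained is left to the caller.

```shell
#!/bin/sh
# Hypothetical helper (not part of TAU): pick the GPU tracking flag from a
# CUDA version string, following the stated preference:
#   CUDA 4.0 or earlier -> -cuda, CUDA 4.1 or later -> -cupti
gpu_flag() {
    version=$1
    major=${version%%.*}
    case $version in
        *.*) rest=${version#*.}; minor=${rest%%.*} ;;  # text after the first dot
        *)   minor=0 ;;                                # no minor part given
    esac
    if [ "$major" -gt 4 ] || { [ "$major" -eq 4 ] && [ "$minor" -ge 1 ]; }; then
        echo "-cupti"
    else
        echo "-cuda"
    fi
}

gpu_flag 4.0
gpu_flag 4.1
```

Note that the -cupti example above also passes the cupti tag to -T; this sketch only selects the tracking flag.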