Name

tau_exec — TAU execution wrapping script

Synopsis

tau_exec [ options ] [--] { exe } [ exe options ]

Description

Use this script to perform memory or IO tracking on either an instrumented or uninstrumented executable.

Options

-v

verbose mode

-qsub

BG/P qsub mode

-io

track io

-memory

track memory

-cuda

track GPU events via CUDA (Must be configured with -cuda=<dir>, Preferred for CUDA 4.0 or earlier)

-cupti

track GPU events via Nvidia's CUPTI interface (Must be configured with -cupti=<dir>, Preferred for CUDA 4.1 or later).

-opencl

track GPU events via OpenCL

-armci

track ARMCI events via PARMCI (Must be configured with -armci=<dir>)

-ebs

enable Event-based sampling. See README.sampling for more information

-ebs_period=<count>

sampling period (default 1000)

-ebs_source=<counter>

sets sampling metric (default "itimer")

-T<option>

specify TAU option

-loadlib=<file.so>

specify additional load library

-XrunTAU-<options>

specify TAU library directly

Notes

If -T is not specified, the default is -T MPI: MPI is assumed unless SERIAL is specified.
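
To illustrate how the -T default affects the command line, here is a minimal sketch. The helper name tau_cmd is hypothetical and is not part of TAU; it only prints the command it would run, using options documented above. It assumes mpirun is the MPI launcher, as in the Examples section.

```shell
# Hypothetical helper (not part of TAU): print, but do not run, a tau_exec
# command line.
# Usage: tau_cmd <nprocs> [tau_exec options] <exe> [exe options]
# nprocs = 0 means a serial run, which needs an explicit -T serial.
# Any other value relies on the default (-T MPI) and launches through mpirun.
tau_cmd() {
    nprocs=$1
    shift
    if [ "$nprocs" -eq 0 ]; then
        echo "tau_exec -T serial $*"
    else
        echo "mpirun -np $nprocs tau_exec $*"
    fi
}

tau_cmd 2 -io ./ring          # prints: mpirun -np 2 tau_exec -io ./ring
tau_cmd 0 -memory ./matmult   # prints: tau_exec -T serial -memory ./matmult
```

Printing the command instead of executing it lets you check the chosen options before launching a job.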

CUDA kernel tracking is included if a CUDA synchronization call is made after each kernel launch and cudaThreadExit() is called before the exit of each thread that uses CUDA.

OpenCL kernel tracking is included if an OpenCL synchronization call is made after each kernel launch and clReleaseContext() is called before the exit of each thread that uses OpenCL.

Examples

mpirun -np 2 tau_exec -io ./ring

mpirun -np 8 tau_exec -ebs -ebs_period=1000000 -ebs_source=PAPI_FP_INS ./ring

tau_exec -T serial,cupti -cupti ./matmult (Preferred for CUDA 4.1 or later)

tau_exec -T serial -cuda ./matmult (Preferred for CUDA 4.0 or earlier)

tau_exec -T serial -opencl (OPENCL)