TAU v2.14.6, PDT v3.4 and VTF3 v1.34 released

(30 June 2005) We've added support for large trace files (> 2GB), GPSHMEM, and now we distribute Jumpshot4 and SLOG2 SDK as part of TAU. TAU_COMPILER and tau_instrumentor are enhanced to better support automatic instrumentation of Fortran 95 codes using PDT v3.4.

TAU v2.14.5 released

(8 June 2005) We've added support for importing CUBE(Kojak) profiles in paraprof. TAU has a new -MPITRACE option that produces trace files with events that are ancestors of MPI calls. These traces can be converted to the Epilog format (from Kojak) for use with the expert tool. TAU_COMPILER instrumentation tool has been updated to support OpenMP instrumentation with Kojak's Opari instrumentor. Paraprof has a new thread statistics table window with support for expanding a callgraph by clicking on a node. You can sort on a particular column by clicking on it its heading.

TAU v2.14.4 released

(18 May 2005) We've added support for memory headroom calculation. Paraprof has a packed profile data format, reverse callpath views, and search capabilities. TAU has a new context user defined event where application specific events can be mapped to the program's callstack. TAU traces can now be converted to the Epilog trace format using tau2elg tool.

TAU v2.14.3 released

(20 Apr 2005) We've added support for 3D profile displays in Paraprof. TAU now supports the Jumpshot4 trace visualizer with the SLOG2 trace converter.

TAU v2.14.1 released

(20 Jan 2005) We've added support for phase based profiling, dynamic timers, a tool to convert vtf3 trace files to TAU profiles, and several enhancements to Paraprof. Paraprof now has an option to show the complete callgraph (clickable to identify the callpath, with zoom in/out capabilities, options to select node colors and sizes).Paraprof has a new scalable histogram display which shows the no. of threads of a routine in each bin (between max and min values, with the ability to change the no. of bins). TAU features better support for multi-threaded executions, and support for PathScale compilers (C, C++, Fortran 95) for Opteron Linux platform. PDT v3.3.1 is also released with support for PathScale compilers.

TAU v2.14 released

(Nov 2004) TAU now supports Oracle, PostgreSQL and MySQL databases in PerfDMF.

TAU v2.13.7 released

(Aug 2004) TAU now supports generation of binary VTF3 traces using VTF3 Trace Library from TU Dresden.

TAU v2.12.9 released

(July 2003) TAU v2.12.9 introduces the new paraprof profile browser [Europar03], DyninstAPI 4.0 support for rewriting binary images, file level selective instrumentation support, gprof style parallel callpath views for callpath profiles in paraprof, user specified depth in callpath profiles, Python API improvements, Opari updates for OpenMP instrumentation and EPILOG trace file format support from the KOJAK (FZJ) project.

TAU v2.12.5 released

(March 2003) TAU v2.12.5 supports Python bindings and automatic instrumentation of Python code.

Call Path profiling

(Aug 2002) TAU supports call path profiling. This allows a user to explore the time spent along a specific call path. Currently, the latest release (TAU v2.11.17) supports a two-level call path. See Call Path Profiling for further details. TAU also supports PETSc in this release.

New tool: tau_reduce

(July 2002) Frequently executing light-weight routines may distort the performance data by introducing unnecessary overhead. To weed out these routines, a new tool tau_reduce has been introduced in TAU. It reads the profile output and a rules file that specifies when a routine should not be instrumented, and produces a selective instrumentation file that lists routines that should be excluded from instrumentation. This information can be fed to tau_instrumentor based on PDT or tau_run based on DyninstAPI to reduce the instrumentation overhead for subsequent runs. See examples/reduce and utils/TAU_REDUCE.README for more information.

Support for EPILOG and EXPERT

(June 2002) TAU can generate EPILOG binary traces which can be analyzed using the EXPERT tool. See [KOJAK]. TAU also supports Hitachi SR8000, NEC SX and IA-64 Linux platforms. Under IA-64, Intel C/C++/F90 compilers are supported.

Runtime access to performance data

(May 2002) TAU v2.11.14 also supports runtime access to performance data that allows an application to query its performance metrics. TAU also features selective dumping of profile data and incremental dumping of data at runtime. TAU supports integrated performance analysis in the Uintah software. See [ISHPC'02 paper].

Selective Instrumentation

(April 2002) TAU supports selective instrumentation of source code (using PDT) and object code (using DyninstAPI). A selective instrumentation file can specify a list of routines that are to be instrumented or to be excluded from instrumentation.

Support for multiple counters

(March 2002) TAU can now support profiling with more than one quantity (such as wallclock time, hardware performance counters). Different options can be selected by setting COUNTER[1-25] environment variables to indicate the counters to be profiled. TAU also supports PAPI v2.1 in this release. See -MULTIPLECOUNTERS configuration option.

Dynamic Grouping

(Feb 2002) TAU supports dynamic creation of profile groups. This allows users to enable and disable groups at runtime, as well as associate groups with files during instrumentation using tau_instrumentor. Support for profile groups is demonstrated in SAMRAI(LLNL).

F90 Support

(Jan 2002) TAU supports F90 instrumentation using PDT.

Access to x86 timers under Linux

(Dec. 2001) TAU supports access to low-overhead timers under Linux using the -LINUXTIMERS configuration option.

jracy released in TAU v2.10

(Nov. 2001) TAU has a new profile browser (jracy) implemented in Java. Sample images of jracy can be seen in EVH1 Profiles.


(Oct. 2001) TAU works with UPS.


(Sept. 2001) XPARE (eXPeriment Alerting and REporting) is a system for performance experimentation that is integrated in a weekly testing harness for the Uintah / C-SAFE software development effort. With this system we can produce detailed weekly reports of Uintah / C-SAFE performance and alert code developers of performance problems as they arise.

TAU v 2.9.19 Released

(Aug 2001) TAU v 2.9.19 features support for OpenMP directive rewriting (Opari) based instrumentation for OpenMP programs. See LACSI 2001 paper.

TAU v 2.9.12 Released

(July 2001) TAU v 2.9.12 features support for several thread packages (SGI sproc, pthread, Java, Windows, OpenMP, Tulip, SMARTS) and for a runtime profile snapshot (TAU_DB_DUMP) facility in addition to extensions to its performance data mapping API. See the download section for instructions on downloading TAU.

TAU Documentation

(June 2001) TAU JAVA Grande/ISCOPE'01 paper (mpiJava, multi-level instrumentation) and PDPTA'01 paper (use of DyninstAPI with MPI) [All papers].

TAU v 2.9 Released

(Nov. 2000) TAU v2.9 features support for mixed model programming, support for PAPI, PCL for hardware performance counters and new ports (to IA-64). See the Download page for more information.

TAU supports Hybrid Execution Models

TAU supports MPI+pthread, MPI+OpenMP and MPI+Java hybrid execution models. For details see DAPSYS2000 and ICSJava papers.

TAU supports PAPI and OpenMP with MPI (OpenMPI)

TAU supports access to hardware performance counters using PAPI. For details see PAPI and OpenMPI announcements.

TAU v 2.8.11 Released

(Oct. 2000) TAU v2.8x implements the performance mapping API that allows performance data to be corelated between different layers in a multi-layered software. It features support for Fortran 90 and MPI Profiling Interface. It supports access to hardware performance counters using PCL and PAPI on several platforms including Cray T3E, SGI, UltraSparc, IBM Power3, Intel Pentium+

Profiling User Events in PaRP

TAU now implements profiling of user defined event. These could be used to track memory statistics or any application specific statistics maintained on a per thread basis. Click here for more information on its use in the PaRP project.

Vampir and Smarts

TAU can generate event traces for Vampir for Smarts user level threads. This can be a valuable tool in evaluating efficient thread scheduling policies in SMARTS. Click here for more information.

TAU integrated with Pooma II

TAU uses the EDG parser, IL converter and DUCTAPE to automatically insert TAU macros in the source code. TAU is now integrated with Pooma II. Click here for more information.

Pthread support

TAU Profiling package now supports pthreads using -pthread configure option. Version 2.3 released on Aug. 10, 1998 also supports user defined events. C programs can now be profiled using TAU using the same API as C++.

TAU IL Converter

TAU IL converter and program database for analysis tools uses an EDG front end to parse a C++ program and converts the intermediate language to a format that can be used by TAU tools. For more info see the documentation section.

TAU Tracing

The TAU Portable package can now generate traces that can be viewed using VAMPIR. For details see the Tutorial Tracing for VAMPIR.