[PREV] [Back to tutorial] [NEXT]
[TAU

TAU: Profiling Asynchronous Executions with POOMA, PETE and SMARTS

[POOMA-II

POOMA-2 includes thread-based evaluation and the ability to use the Shared Memory Asynchronous RunTime System (SMARTS). POOMA-2 and SMARTS present several problems to a performance analysis system. First, being a class library with data-parallel semantics, POOMA-level expressions will be mapped to parallel computations, either an SPMD code with message passing or a multithreaded asynchrounous code. The performance system has to be able to track this mapping and associated performance data with the framework-level abstraction. TAU does this through its mapping API and its support for tracking asynchronous execution.