Performance Analysis of pC++: A Portable Data-Parallel Programming
System for Scalable Parallel Computers
pC++ is a language extension to C++ designed to allow programmers
to compose distributed data structures with parallel execution semantics.
These data structures are organized as ``concurrent aggregate'' collection
classes which can be aligned and distributed over the memory hierarchy
of a parallel machine in a manner consistent with the High Performance
Fortran Forum (HPF) directives for Fortran 90.
pC++ allows the user to write portable and efficient code which will
run on a wide range of scalable parallel computers.
In this paper, we discuss the performance analysis of the pC++ programming
We describe the performance tools developed and include
scalability measurements for four benchmark
programs: a ``nearest neighbor'' grid computation, a fast Poisson solver,
and the ``Embar'' and ``Sparse'' codes from the NAS suite.
In addition to speedup numbers, we present a detailed analysis
highlighting performance issues at the language, runtime system,
and target system levels.