Show Entry in Conferences and Workshops

You may return to the Main Menu when done.

Showing 1 entry


[iccs15]
Abhinav Sarje, Sukhyun Song, Douglas Jacobsen, Kevin A. Huck, Jeffrey K. Hollingsworth, Allen D. Malony, Samuel Williams, Leonid Oliker: Parallel Performance Optimizations on Unstructured Mesh-based Simulations. ICCS 2015: 2016-2025

Keywords: Unstructured Mesh, Ocean Modeling, Graph Partitioning, Performance Optimization

This paper addresses two key parallelization challenges the unstructured mesh- based ocean modeling code, MPAS-Ocean, which uses a mesh based on Voronoi tessellations: (1) load imbalance across processes, and (2) unstructured data access patterns, that inhibit intra- and inter-node performance. Our work analyzes the load imbalance due to naive partitioning of the mesh, and develops methods to generate mesh partitioning with better load balance and reduced communication. Furthermore, we present methods that minimize both inter- and intranode data movement and maximize data reuse. Our techniques include predictive ordering of data elements for higher cache efficiency, as well as communication reduction approaches. We present detailed performance data when running on thousands of cores using the Cray XC30 supercomputer and show that our optimization strategies can exceed the original performance by over 2×. Additionally, many of these solutions can be broadly applied to a wide variety of unstructured grid-based computations

URLs:

Modified:
Created: Fri Jan 20 14:23:54 2017


Current Collection: Conferences and Workshops
[ Menu | List | Show | About ]

Return to the ParaDucks Research Group Publications page.