Optimizing the hybrid parallelization of BHAC

Open Access
Authors
  • S. Cielo
  • O. Porth
  • L. Iapichino
  • A. Karmakar
  • H. Olivares
  • C. Xia
Publication date 01-2022
Journal Astronomy and Computing
Article number 100509
Volume 38
Number of pages 9
Organisations
  • Faculty of Science (FNWI) - Anton Pannekoek Institute for Astronomy (API)
Abstract
We present our experience with the modernization of the GR-MHD code BHAC, aimed at improving its novel hybrid (MPI+OpenMP) parallelization scheme. In doing so, we showcase the use of performance profiling tools usable on x86 (Intel-based) architectures. Our performance characterization and threading analysis provided guidance in improving the concurrency and thus the efficiency of the OpenMP parallel regions. We assess scaling and communication patterns in order to identify and alleviate MPI bottlenecks, with both runtime switches and precise code interventions. The performance of the optimized version of BHAC improved by 28%, making it viable for scaling on several hundreds of supercomputer nodes. We finally test whether porting such optimizations to different hardware is likewise beneficial on the new architecture by running on ARM A64FX vector nodes.
Document type Article
Language English
Published at https://doi.org/10.1016/j.ascom.2021.100509
Published at https://arxiv.org/abs/2108.12240
Other links https://ui.adsabs.harvard.edu/abs/2022A%26C....3800509C/abstract