Optimizing the hybrid parallelization of BHAC
| Authors |
|
|---|---|
| Publication date | 01-2022 |
| Journal | Astronomy and Computing |
| Article number | 100509 |
| Volume | Issue number | 38 |
| Number of pages | 9 |
| Organisations |
|
| Abstract |
We present our experience with the modernization on the GR-MHD code BHAC, aimed at improving its novel hybrid (MPI+OpenMP) parallelization scheme. In doing so, we showcase the use of performance profiling tools usable on x86 (Intel-based) architectures. Our performance characterization and threading analysis provided guidance in improving the concurrency and thus the efficiency of the OpenMP parallel regions. We assess scaling and communication patterns in order to identify and alleviate MPI bottlenecks, with both runtime switches and precise code interventions. The performance of optimized version of BHAC improved by 28%, making it viable for scaling on several hundreds of supercomputer nodes. We finally test whether porting such optimizations to different hardware is likewise beneficial on the new architecture by running on ARM A64FX vector nodes.
|
| Document type | Article |
| Language | English |
| Published at | https://doi.org/10.1016/j.ascom.2021.100509 |
| Published at | https://arxiv.org/abs/2108.12240 |
| Other links | https://ui.adsabs.harvard.edu/abs/2022A%26C....3800509C/abstract |
| Downloads |
Optimizing the hybrid parallelization of BHAC arxiv
(Submitted manuscript)
|
| Permalink to this page | |