Filters: Author is Emin Nuriyev [Clear All Filters]
"SUARA: A scalable universal allreduce communication algorithm for acceleration of parallel deep learning applications",
Journal of Parallel and Distributed Computing, vol. 183, pp. 15, 01/2024.
Download: jpdc-suara.pdf (2.46 MB)
"A New Model-Based Approach to Performance Comparison of MPI Collective Algorithms",
16th International Conference on Parallel Computing Technologies (PaCT 2021), Kaliningrad, Russia, Lecture Notes in Computer Science 12942, Springer, pp. 11-25, 09/2021.
Download: Nuriyev-Lastovetsky2021_Chapter_ANewModel-BasedApproachToPerfo.pdf (623.78 KB)
"Model-based selection of optimal MPI broadcast algorithms for multi-core clusters",
Journal of Parallel and Distributed Computing, vol. 165: Elsevier, pp. 1-16, 07/2022.
Download: 1-s2.0-S0743731522000697-main.pdf (988.38 KB)
"Efficient and accurate selection of optimal MPI collective algorithms using analytical performance modelling",
School of Computer Science, Dublin, University College Dublin, pp. 130, 06/2021.
Download: thesis.pdf (2.21 MB)
"Efficient and Accurate Selection of Optimal Collective Communication Algorithms Using Analytical Performance Modeling",
IEEE Access, vol. 9: IEEE, pp. 109355 - 109373, 07/2021.
Download: Efficient_and_Accurate_Selection_of_Optimal_Collective_Communication_Algorithms_Using_Analytical_Performance_Modeling.pdf (6.95 MB)