"HMPI: Towards a Message-Passing Library for Heterogeneous Networks of Computers",
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), Nice, France, IEEE Computer Society, 22-26 April 2003.
Download: IPDPS2003_HMPI.pdf (144.95 KB)
"A Non-Intrusive and Incremental Approach to Enabling Direct Communications in RPC-based Grid Programming Systems",
Technical Report UCD-CSI-2005-2, pp. 15, 2006.
Download: ucd-csi-2005-2.pdf (295.56 KB)
"On Performance Analysis of Heterogeneous Parallel Algorithms",
Parallel Computing, vol. 30, issue 11, pp. 1195-1216, 2004.
Download: ParCom2004_hetero_perf.pdf (750.84 KB)
"A Novel Algorithm of Optimal Matrix Partitioning for Parallel Dense Factorization on Heterogeneous Processors",
Proceedings of the 9th International Conference on Parallel Computing Technologies (PaCT 2007), vol. 4671, Pereslavl-Zalessky, Russia, Springer, pp. 261-275, 3-7 September, 2007.
Download: 1182174947076.pdf (305.56 KB)
"Data Partitioning with a Functional Performance Model of Heterogeneous Processors",
International Journal of High Performance Computing Applications, vol. 21, issue 1: Sage, pp. 76-90, 2007.
Download: 76.pdf (497.14 KB)
"Two-dimensional Matrix Partitioning for Parallel Computing on Heterogeneous Processors Based on their Functional Performance Models",
7th International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2009) , Delft, Netherlands, Lecture Notes in Computer Science, vol. 6043, Springer, pp. 112-121, 25/9/2009, 2010.
Download: heteropar2009-2.pdf (1021.24 KB)
"HeteroMPI: Towards a Message-Passing Library for Heterogeneous Networks of Computers",
Journal of Parallel and Distributed Computing, vol. 66, issue 2: Elsevier, pp. 197-220, 2006.
Download: JPDC_HMPI_2006.pdf (349.02 KB)
"Scientific Programming for Heterogeneous Systems - Bridging the Gap between Algorithms and Applications",
Proceedings of the 5th International Symposium on Parallel Computing in Electrical Engineering (PARELEC 2006), Bialystok, Poland, IEEE Computer Society Press, pp. 3-8, 13-17 Sept 2006.
Download: 1152191152218.pdf (66.64 KB)
"Towards Optimal Matrix Partitioning for Data Parallel Computing on a Hybrid Heterogeneous Server",
IEEE Access, vol. 9: IEEE, pp. 17229 - 17244, 02/2021.
Download: IEEE-Access-09328411.pdf (3.76 MB)
"Network-Aware Optimization of MPDATA on Homogeneous Multi-core Clusters with Heterogeneous Network",
ICA3PP 2016 Workshops, Granada, Spain, Lecture Notes in Computer Science 10049, Springer, pp. 30-42, 14-16 Dec 2016.
Download: tapems2016.pdf (357.28 KB)
"Topology-aware Optimization of Communication Cost of Parallel Applications in Heterogeneous HPC Systems",
School of Computer Science, Dublin, University College Dublin, pp. 106, 09/2016.
Download: thesis.pdf (1 MB)
"Network-aware optimization of communications for parallel matrix multiplication on hierarchical HPC platforms",
Concurrency and Computation: Practice and Experience, vol. 28, issue 3: Wiley, pp. 802-821, 03/2016.
Abstract
"Topology-aware Optimization of Communications for Parallel Matrix Multiplication on Hierarchical Heterogeneous HPC Platforms",
23rd International Heterogeneity in Computing Workshop (HCW 2014), Phoenix, Arizona, USA, IEEE Computer Society, 19 May, 2014.
Download: HCW-2014-09.pdf (294.81 KB)
"Optimal Matrix Partitioning for Data Parallel Computing on Hybrid Heterogeneous Platforms",
19th International Symposium on Parallel and Distributed Computing (ISPDC), Warsaw, Poland, IEEE, 5-8 July, 2020.
Download: ispdc2020.pdf (367.16 KB)
"Acceleration of Bi-Objective Optimization of Data-Parallel Applications for Performance and Energy on Heterogeneous Hybrid Platforms",
IEEE Access, vol. 11: IEEE, pp. 27226-27245, 03/2023.
Download: Access-2023-acceleration.pdf (1.28 MB)
"On Energy Nonproportionality of CPUs and GPUs",
31st Heterogeneity in Computing Workshop (HCW 2022), Lyon, France, IEEE, pp. 34-44, 30/05/2022.
Download: On_Energy_Nonproportionality_of_CPUs_and_GPUs.pdf (1.03 MB)
"Concurrent and Orthogonal Software Power Meters for Accurate Runtime Energy Profiling of Parallel Hybrid Programs on Heterogeneous Hybrid Servers",
IEEE Transactions on Parallel and Distributed Systems, vol. 37, issue 2: IEEE, pp. 322-339, 02/2026.
Download: Concurrent_and_Orthogonal_Software_Power_Meters_for_Accurate_Runtime_Energy_Profiling_of_Parallel_Hybrid_Programs_on_Heterogeneous_Hybrid_Servers.pdf (3.44 MB)
"Accurate and Reliable Energy Measurement and Modelling of Data Transfer Between CPU and GPU in Parallel Applications on Heterogeneous Hybrid Platforms",
IEEE Transactions on Computers, vol. 74, issue 3, pp. 1011--1024, 03/2025.
Download: Accurate_and_Reliable_Energy_Measurement_and_Modelling_of_Data_Transfer_Between_CPU_and_GPU_in_Parallel_Applications_on_Heterogeneous_Hybrid_Platforms.pdf (853.36 KB); supplemental_r2.pdf (2.05 MB)
"SUARA: A scalable universal allreduce communication algorithm for acceleration of parallel deep learning applications",
Journal of Parallel and Distributed Computing, vol. 183, pp. 15, 01/2024.
Download: jpdc-suara.pdf (2.46 MB)
"A New Model-Based Approach to Performance Comparison of MPI Collective Algorithms",
16th International Conference on Parallel Computing Technologies (PaCT 2021), Kaliningrad, Russia, Lecture Notes in Computer Science 12942, Springer, pp. 11-25, 09/2021.
Download: Nuriyev-Lastovetsky2021_Chapter_ANewModel-BasedApproachToPerfo.pdf (623.78 KB)
"Efficient and Accurate Selection of Optimal Collective Communication Algorithms Using Analytical Performance Modeling",
IEEE Access, vol. 9: IEEE, pp. 109355 - 109373, 07/2021.
Download: Efficient_and_Accurate_Selection_of_Optimal_Collective_Communication_Algorithms_Using_Analytical_Performance_Modeling.pdf (6.95 MB)
"Efficient and accurate selection of optimal MPI collective algorithms using analytical performance modelling",
School of Computer Science, Dublin, University College Dublin, pp. 130, 06/2021.
Download: thesis.pdf (2.21 MB)
"Model-based selection of optimal MPI broadcast algorithms for multi-core clusters",
Journal of Parallel and Distributed Computing, vol. 165: Elsevier, pp. 1-16, 07/2022.
Download: 1-s2.0-S0743731522000697-main.pdf (988.38 KB)
"Application level energy measurements and models for hybrid platform with accelerators",
School of Computer Science, Dublin, University College Dublin, pp. 165, 05/2018.
Download: thesis.pdf (1.61 MB)
"Towards Application Energy Measurement and Modelling Tool Support",
13th International Conference on Parallel Computing Technologies (PaCT-2015), Petrozavodsk, Russia, Lecture Notes in Computer Science 9251, Springer, pp. 91-101, 31 Aug - 4 Sept, 2015.
Download: pact2015energy.pdf (383.55 KB)

] 

