"HeteroMPI + ScaLAPACK: Towards a ScaLAPACK (Dense Linear Solvers) on Heterogeneous Networks of Computers",
Proceedings of the 13th IEEE International Conference on High Performance Computing (HiPC 2006), vol. 4297, Bangalore, India, Springer, pp. 242-253, 18-21 Dec 2006.
Download: 1161251345946.pdf (202.11 KB)
"HeteroPBLAS: A Set of Parallel Basic Linear Algebra Subprograms Optimized for Heterogeneous Computational Clusters",
Scalable Computing: Practice and Experience, vol. 10, issue 2, pp. 201-216, 06/2009.
Download: SCPE_10_2_06.pdf (248.74 KB)
"Design of self-adaptable data parallel applications on multicore clusters automatically optimized for performance and energy through load distribution",
Concurrency and Computation: Practice and Experience, vol. 31, issue 4: Wiley, 02/2019.
Download: ccpe2018ravi.pdf (1.67 MB)
"HMPI: A Message-Passing Library for Heterogeneous Networks of Computers",
Computer Science Department, Dublin, University College Dublin, pp. 456, 06/2005.
Download: Ravi_thesis.pdf (1.89 MB)
"Bi-Objective Optimization of Data-Parallel Applications on Homogeneous Multicore Clusters for Performance and Energy",
IEEE Transactions on Computers, vol. 67, issue 2: IEEE, pp. 160-177, 02/2018.
Download: paperfinal.pdf (1.16 MB)
"Heterogeneous PBLAS: Optimization of PBLAS for Heterogeneous Computational Clusters",
7th International Symposium on Parallel and Distributed Computing, Krakow, Poland, pp. 73-80, Jul 1-5, 2008.
Download: ispdc_rreddy_HeteroPBLAS.pdf (268.49 KB)
"Heterogeneous PBLAS: A Set of Parallel Basic Linear Algebra Subprograms for Heterogeneous Computational Clusters",
School of Computer Science and Informatics, University College Dublin, pp. 19, 04/2008.
Download: UCD-CSI-2008-2-HeteroPBLAS.pdf (111.71 KB)
"Parallel Data Partitioning Algorithms for Optimization of Data-Parallel Applications on Modern Extreme-Scale Multicore Platforms for Performance and Energy",
IEEE Access, vol. 6: IEEE, pp. 69075-69106, 11/2018.
Download: IEEEAccess2018PDPA.pdf (3.04 MB)
"Scalable Dense Factorizations for Heterogeneous Computational Clusters",
7th International Symposium on Parallel and Distributed Computing, Krakow, Poland, pp. 49-56, Jul 1-5, 2008.
Download: ispdc_rreddy_scalable_factorizations.pdf (262.74 KB)
"Hierarchical Parallel Matrix Multiplication on Large-Scale Distributed Memory Platforms",
42nd International Conference on Parallel Processing (ICPP 2013), Lyon, France, IEEE, pp. 754-762, 1-4 October, 2013.
Download: 06687414.pdf (480.2 KB)
"SummaGen: Parallel Matrix-Matrix Multiplication Based on Non-rectangular Partitions for Heterogeneous HPC Platforms",
28th Heterogeneity in Computing Workshop (HCW 2019), Rio de Janeiro, Brazil, IEEE, 20/05/2019.
Download: hcw2019.pdf (673.25 KB)
"Energy aware ultrascale systems",
Ultrascale computing systems: IET, 03/2019.
Download: chap5.pdf (3.2 MB)
"Communication Performance Models for Heterogeneous Computational Clusters",
School of Computer Science and Informatics, Dublin, University College Dublin, pp. 115, 06/2009.
Download: moflynn-ethesis.pdf (925.45 KB)
"A Survey of Power and Energy Predictive Models in HPC Systems and Applications",
ACM Computing Surveys, vol. 50, issue 3: ACM, 10/2017.
Download: surveypowerenergymodelshpc.pdf (578.85 KB)
"Towards Application Energy Measurement and Modelling Tool Support",
13th International Conference on Parallel Computing Technologies (PaCT-2015), Petrozavodsk, Russia, Lecture Notes in Computer Science 9251, Springer, pp. 91-101, 31 Aug - 4 Sept, 2015.
Download: pact2015energy.pdf (383.55 KB)
"Application level energy measurements and models for hybrid platform with accelerators",
School of Computer Science, Dublin, University College Dublin, pp. 165, 05/2018.
Download: thesis.pdf (1.61 MB)
"SUARA: A scalable universal allreduce communication algorithm for acceleration of parallel deep learning applications",
Journal of Parallel and Distributed Computing, vol. 183, pp. 15, 01/2024.
Download: jpdc-suara.pdf (2.46 MB)
"A New Model-Based Approach to Performance Comparison of MPI Collective Algorithms",
16th International Conference on Parallel Computing Technologies (PaCT 2021), Kaliningrad, Russia, Lecture Notes in Computer Science 12942, Springer, pp. 11-25, 09/2021.
Download: Nuriyev-Lastovetsky2021_Chapter_ANewModel-BasedApproachToPerfo.pdf (623.78 KB)
"Efficient and accurate selection of optimal MPI collective algorithms using analytical performance modelling",
School of Computer Science, Dublin, University College Dublin, pp. 130, 06/2021.
Download: thesis.pdf (2.21 MB)
"Efficient and Accurate Selection of Optimal Collective Communication Algorithms Using Analytical Performance Modeling",
IEEE Access, vol. 9: IEEE, pp. 109355-109373, 07/2021.
Download: Efficient_and_Accurate_Selection_of_Optimal_Collective_Communication_Algorithms_Using_Analytical_Performance_Modeling.pdf (6.95 MB)
"Model-based selection of optimal MPI broadcast algorithms for multi-core clusters",
Journal of Parallel and Distributed Computing, vol. 165: Elsevier, pp. 1-16, 07/2022.
Download: 1-s2.0-S0743731522000697-main.pdf (988.38 KB)
"On Energy Nonproportionality of CPUs and GPUs",
31st Heterogeneity in Computing Workshop (HCW 2022), Lyon, France, IEEE, pp. 34-44, 30/05/2022.
Download: On_Energy_Nonproportionality_of_CPUs_and_GPUs.pdf (1.03 MB)
"Acceleration of Bi-Objective Optimization of Data-Parallel Applications for Performance and Energy on Heterogeneous Hybrid Platforms",
IEEE Access, vol. 11: IEEE, pp. 27226-27245, 03/2023.
Download: Access-2023-acceleration.pdf (1.28 MB)
"Topology-aware Optimization of Communications for Parallel Matrix Multiplication on Hierarchical Heterogeneous HPC Platforms",
23rd International Heterogeneity in Computing Workshop (HCW 2014), Phoenix, Arizona, USA, IEEE Computer Society, 19 May, 2014.
Download: HCW-2014-09.pdf (294.81 KB)
"Optimal Matrix Partitioning for Data Parallel Computing on Hybrid Heterogeneous Platforms",
19th International Symposium on Parallel and Distributed Computing (ISPDC), Warsaw, Poland, IEEE, 5-8 July, 2020.
Download: ispdc2020.pdf (367.16 KB)