"Scalable Dense Factorizations for Heterogeneous Computational Clusters",
7th International Symposium on Parallel and Distributed Computing, Krakow, Poland, pp. 49-56, Jul 1-5, 2008.
Download: ispdc_rreddy_scalable_factorizations.pdf (262.74 KB)
"Bi-Objective Optimization of Data-Parallel Applications on Homogeneous Multicore Clusters for Performance and Energy",
IEEE Transactions on Computers, vol. 67, issue 2: IEEE, pp. 160-177, 02/2018.
Download: paperfinal.pdf (1.16 MB)
"HMPI: A Message-Passing Library for Heterogeneous Networks of Computers",
Computer Science Department, Dublin, University College Dublin, pp. 456, 06/2005.
Download: Ravi_thesis.pdf (1.89 MB)
"HeteroMPI + ScaLAPACK: Towards a ScaLAPACK (Dense Linear Solvers) on Heterogeneous Networks of Computers",
Proceedings of the 13th IEEE International Conference on High Performance Computing (HiPC 2006), vol. 4297, Bangalore, India, Springer, pp. 242-253, 18-21 Dec 2006.
Download: 1161251345946.pdf (202.11 KB)
"HeteroPBLAS: A Set of Parallel Basic Linear Algebra Subprograms Optimized for Heterogeneous Computational Clusters",
Scalable Computing: Practice and Experience, vol. 10, issue 2, pp. 201-216, 06/2009.
Download: SCPE_10_2_06.pdf (248.74 KB)
"Parallel Data Partitioning Algorithms for Optimization of Data-Parallel Applications on Modern Extreme-Scale Multicore Platforms for Performance and Energy",
IEEE Access, vol. 6: IEEE, pp. 69075-69106, 11/2018.
Download: IEEEAccess2018PDPA.pdf (3.04 MB)
"Heterogeneous PBLAS: Optimization of PBLAS for Heterogeneous Computational Clusters",
7th International Symposium on Parallel and Distributed Computing, Krakow, Poland, pp. 73-80, Jul 1-5, 2008.
Download: ispdc_rreddy_HeteroPBLAS.pdf (268.49 KB)
"Heterogeneous PBLAS: A Set of Parallel Basic Linear Algebra Subprograms for Heterogeneous Computational Clusters",
School of Computer Science and Informatics, University College Dublin, pp. 19, 04/2008.
Download: UCD-CSI-2008-2-HeteroPBLAS.pdf (111.71 KB)
"Parallel Solvers for Dense Linear Systems for Heterogeneous Computational Clusters",
The 23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS 2009), Rome, Italy, IEEE Computer Society, May 25-29, 2009.
Download: pdsec_linear_solvers_hcc_cr.pdf (252.69 KB)
"Hierarchical Parallel Matrix Multiplication on Large-Scale Distributed Memory Platforms",
42nd International Conference on Parallel Processing (ICPP 2013), Lyon, France, IEEE, pp. 754-762, 1-4 October, 2013.
Download: 06687414.pdf (480.2 KB)
"SummaGen: Parallel Matrix-Matrix Multiplication Based on Non-rectangular Partitions for Heterogeneous HPC Platforms",
28th Heterogeneity in Computing Workshop (HCW 2019), Rio de Janeiro, Brazil, IEEE, 20/05/2019.
Download: hcw2019.pdf (673.25 KB)
"Energy aware ultrascale systems",
Ultrascale computing systems: IET, 03/2019.
Download: chap5.pdf (3.2 MB)
"Communication Performance Models for Heterogeneous Computational Clusters",
School of Computer Science and Informatics, Dublin, University College Dublin, pp. 115, 06/2009.
Download: moflynn-ethesis.pdf (925.45 KB)
"Application level energy measurements and models for hybrid platform with accelerators",
School of Computer Science, Dublin, University College Dublin, pp. 165, 05/2018.
Download: thesis.pdf (1.61 MB)
"Towards Application Energy Measurement and Modelling Tool Support",
13th International Conference on Parallel Computing Technologies (PaCT-2015), Petrozavodsk, Russia, Lecture Notes in Computer Science 9251, Springer, pp. 91-101, 31 Aug - 4 Sept, 2015.
Download: pact2015energy.pdf (383.55 KB)
"A Survey of Power and Energy Predictive Models in HPC Systems and Applications",
ACM Computing Surveys, vol. 50, issue 3: ACM, 10/2017.
Download: surveypowerenergymodelshpc.pdf (578.85 KB)
"A New Model-Based Approach to Performance Comparison of MPI Collective Algorithms",
16th International Conference on Parallel Computing Technologies (PaCT 2021), Kaliningrad, Russia, Lecture Notes in Computer Science 12942, Springer, pp. 11-25, 09/2021.
Download: Nuriyev-Lastovetsky2021_Chapter_ANewModel-BasedApproachToPerfo.pdf (623.78 KB)
"Model-based selection of optimal MPI broadcast algorithms for multi-core clusters",
Journal of Parallel and Distributed Computing, vol. 165: Elsevier, pp. 1-16, 07/2022.
Download: 1-s2.0-S0743731522000697-main.pdf (988.38 KB)
"Efficient and accurate selection of optimal MPI collective algorithms using analytical performance modelling",
School of Computer Science, Dublin, University College Dublin, pp. 130, 06/2021.
Download: thesis.pdf (2.21 MB)
"Efficient and Accurate Selection of Optimal Collective Communication Algorithms Using Analytical Performance Modeling",
IEEE Access, vol. 9: IEEE, pp. 109355-109373, 07/2021.
Download: Efficient_and_Accurate_Selection_of_Optimal_Collective_Communication_Algorithms_Using_Analytical_Performance_Modeling.pdf (6.95 MB)
"Acceleration of Bi-Objective Optimization of Data-Parallel Applications for Performance and Energy on Heterogeneous Hybrid Platforms",
IEEE Access, vol. 11: IEEE, pp. 27226-27245, 03/2023.
Download: Access-2023-acceleration.pdf (1.28 MB)
"On Energy Nonproportionality of CPUs and GPUs",
31st Heterogeneity in Computing Workshop (HCW 2022), Lyon, France, IEEE, pp. 34-44, 30/05/2022.
Download: On_Energy_Nonproportionality_of_CPUs_and_GPUs.pdf (1.03 MB)
"Topology-aware Optimization of Communications for Parallel Matrix Multiplication on Hierarchical Heterogeneous HPC Platforms",
23rd International Heterogeneity in Computing Workshop (HCW 2014), Phoenix, Arizona, USA, IEEE Computer Society, 19 May, 2014.
Download: HCW-2014-09.pdf (294.81 KB)
"Towards Optimal Matrix Partitioning for Data Parallel Computing on a Hybrid Heterogeneous Server",
IEEE Access, vol. 9: IEEE, pp. 17229-17244, 02/2021.
Download: IEEE-Access-09328411.pdf (3.76 MB)
"Optimal Matrix Partitioning for Data Parallel Computing on Hybrid Heterogeneous Platforms",
19th International Symposium on Parallel and Distributed Computing (ISPDC), Warsaw, Poland, IEEE, 5-8 July, 2020.
Download: ispdc2020.pdf (367.16 KB)