"Hierarchical Approach to Optimization of MPI Collective Communication Algorithms",
School of Computer Science, Dublin, University College Dublin, pp. 152, 10/2015.
Download: khalid-thesis-oct-2015.pdf (1.1 MB)
"Hierarchical redesign of classic MPI reduction algorithms",
The Journal of Supercomputing, vol. 73, issue 2: Springer, pp. 713-725, 02/2017.
Download: TJS-Hasanov-2016.pdf (593.41 KB)
"High-Level Topology-Oblivious Optimization of MPI Broadcast Algorithms on Extreme-Scale Platforms",
Euro-Par 2014: Parallel Processing Workshops, Vol. 8806 of Lecture Notes in Computer Science, Porto, Portugal, Springer, pp. 413-425, 25-29 August, 2014.
Download: tasus_2014.pdf (280.75 KB)
"Topology-Oblivious Optimization of MPI Broadcast Algorithms on Extreme-Scale Platforms",
Simulation Modelling Practice and Theory, vol. 58: Elsevier, pp. 30-39, 11/2015.
Download: simpat2015.pdf (1.63 MB)
"Hierarchical Approach to Optimization of Parallel Matrix Multiplication on Large-Scale Platforms",
The Journal of Supercomputing, vol. 71, issue 11: Springer, pp. 3991-4014, 11/2015.
Download: JoS 2014 hierarchical matrix multiplication.pdf (1.3 MB)
"Modelling the Performance of Processors in Heterogeneous Computing Environments",
School of Computer Science and Informatics, Dublin, Ireland, University College Dublin, pp. 155, 03/2011.
Download: rob_thesis.pdf (2.6 MB)
"Scheduling for Heterogeneous Networks of Computers with Persistent Fluctuation of Load",
Parallel Computing: Current & Future Issues of High-End Computing, Proceedings of the International Conference ParCo 2005, vol. 33, Malaga, Spain, John von Neumann Institute for Computing, Julich, pp. 171-178, 13-16 Sept 2005, 2006.
Download: 171.pdf (395.65 KB)
"Managing the Construction and Use of Functional Performance Models in a Grid Environment",
The 23rd IEEE International Parallel and Distributed Processing Symposium, Rome, Italy, May 25-29, 2009.
Abstract
Download: HPGC-1569173093-paper-1.pdf (630.47 KB)
"Portable efficiency of software for parallel architectures",
Fundamental and Applied Mathematics, vol. 4, issue 3, pp. 947-974, 1998.
Download: fpm336.pdf (1.52 MB)
"Refined Description of the C[] Language",
Programming and Computer Software, vol. 28, issue 6, pp. 333-341, 2002.
Download: RefDescr_2002.pdf (54.55 KB)
"The Concept of Replication of Data and Expressions as a Means to Increase Reliability of Parallel Programs",
Proceedings of the 7th Russian Conference on Scientific Service in the Internet: Distributed Computing Technologies, Novorossiysk, Russia, 19-24 Sept 2005.
"Heterogeneous Distribution of Computations Solving Linear Algebra Problems on Networks of Heterogeneous Computers",
Journal of Parallel and Distributed Computing, vol. 61, issue 4: Academic Press, pp. 520-535, 2001.
Download: SolvinLinearAlgebra_2001.pdf (229.46 KB)
"Effective Solving Scientific Problems on Heterogeneous Networks of Computers with mpC",
Journal of Computational Methods in Science and Engineering, vol. 2, issue 1-2: IOS Press, pp. 135-140, 2002.
"Compilation of Vector Statements of C[] Language for Architectures with Multilevel Memory Hierarchy",
Programming and Computer Software, vol. 27, issue 3, pp. 111-122, 2001.
Download: CompilOfVectorExpres_2001.pdf (87.7 KB)
"Heterogeneous Distribution of Computations While Solving Linear Algebra Problems on Networks of Heterogeneous Computers",
Proceedings of the 7th International Conference on High Performance Computing and Networking Europe (HPCN`99), vol. 1593: Springer, pp. 191-200, 1999.
Download: 1124709633906.pdf (528.05 KB)
"Heterogeneous Computing",
Parallel Computing, vol. 31, issue 7: Elsevier, pp. 649-812, 2005.
Download: HC_2005.pdf (61 KB)
"mpC + ScaLAPACK = Efficient Solving Linear Algebra Problems on Heterogeneous Networks",
Proceedings of the 5th International Euro-Par Conference, vol. 1685: Springer, pp. 1024-1031, 1999.
Download: mPC_plus_ScaLAPACK_1999.pdf (175.76 KB)
"Novel Data-Partitioning Algorithms for Performance and Energy Optimization of Data-Parallel Applications on Modern Heterogeneous HPC Platforms",
School of Computer Science, Dublin, University College Dublin, pp. 264, 03/2019.
Download: thesis-hamid.pdf (5.9 MB)
"A Novel Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms",
IEEE Transactions on Parallel and Distributed Systems, vol. 29, issue 10: IEEE, pp. 2176-2190, 10/2018.
Download: paper_r2.pdf (1.49 MB); tpds2018hpoptasuppl.pdf (3.4 MB)
"Out-of-core Implementation for Accelerator Kernels on Heterogeneous Clouds",
The Journal of Supercomputing, vol. 74, issue 2, pp. 551-568, 2018.
Download: paper.pdf (762.34 KB)
"Efficient exact algorithms for continuous bi-objective performance-energy optimization of applications with linear energy and monotonically increasing performance profiles on heterogeneous high performance computing platforms",
Concurrency and Computation: Practice and Experience, vol. 35, issue 20: Wiley, pp. 1--19, 09/2023.
Download: Concurrency and Computation - 2022 - Khaleghzadeh - Efficient exact algorithms for continuous bi‐objective.pdf (1.58 MB)
"A Novel Algorithm for Bi-objective Performance-Energy Optimization of Applications with Continuous Performance and Linear Energy Profiles on Heterogeneous HPC Platforms",
19th Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2021), Lisbon, Portugal, Lecture Notes in Computer Science, vol. 13098, Springer, pp. 166-178, 31/08/2021, 2022.
Download: Khaleghzadeh2022_Chapter_ANovelAlgorithmForBi-objective.pdf (1.07 MB)
"Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Dynamic Energy Through Workload Distribution",
17th Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2019), Gottingen, Germany, Lecture Notes in Computer Science, vol. 11997, Springer, 08/2019, 2020.
Download: Khaleghzadeh2020_Chapter_OptimizationOfData-ParallelApp.pdf (692.31 KB)
"A novel data partitioning algorithm for dynamic energy optimization on heterogeneous high-performance computing platforms",
Concurrency and Computation: Practice and Experience, vol. 33, issue 21: Wiley, pp. e5928, 07/2020.
Download: CCPE-2020-dynamic-energy.pdf (1.34 MB)
"A Hierarchical Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous Multi-Accelerator NUMA Nodes",
IEEE Access, vol. 8: IEEE, pp. 7861 - 7876, 01/2020.
Download: 08933138.pdf (3.4 MB)