"Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Dynamic Energy Through Workload Distribution",
17th Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2019), Gottingen, Germany, Lecture Notes in Computer Science, vol. 11997, Springer, 08/2019, 2020.
Download: Khaleghzadeh2020_Chapter_OptimizationOfData-ParallelApp.pdf (692.31 KB)
"Efficient exact algorithms for continuous bi-objective performance-energy optimization of applications with linear energy and monotonically increasing performance profiles on heterogeneous high performance computing platforms",
Concurrency and Computation: Practice and Experience, vol. 35, issue 20: Wiley, pp. 1--19, 09/2023.
Download: Concurrency and Computation - 2022 - Khaleghzadeh - Efficient exact algorithms for continuous bi‐objective.pdf (1.58 MB)
"Multicore processor computing is not energy proportional: An opportunity for bi-objective optimization for energy and performance",
Applied Energy, vol. 268, pp. 18, 06/2020.
Download: paper_r2.pdf (1.38 MB)
"Optimization of Multithreaded Data-parallel Applications on Modern Multicore CPUs For Performance and Energy Using Application-level Decision Variables",
School of Computer Science, Dublin, University College Dublin, pp. 181, 09/2019.
Download: PhD_Thesis_Semyon_Khokhriakov.pdf (8.7 MB); thesis-summary.pdf (621.98 KB)
"Performance Optimization of Multithreaded 2D Fast Fourier Transform on Multicore Processors Using Load Imbalancing Parallel Computing Method",
IEEE Access, vol. 6: IEEE, pp. 64202-64224, 10/2018.
Download: ACCESS2878271.pdf (2.78 MB)
"Performance Optimization of Multithreaded 2D FFT on Multicore Processors: Challenges and Solution Approaches",
IEEE 25th International Conference on High Performance Computing Workshops (HiPCW), Bengaluru, India, IEEE, pp. 8-17, 17-20 Dec, 2018.
Download: paper.pdf (1.33 MB)
"HMPI: Towards a Message-Passing Library for Heterogeneous Networks of Computers",
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), Nice, France, IEEE Computer Society, 22-26 April 2003.
Download: IPDPS2003_HMPI.pdf (144.95 KB)
"Modeling Performance of Many-to-One Collective Communication Operations in Heterogeneous Clusters",
UCD CSI Technical Report 2006-3, 2006.
Download: 1157631827149.pdf (145.83 KB)
"HeteroMPI: Towards a Message-Passing Library for Heterogeneous Networks of Computers",
Journal of Parallel and Distributed Computing, vol. 66, issue 2: Elsevier, pp. 197-220, 2006.
Download: JPDC_HMPI_2006.pdf (349.02 KB)
"A Novel Algorithm of Optimal Matrix Partitioning for Parallel Dense Factorization on Heterogeneous Processors",
Proceedings of the 9th International Conference on Parallel Computing Technologies (PaCT 2007), vol. 4671, Pereslavl-Zalessky, Russia, Springer, pp. 261-275, 3-7 September, 2007.
Download: 1182174947076.pdf (305.56 KB)
"Two-dimensional Matrix Partitioning for Parallel Computing on Heterogeneous Processors Based on their Functional Performance Models",
7th International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2009) , Delft, Netherlands, Lecture Notes in Computer Science, vol. 6043, Springer, pp. 112-121, 25/9/2009, 2010.
Download: heteropar2009-2.pdf (1021.24 KB)
"Scientific Programming for Heterogeneous Systems - Bridging the Gap between Algorithms and Applications",
Proceedings of the 5th International Symposium on Parallel Computing in Electrical Engineering (PARELEC 2006), Bialystok, Poland, IEEE Computer Society Press, pp. 3-8, 13-17 Sept 2006.
Download: 1152191152218.pdf (66.64 KB)
"Adaptive Parallel Computing on Heterogeneous Networks with mpC",
Parallel Computing, vol. 28, issue 10, pp. 1369-1407, 2002.
Download: AdaptParComp_2002.pdf (290.43 KB)
"Model-based optimization of EULAG kernel on Intel Xeon Phi through load imbalancing",
IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 3: IEEE, pp. 787-797, 03/2017.
Download: TPDS_15.pdf (812.34 KB)
"Classification of Partitioning Problems for Networks of Heterogeneous Computers",
Proceedings of the 5th International Conference on Parallel Processing and Applied Mathematics (PPAM 2003), vol. 3019, Czestochowa, Poland, Springer, pp. 921-929, September 7-10, 2003.
Download: PPAM_classification.pdf (75.86 KB)
"Towards a Realistic Performance Model for Networks of Heterogeneous Computers",
Proceedings of IFIP TC5 Workshop, World Computer Congress, August 22-27 2004, Toulouse, France: Springer, pp. 39-58, 2005.
Download: IFIP TC5_Workshop_2005.pdf (583.39 KB)
Parallel Computing on Heterogeneous Networks,
: John Wiley & Sons, pp. 423, 2003.
Download: cover.jpg (227.84 KB)
"A Language and Programming Environment for High-Performance Parallel Computing on Heterogeneous Networks",
Programming and Computer Software, vol. 26, issue 4: Kluwer, pp. 216-236, 2000.
Download: PCS2000.pdf (1.87 MB)
"An Accurate Communication Model of a Heterogeneous Cluster Based on a Switch-Enabled Ethernet Network",
Proceedings of the 12th International Conference on Parallel and Distributed Systems (ICPADS 2006), vol. 2, Minneapolis, Minnesota, USA, IEEE Computer Society Press, pp. 15-20, 12-15 July 2006.
Download: 1153144642501.pdf (136.5 KB)
"mpC: A Multi-Paradigm Programming Language for Massively Parallel Computers",
ACM SIGPLAN Notices, vol. 31, issue 2: ACM, pp. 13-20, 1996.
Download: ACM_SIGPLAN_1996.pdf (846.66 KB)
"MPIBlib: Benchmarking MPI Communications for Parallel Computing on Homogeneous and Heterogeneous Clusters",
15th European PVM/MPI User's Group Meeting, vol. 5205, Dublin, Ireland, Springer-Verlag Berlin Heidelberg, pp. 227-238, September 7-10, 2008.
Download: 52050227.pdf (341.9 KB)
"Data distribution for dense factorization on computers with memory heterogeneity",
Parallel Computing, vol. 33, issue 12, pp. 757-779, 12/2007.
Abstract
Download: sdarticle.pdf (714.34 KB)
An Efficient Procedure for Building the Functional Performance Model of a Processor,
, 2005.
Download: Cluster2005_perf_model.pdf (268.4 KB)
"An Approach to Assessment of Heterogeneous Parallel Algorithms",
Proceedings of the 7th International Conference on Parallel Computing Technologies (PaCT 2003), vol. 2763, Nizhni Novgorod, Russia, pp. 117-129, 15-19 Sept 2003.
Download: PaCT_hetero_analysis.pdf (138.22 KB)
"Data Partitioning for Multiprocessors with Memory Heterogeneity and Memory Constraints",
Scientific Programming, vol. 13, issue 2: IOS Press, pp. 93-112, 2005.
Download: JSP_data_partitioning_2005.pdf (204.98 KB)