"A Novel Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms",
IEEE Transactions on Parallel and Distributed Systems, vol. 29, issue 10: IEEE, pp. 2176-2190, 10/2018.
Download: paper_r2.pdf (1.49 MB); tpds2018hpoptasuppl.pdf (3.4 MB)
"Out-of-core Implementation for Accelerator Kernels on Heterogeneous Clouds",
The Journal of Supercomputing, vol. 74, issue 2, pp. 551-568, 2018.
Download: paper.pdf (762.34 KB)
"Optimization of Multithreaded Data-parallel Applications on Modern Multicore CPUs For Performance and Energy Using Application-level Decision Variables",
School of Computer Science, Dublin, University College Dublin, pp. 181, 09/2019.
Download: PhD_Thesis_Semyon_Khokhriakov.pdf (8.7 MB); thesis-summary.pdf (621.98 KB)
"Performance Optimization of Multithreaded 2D FFT on Multicore Processors: Challenges and Solution Approaches",
IEEE 25th International Conference on High Performance Computing Workshops (HiPCW), Bengaluru, India, IEEE, pp. 8-17, 17-20 Dec, 2018.
Download: paper.pdf (1.33 MB)
"Multicore processor computing is not energy proportional: An opportunity for bi-objective optimization for energy and performance",
Applied Energy, vol. 268, pp. 18, 06/2020.
Download: paper_r2.pdf (1.38 MB)
"Performance Optimization of Multithreaded 2D Fast Fourier Transform on Multicore Processors Using Load Imbalancing Parallel Computing Method",
IEEE Access, vol. 6: IEEE, pp. 64202-64224, 10/2018.
Download: ACCESS2878271.pdf (2.78 MB)
"A Performance Model of Many-to-One Collective Communications for Parallel Computing",
Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), Long Beach, California, USA, IEEE Computer Society, 26-30 March 2007.
Download: 1175591678074.pdf (335.97 KB)
"Parallel Testing of Distributed Software",
Information and Software Technology, vol. 47, issue 10: Elsevier, pp. 657-662, 2005.
Download: ParTestSoft_2005.pdf (49.1 KB)
"Model-Based Optimization of MPI Collective Operations for Computational Clusters",
EuroPVM/MPI 2009, vol. 5759, Espoo, Finland, pp. 4-5, Sep 7-10, 2009.
Download: 57590004.pdf (26.19 KB)
"Heterogeneity in parallel and distributed computing",
Journal of Parallel and Distributed Computing, vol. 73, issue 12, pp. 1523-1524, 2013.
Download: jpdc-2013.pdf (152.05 KB)
"How Pre-multicore Methods and Algorithms Perform in Multicore Era",
High Performance Computing. ISC High Performance 2018. Lecture Notes in Computer Science, vol 11203, Frankfurt, Springer Nature, pp. 527-539, 24-26 June, 2018, 2019.
Download: nesus-isc-paper.pdf (574.34 KB)
"Revisiting communication performance models for computational clusters",
IPDPS 2009, Rome, Italy, IEEE, May 25-29, 2009.
Download: Revisiting_12.pdf (200.63 KB)
"A Variable Group Block Distribution Strategy for Dense Factorizations on Networks of Heterogeneous Computers",
Proceedings of the 6th International Conference on Parallel Processing and Applied Mathematics (PPAM 2005), vol. 3911, Poznan, Poland, Springer, 11-14 Sept 2005.
Download: PPAM_HPC_Hetero_LU_2005.pdf (79.43 KB)
High Performance Heterogeneous Computing,
: Wiley, pp. 267, 2009.
Download: High_Performance_Heterogeneous_Computing__Wiley_Series_on_Parallel_and_Distributed_Computing_.pdf (666.28 KB)
"Data Partitioning with a Realistic Performance Model of Networks of Heterogeneous Computers with Task Size Limits",
Proceedings of the Third International Symposium on Parallel and Distributed Computing/Third International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Networks (ISPDC/HeteroPar'04), Cork, Ireland, IEEE Computer Society Press, pp. 133-140, 5-7 July 2004.
Download: ISPDC_data_partitioning.pdf (93.4 KB)
"A Parallel Language and Its Programming System for Heterogeneous Networks",
Concurrency: Practice and Experience, vol. 12, issue 13: Wiley, pp. 1317-1343, 2000.
Download: ParLangandItsProgrSystem_2000.pdf (518.3 KB)
"A Software Tool for Accurate Estimation of Parameters of Heterogeneous Communication Models",
15th European PVM/MPI User's Group Meeting, vol. 5205, Dublin, Ireland, Springer-Verlag Berlin Heidelberg, pp. 43-54, September 7-10, 2008.
Download: 52050043.pdf (352.98 KB)
"Accurate and efficient estimation of parameters of heterogeneous communication performance models",
International Journal of High Performance Computing Applications, vol. 23, issue 2, pp. 123-139, 2009.
Download: 123.pdf (504.5 KB)
"Building the Functional Performance Model of a Processor",
Proceedings of the 21st Annual ACM Symposium on Applied Computing (SAC 2006), Dijon, France, ACM, April 23-27 2006.
Download: SAC_2006.pdf (372.24 KB)
"On Performance Analysis of Heterogeneous Parallel Algorithms",
Parallel Computing, vol. 30, issue 11, pp. 1195-1216, 2004.
Download: ParCom2004_hetero_perf.pdf (750.84 KB)
"Data Partitioning with a Functional Performance Model of Heterogeneous Processors",
International Journal of High Performance Computing Applications, vol. 21, issue 1: Sage, pp. 76-90, 2007.
Download: 76.pdf (497.14 KB)
"The 27th International Heterogeneity in Computing Workshop and the 16th International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms",
Concurrency and Computation: Practice and Experience, vol. 32, issue 15: Wiley, pp. 3, 03/2020.
Download: cpe.5736.pdf (169.99 KB)
"HeteroMPI: Towards a Message-Passing Library for Heterogeneous Networks of Computers",
Journal of Parallel and Distributed Computing, vol. 66, issue 2: Elsevier, pp. 197-220, 2006.
Download: JPDC_HMPI_2006.pdf (349.02 KB)
"Distributed Data Partitioning for Heterogeneous Processors Based on Partial Estimation of their Functional Performance Models",
7th International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2009) , Delft, Netherlands, Lecture Notes in Computer Science, vol. 6043, Springer, pp. 91-101, 25/9/2009, 2010.
Download: heteropar2009-1.pdf (1.21 MB)
"A Non-Intrusive and Incremental Approach to Enabling Direct Communications in RPC-based Grid Programming Systems",
Technical Report UCD-CSI-2005-2, pp. 15, 2006.
Download: ucd-csi-2005-2.pdf (295.56 KB)