"Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Dynamic Energy Through Workload Distribution",
17th Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2019), Gottingen, Germany, Lecture Notes in Computer Science, vol. 11997, Springer, 08/2019, 2020.
Download: Khaleghzadeh2020_Chapter_OptimizationOfData-ParallelApp.pdf (692.31 KB)
"A Novel Algorithm for Bi-objective Performance-Energy Optimization of Applications with Continuous Performance and Linear Energy Profiles on Heterogeneous HPC Platforms",
19th Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2021), Lisbon, Portugal, Lecture Notes in Computer Science, vol. 13098, Springer, pp. 166-178, 31/08/2021, 2022.
Download: Khaleghzadeh2022_Chapter_ANovelAlgorithmForBi-objective.pdf (1.07 MB)
"Optimization of Multithreaded Data-parallel Applications on Modern Multicore CPUs For Performance and Energy Using Application-level Decision Variables",
School of Computer Science, Dublin, University College Dublin, pp. 181, 09/2019.
Download: PhD_Thesis_Semyon_Khokhriakov.pdf (8.7 MB); thesis-summary.pdf (621.98 KB)
"Performance Optimization of Multithreaded 2D FFT on Multicore Processors: Challenges and Solution Approaches",
IEEE 25th International Conference on High Performance Computing Workshops (HiPCW), Bengaluru, India, IEEE, pp. 8-17, 17-20 Dec, 2018.
Download: paper.pdf (1.33 MB)
"Multicore processor computing is not energy proportional: An opportunity for bi-objective optimization for energy and performance",
Applied Energy, vol. 268, pp. 18, 06/2020.
Download: paper_r2.pdf (1.38 MB)
"Performance Optimization of Multithreaded 2D Fast Fourier Transform on Multicore Processors Using Load Imbalancing Parallel Computing Method",
IEEE Access, vol. 6: IEEE, pp. 64202-64224, 10/2018.
Download: ACCESS2878271.pdf (2.78 MB)
"Scientific Programming for Heterogeneous Systems - Bridging the Gap between Algorithms and Applications",
Proceedings of the 5th International Symposium on Parallel Computing in Electrical Engineering (PARELEC 2006), Bialystok, Poland, IEEE Computer Society Press, pp. 3-8, 13-17 Sept 2006.
Download: 1152191152218.pdf (66.64 KB)
"Accurate Heterogeneous Communication Models and a Software Tool for their Efficient Estimation",
International Journal of High Performance Computing Applications, vol. 24, issue 1, pp. 34-48, 2010.
Download: IJHPCA_2010.pdf (160.91 KB)
"A Non-Intrusive and Incremental Approach to Enabling Direct Communications in RPC-based Grid Programming Systems",
Computational Science - ICCS 2006: 6th International Conference, Reading, UK, May 28-31, 2006, Proceedings, Part III, vol. 3993: Springer Berlin / Heidelberg, pp. 1008-1011, 2006.
Download: WSES06.pdf (120.33 KB)
High Performance Heterogeneous Computing,
: Wiley, pp. 267, 2009.
Download: High_Performance_Heterogeneous_Computing__Wiley_Series_on_Parallel_and_Distributed_Computing_.pdf (666.28 KB)
"HMPI: Towards a Message-Passing Library for Heterogeneous Networks of Computers",
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), Nice, France, IEEE Computer Society, 22-26 April 2003.
Download: IPDPS2003_HMPI.pdf (144.95 KB)
"Classification of Partitioning Problems for Networks of Heterogeneous Computers",
Proceedings of the 5th International Conference on Parallel Processing and Applied Mathematics (PPAM 2003), vol. 3019, Czestochowa, Poland, Springer, pp. 921-929, September 7-10, 2003.
Download: PPAM_classification.pdf (75.86 KB)
"A Parallel Language and Its Programming System for Heterogeneous Networks",
Concurrency: Practice and Experience, vol. 12, issue 13: Wiley, pp. 1317-1343, 2000.
Download: ParLangandItsProgrSystem_2000.pdf (518.3 KB)
"Data Partitioning with a Functional Performance Model of Heterogeneous Processors",
International Journal of High Performance Computing Applications, vol. 21, issue 1: Sage, pp. 76-90, 2007.
Download: 76.pdf (497.14 KB)
"Towards a Realistic Performance Model for Networks of Heterogeneous Computers",
Proceedings of IFIP TC5 Workshop, World Computer Congress, August 22-27 2004, Toulouse, France: Springer, pp. 39-58, 2005.
Download: IFIP TC5_Workshop_2005.pdf (583.39 KB)
"A Novel Algorithm of Optimal Matrix Partitioning for Parallel Dense Factorization on Heterogeneous Processors",
Proceedings of the 9th International Conference on Parallel Computing Technologies (PaCT 2007), vol. 4671, Pereslavl-Zalessky, Russia, Springer, pp. 261-275, 3-7 September, 2007.
Download: 1182174947076.pdf (305.56 KB)
"Revisiting communication performance models for computational clusters",
IPDPS 2009, Rome, Italy, IEEE, May 25-29, 2009.
Download: Revisiting_12.pdf (200.63 KB)
"Building the Functional Performance Model of a Processor",
Proceedings of the 21st Annual ACM Symposium on Applied Computing (SAC 2006), Dijon, France, ACM, April 23-27 2006.
Download: SAC_2006.pdf (372.24 KB)
"mpC: A Multi-Paradigm Programming Language for Massively Parallel Computers",
ACM SIGPLAN Notices, vol. 31, issue 2: ACM, pp. 13-20, 1996.
Download: ACM_SIGPLAN_1996.pdf (846.66 KB)
"Parallel Testing of Distributed Software",
Information and Software Technology, vol. 47, issue 10: Elsevier, pp. 657-662, 2005.
Download: ParTestSoft_2005.pdf (49.1 KB)
"Model-based optimization of EULAG kernel on Intel Xeon Phi through load imbalancing",
IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 3: IEEE, pp. 787-797, 03/2017.
Download: TPDS_15.pdf (812.34 KB)
An Efficient Procedure for Building the Functional Performance Model of a Processor,
, 2005.
Download: Cluster2005_perf_model.pdf (268.4 KB)
"Two-dimensional Matrix Partitioning for Parallel Computing on Heterogeneous Processors Based on their Functional Performance Models",
7th International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2009) , Delft, Netherlands, Lecture Notes in Computer Science, vol. 6043, Springer, pp. 112-121, 25/9/2009, 2010.
Download: heteropar2009-2.pdf (1021.24 KB)
"Design and implementation of self-adaptable parallel algorithms for scientific computing on highly heterogeneous HPC platforms",
arXiv.org, no. arXiv:1109.3074, 09/2011.
Download: 1109.3074.pdf (1.04 MB)
"A Non-Intrusive and Incremental Approach to Enabling Direct Communications in RPC-based Grid Programming Systems",
Technical Report UCD-CSI-2005-2, pp. 15, 2006.
Download: ucd-csi-2005-2.pdf (295.56 KB)