Publications

2022
Nuriyev, E., J.-A. Rico-Gallego, and A. Lastovetsky, "Model-based selection of optimal MPI broadcast algorithms for multi-core clusters", Journal of Parallel and Distributed Computing, vol. 165: Elsevier, pp. 1-16, 07/2022.
2020
Khaleghzadeh, H., M. Fahad, R. Reddy, and A. Lastovetsky, "Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Dynamic Energy Through Workload Distribution", 17th Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2019), Göttingen, Germany, Lecture Notes in Computer Science, vol. 11997, Springer, 08/2019, 2020.
2019
Lastovetsky, A., M. Fahad, H. Khaleghzadeh, S. Khokhriakov, R. Reddy, A. Shahid, L. Szustak, and R. Wyrzykowski, "How Pre-multicore Methods and Algorithms Perform in Multicore Era", High Performance Computing. ISC High Performance 2018. Lecture Notes in Computer Science, vol. 11203, Frankfurt, Springer Nature, pp. 527-539, 24-26 June, 2018, 2019.
Patton, S., H. Khaleghzadeh, R. Reddy, and A. Lastovetsky, "SummaGen: Parallel Matrix-Matrix Multiplication Based on Non-rectangular Partitions for Heterogeneous HPC Platforms", 28th Heterogeneity in Computing Workshop (HCW 2019), Rio de Janeiro, Brazil, IEEE, 20/05/2019.
Shahid, A., M. Fahad, R. Reddy, and A. Lastovetsky, "Improving the Accuracy of Energy Predictive Models for Multicore CPUs Using Additivity of Performance Monitoring Counters", 15th International Conference on Parallel Computing Technologies (PaCT-2019), Almaty, Kazakhstan, Lecture Notes in Computer Science 11657, Springer, pp. 51-66, 08/2019.
2018
Khokhriakov, S., R. Reddy, and A. Lastovetsky, "Performance Optimization of Multithreaded 2D FFT on Multicore Processors: Challenges and Solution Approaches", IEEE 25th International Conference on High Performance Computing Workshops (HiPCW), Bengaluru, India, IEEE, pp. 8-17, 17-20 Dec, 2018.
Khaleghzadeh, H., R. Reddy, and A. Lastovetsky, "A Novel Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms", IEEE Transactions on Parallel and Distributed Systems, vol. 29, issue 10: IEEE, pp. 2176-2190, 10/2018.
Khaleghzadeh, H., H. Deldari, R. Reddy, and A. Lastovetsky, "Hierarchical Multicore Thread Mapping via Estimation of Remote Communication", The Journal of Supercomputing, vol. 74, issue 3: Springer, pp. 1321-1340, 03/2018.
Khaleghzadeh, H., Z. Zhong, R. Reddy, and A. Lastovetsky, "Out-of-core Implementation for Accelerator Kernels on Heterogeneous Clouds", The Journal of Supercomputing, vol. 74, issue 2, pp. 551-568, 2018.
2017
Shahid, A., M. Fahad, R. Reddy, and A. Lastovetsky, "Additivity: A Selection Criterion for Performance Events for Reliable Energy Predictive Modeling", Supercomputing Frontiers and Innovations, vol. 4, issue 4, pp. 50-65, 12/2017.