Publications

Export 185 results:
Sort by: Author Title Type [ Year  (Desc)]
2019
Lastovetsky, A., M. Fahad, H. Khaleghzadeh, S. Khokhriakov, R. R. Manumachu, A. Shahid, L. Szustak, and R. Wyrzykowski, "How Pre-multicore Methods and Algorithms Perform in Multicore Era", High Performance Computing. ISC High Performance 2018. Lecture Notes in Computer Science, vol 11203, Frankfurt, Springer Nature, pp. 527-539, 24-26 June, 2018, 2019.  Download: nesus-isc-paper.pdf (574.34 KB)
Patton, S., H. Khaleghzadeh, R. R. Manumachu, and A. Lastovetsky, "SummaGen: Parallel Matrix-Matrix Multiplication Based on Non-rectangular Partitions for Heterogeneous HPC Platforms", 28th Heterogeneity in Computing Workshop (HCW 2019), Rio de Janeiro, Brazil, IEEE, 20/05/2019.  Download: hcw2019.pdf (673.25 KB)
Shahid, A., M. Fahad, R. R. Manumachu, and A. Lastovetsky, "Improving the Accuracy of Energy Predictive Models for Multicore CPUs Using Additivity of Performance Monitoring Counters", 15th International Conference on Parallel Computing Technologies (PaCT-2019), Almaty, Kazakhstan, Lecture Notes in Computer Science 11657, Springer, pp. 51-66, 08/2019.  Download: PaCT2019.pdf (370.4 KB)
Beaumont, O., B. Becker, A. DeFlumere, L. Eyraud-Dubois, T. Lambert, and A. Lastovetsky, "Recent Advances in Matrix Partitioning for Parallel Computing on Heterogeneous Platforms", IEEE Transactions on Parallel and Distributed Systems, vol. 30, issue 1: IEEE, pp. 218-229, 01/2019.  Download: recent-advances-matrix.pdf (2.77 MB)
2018
Khokhriakov, S., R. R. Manumachu, and A. Lastovetsky, "Performance Optimization of Multithreaded 2D FFT on Multicore Processors: Challenges and Solution Approaches", IEEE 25th International Conference on High Performance Computing Workshops (HiPCW), Bengaluru, India, IEEE, pp. 8-17, 17-20 Dec, 2018.  Download: paper.pdf (1.33 MB)
Khaleghzadeh, H., R. Reddy, and A. Lastovetsky, "A Novel Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms", IEEE Transactions on Parallel and Distributed Systems, vol. 29, issue 10: IEEE, pp. 2176-2190, 10/2018.  Download: paper_r2.pdf (1.49 MB); tpds2018hpoptasuppl.pdf (3.4 MB)
O'Brien, K., "Application level energy measurements and models for hybrid platform with accelerators", School of Computer Science, Dublin, University College Dublin, pp. 165, 05/2018.  Download: thesis.pdf (1.84 MB)
Khaleghzadeh, H., Z. Zhong, R. Reddy, and A. Lastovetsky, "Out-of-core Implementation for Accelerator Kernels on Heterogeneous Clouds", The Journal of Supercomputing, vol. 74, issue 2, pp. 551-568, 2018.  Download: paper.pdf (762.34 KB)
2017
Lastovetsky, A., L. Szustak, and R. Wyrzykowski, "Model-based optimization of EULAG kernel on Intel Xeon Phi through load imbalancing", IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 3: IEEE, pp. 787-797, 03/2017.  Download: TPDS_15.pdf (812.34 KB)