Publications

2019
Lastovetsky, A., M. Fahad, H. Khaleghzadeh, S. Khokhriakov, R. R. Manumachu, A. Shahid, L. Szustak, and R. Wyrzykowski, "How Pre-multicore Methods and Algorithms Perform in Multicore Era", High Performance Computing. ISC High Performance 2018. Lecture Notes in Computer Science, vol. 11203, Frankfurt, Springer Nature, pp. 527-539, 24-26 June, 2018, 2019.  Download: nesus-isc-paper.pdf (574.34 KB)
Beaumont, O., B. Becker, A. DeFlumere, L. Eyraud-Dubois, T. Lambert, and A. Lastovetsky, "Recent Advances in Matrix Partitioning for Parallel Computing on Heterogeneous Platforms", IEEE Transactions on Parallel and Distributed Systems, vol. 30, issue 1: IEEE, pp. 218-229, 01/2019.  Download: recent-advances-matrix.pdf (2.77 MB)
2018
Khokhriakov, S., R. R. Manumachu, and A. Lastovetsky, "Performance Optimization of Multithreaded 2D FFT on Multicore Processors: Challenges and Solution Approaches", IEEE 25th International Conference on High Performance Computing Workshops (HiPCW), Bengaluru, India, IEEE, pp. 8-17, 17-20 Dec, 2018.  Download: paper.pdf (1.33 MB)
Khaleghzadeh, H., R. Reddy, and A. Lastovetsky, "A Novel Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms", IEEE Transactions on Parallel and Distributed Systems, vol. 29, issue 10: IEEE, pp. 2176-2190, 10/2018.  Download: paper_r2.pdf (1.49 MB); tpds2018hpoptasuppl.pdf (3.4 MB)
O'Brien, K., "Application level energy measurements and models for hybrid platform with accelerators", School of Computer Science, Dublin, University College Dublin, pp. 165, 05/2018.  Download: thesis.pdf (1.84 MB)
Khaleghzadeh, H., Z. Zhong, R. Reddy, and A. Lastovetsky, "Out-of-core Implementation for Accelerator Kernels on Heterogeneous Clouds", The Journal of Supercomputing, vol. 74, issue 2, pp. 551-568, 2018.  Download: paper.pdf (762.34 KB)
2017
Lastovetsky, A., L. Szustak, and R. Wyrzykowski, "Model-based optimization of EULAG kernel on Intel Xeon Phi through load imbalancing", IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 3: IEEE, pp. 787-797, 03/2017.  Download: TPDS_15.pdf (812.34 KB)
Hasanov, K., and A. Lastovetsky, "Hierarchical redesign of classic MPI reduction algorithms", The Journal of Supercomputing, vol. 73, issue 2: Springer, pp. 713-725, 02/2017.  Download: TJS-Hasanov-2016.pdf (593.41 KB)
2016
Malik, T., L. Szustak, R. Wyrzykowski, and A. Lastovetsky, "Network-Aware Optimization of MPDATA on Homogeneous Multi-core Clusters with Heterogeneous Network", ICA3PP 2016 Workshops, Granada, Spain, Lecture Notes in Computer Science 10049, Springer, pp. 30-42, 14-16 Dec 2016.  Download: tapems2016.pdf (357.28 KB)
Malik, T., "Topology-aware Optimization of Communication Cost of Parallel Applications in Heterogeneous HPC Systems", School of Computer Science, Dublin, University College Dublin, pp. 106, 09/2016.  Download: thesis.pdf (1 MB)
Rico-Gallego, J.-A., J.-C. Díaz-Martín, and A. Lastovetsky, "Extending τ-Lop to model concurrent MPI communications in multicore clusters", Future Generation Computer Systems, vol. 61: Elsevier, pp. 66-82, 08/2016.  Download: fgcs2016.pdf (985.73 KB)