"Model-based optimization of EULAG kernel on Intel Xeon Phi through load imbalancing",
IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 3: IEEE, pp. 787-797, 03/2017.
Download: TPDS_15.pdf (812.34 KB)
"Hierarchical redesign of classic MPI reduction algorithms",
The Journal of Supercomputing, vol. 73, issue 2: Springer, pp. 713-725, 02/2017.
Download: TJS-Hasanov-2016.pdf (593.41 KB)
"Automatic tuning to performance modelling of matrix polynomials on multicore and multi-GPU systems",
The Journal of Supercomputing, vol. 73, issue 1, pp. 227-239, 01/2017.
Download: JoS-2016-Murilo.pdf (530.26 KB)
"Network-Aware Optimization of MPDATA on Homogeneous Multi-core Clusters with Heterogeneous Network",
ICA3PP 2016 Workshops, Granada, Spain, Lecture Notes in Computer Science 10049, Springer, pp. 30-42, 14-16 Dec 2016.
Download: tapems2016.pdf (357.28 KB)
"Topology-aware Optimization of Communication Cost of Parallel Applications in Heterogeneous HPC Systems",
School of Computer Science, Dublin, University College Dublin, pp. 106, 09/2016.
Download: thesis.pdf (1 MB)
"Extending τ -Lop to model concurrent MPI communications in multicore clusters",
Future Generation Computer Systems, vol. 61: Elsevier, pp. 66-82, 08/2016.
Download: fgcs2016.pdf (985.73 KB)
"Network-aware optimization of communications for parallel matrix multiplication on hierarchical HPC platforms",
Concurrency and Computation: Practice and Experience, vol. 28, issue 3: Wiley, pp. 802-821, 03/2016.
Abstract
"Hierarchical Optimization of MPI Reduce Algorithms",
13th International Conference on Parallel Computing Technologies (PaCT-2015), Petrozavodsk, Russia, Lecture Notes in Computer Science 9251, Springer, pp. 21-34, 31 Aug - 4 Sept, 2015.
Download: pact2015reduce.pdf (812.37 KB)
"Towards Application Energy Measurement and Modelling Tool Support",
13th International Conference on Parallel Computing Technologies (PaCT-2015), Petrozavodsk, Russia, Lecture Notes in Computer Science 9251, Springer, pp. 91-101, 31 Aug - 4 Sept, 2015.
Download: pact2015energy.pdf (383.55 KB)
"Asymmetric communication models for resource-constrained hierarchical Ethernet networks",
Concurrency and Computation: Practice and Experience, vol. 27, issue 6: Wiley, pp. 1575-1590, 25/04/2015.
Download: cpe3343.pdf (1.69 MB)
"Hierarchical Approach to Optimization of Parallel Matrix Multiplication on Large-Scale Platforms",
The Journal of Supercomputing, vol. 71, issue 11: Springer, pp. 3991-4014, 11/2015.
Download: JoS 2014 hierarchical matrix multiplication.pdf (1.3 MB)
"Topology-Oblivious Optimization of MPI Broadcast Algorithms on Extreme-Scale Platforms",
Simulation Modelling Practice and Theory, vol. 58: Elsevier, pp. 30-39, 11/2015.
Download: simpat2015.pdf (1.63 MB)
"Hierarchical Approach to Optimization of MPI Collective Communication Algorithms",
School of Computer Science, Dublin, University College Dublin, pp. 152, 10/2015.
Download: khalid-thesis-oct-2015.pdf (1.1 MB)
"Data Partitioning on Multicore and Multi-GPU Platforms Using Functional Performance Models",
IEEE Transactions on Computers, vol. 64, issue 9: IEEE, pp. 2506-2518, 09/2015.
Download: 06975085.pdf (2.08 MB)
"Exascale Machines Require New Programming Paradigms and Runtimes",
Supercomputing Frontiers and Innovations, vol. 2, issue 2, pp. 6-27, 09/2015.
Download: 44-301-3-PB.pdf (308.72 KB)
"Acceleration of MPI Mechanisms for Sustainable HPC Applications",
Supercomputing Frontiers and Innovations, vol. 2, issue 2, pp. 28-45, 2015.
Download: 35-302-3-PB.pdf (464.88 KB)
"Optimizations to enhance sustainability of MPI applications",
EuroMPI/ASIA '14, Kyoto, Japan, ACM, 9-12 September, 2014.
Download: p145-carretero.pdf (351.18 KB)
"Automatic Assessment of Computer Programs in eLearning Systems",
The 15th Educational Technology Conference of the Irish Learning Technology Association (ILTA), Dublin, Ireland, 29-30 May, 2014.
"High-Level Topology-Oblivious Optimization of MPI Broadcast Algorithms on Extreme-Scale Platforms",
Euro-Par 2014: Parallel Processing Workshops, Vol. 8806 of Lecture Notes in Computer Science, Porto, Portugal, Springer, pp. 413-425, 25-29 August, 2014.
Download: tasus_2014.pdf (280.75 KB)
"Optimal Data Partitioning Shape for Matrix Multiplication on Three Fully Connected Heterogeneous Processors",
12th International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar'2014), Porto, Portugal, 25 August, 2014.
Download: heteropar2014.pdf (772.84 KB)
"Searching for the Optimal Data Partitioning Shape for Parallel Matrix Matrix Multiplication on 3 Heterogenous Processors",
23rd International Heterogeneity in Computing Workshop (HCW 2014), Phoenix, Arizona, USA, IEEE Computer Society, 19 May, 2014.
Download: HCW-2014-05.pdf (510.38 KB)
"Topology-aware Optimization of Communications for Parallel Matrix Multiplication on Hierarchical Heterogeneous HPC Platforms",
23rd International Heterogeneity in Computing Workshop (HCW 2014), Phoenix, Arizona, USA, IEEE Computer Society, 19 May, 2014.
Download: HCW-2014-09.pdf (294.81 KB)
"Heterogeneous Parallel Computing: from Clusters of Workstations to Hierarchical Hybrid Platforms",
Supercomputing Frontiers and Innovations, vol. 1, issue 3, pp. 70-87, 12/2014.
Download: 32-140-2-PB.pdf (747.18 KB)
"Optimal Partitioning for Parallel Matrix Computation on a Small Number of Abstract Heterogeneous Processors",
School of Computer Science and Informatics, Dublin, University College Dublin, pp. 161, 09/2014.
Abstract
Download: AshleyPhDThesis.pdf (3.22 MB)
"Using Static Code Analysis for Improvement of Programmability and Performance of GridRPC-Based Applications",
School of Computer Science and Informatics, Dublin, University College Dublin, pp. 110, 09/2014.
Abstract
Download: oleg_thesis.pdf (583.17 KB)