"Accurate and efficient estimation of parameters of heterogeneous communication performance models",
International Journal of High Performance Computing Applications, vol. 23, issue 2, pp. 123-139, 2009.
Download: 123.pdf (504.5 KB)
"Accurate and Reliable Energy Measurement and Modelling of Data Transfer Between CPU and GPU in Parallel Applications on Heterogeneous Hybrid Platforms",
IEEE Transactions on Computers, vol. 74, issue 3, pp. 1011-1024, 03/2025.
Download: Accurate_and_Reliable_Energy_Measurement_and_Modelling_of_Data_Transfer_Between_CPU_and_GPU_in_Parallel_Applications_on_Heterogeneous_Hybrid_Platforms.pdf (853.36 KB); supplemental_r2.pdf (2.05 MB)
"Accurate Energy Modelling of Hybrid Parallel Applications on Modern Heterogeneous Computing Platforms using System-Level Measurements",
IEEE Access, vol. 8, pp. 93793-93829, 06/2020.
Download: 09094309.pdf (2.89 MB)
"Accurate Heterogeneous Communication Models and a Software Tool for their Efficient Estimation",
International Journal of High Performance Computing Applications, vol. 24, issue 1, pp. 34-48, 2010.
Download: IJHPCA_2010.pdf (160.91 KB)
"Adaptive Parallel Computing on Heterogeneous Networks with mpC",
Parallel Computing, vol. 28, issue 10, pp. 1369-1407, 2002.
Download: AdaptParComp_2002.pdf (290.43 KB)
"Additivity: A Selection Criterion for Performance Events for Reliable Energy Predictive Modeling",
Supercomputing Frontiers and Innovations, vol. 4, issue 4, pp. 50-65, 12/2017.
Download: 153-992-1-PB.pdf (666.73 KB)
"An algebraic approach to semantics of programming languages",
Theoretical Computer Science, vol. 135, issue 2: Elsevier, pp. 267-288, 1994.
Download: TCS2004.pdf (1.19 MB)
"An ANSI C superset for vector and superscalar computers and its retargetable compiler",
The Journal of C Language Translation, vol. 5, issue 3, pp. 183-198, 1994.
Download: 1124709634093.pdf (44.03 KB)
"Asymmetric communication models for resource-constrained hierarchical Ethernet networks",
Concurrency and Computation: Practice and Experience, vol. 27, issue 6: Wiley, pp. 1575-1590, 25/04/2015.
Download: cpe3343.pdf (1.69 MB)
"Automatic tuning to performance modelling of matrix polynomials on multicore and multi-GPU systems",
The Journal of Supercomputing, vol. 73, issue 1, pp. 227-239, 01/2017.
Download: JoS-2016-Murilo.pdf (530.26 KB)
"Bi-Objective Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Performance and Energy Through Workload Distribution",
IEEE Transactions on Parallel and Distributed Systems, vol. 32, issue 3: IEEE, pp. 543-560, 03/2021.
Download: tpds-2021-32-3-09207974.pdf (1.58 MB)
"Bi-Objective Optimization of Data-Parallel Applications on Homogeneous Multicore Clusters for Performance and Energy",
IEEE Transactions on Computers, vol. 67, issue 2: IEEE, pp. 160-177, 02/2018.
Download: paperfinal.pdf (1.16 MB)
"A Comparative Study of Methods for Measurement of Energy of Computing",
Energies, vol. 12, issue 11: MDPI, pp. 42, 06/2019.
"A Comparative Study of Techniques for Energy Predictive Modeling Using Performance Monitoring Counters on Modern Multicore CPUs",
IEEE Access, vol. 8: IEEE, pp. 143306-143332, 08/2020.
Download: IEEE-Access-09154439.pdf (2.37 MB)
"Compilation of Vector Statements of C[] Language for Architectures with Multilevel Memory Hierarchy",
Programming and Computer Software, vol. 27, issue 3, pp. 111-122, 2001.
Download: CompilOfVectorExpres_2001.pdf (87.7 KB)
"Data distribution for dense factorization on computers with memory heterogeneity",
Parallel Computing, vol. 33, issue 12, pp. 757-779, 12/2007.
Download: sdarticle.pdf (714.34 KB)
"Data Partitioning for Multiprocessors with Memory Heterogeneity and Memory Constraints",
Scientific Programming, vol. 13, issue 2: IOS Press, pp. 93-112, 2005.
Download: JSP_data_partitioning_2005.pdf (204.98 KB)
"Data Partitioning on Multicore and Multi-GPU Platforms Using Functional Performance Models",
IEEE Transactions on Computers, vol. 64, issue 9: IEEE, pp. 2506-2518, 09/2015.
Download: 06975085.pdf (2.08 MB)
"Data Partitioning with a Functional Performance Model of Heterogeneous Processors",
International Journal of High Performance Computing Applications, vol. 21, issue 1: Sage, pp. 76-90, 2007.
Download: 76.pdf (497.14 KB)
"Design of self-adaptable data parallel applications on multicore clusters automatically optimized for performance and energy through load distribution",
Concurrency and Computation: Practice and Experience, vol. 31, issue 4: Wiley, 02/2019.
Download: ccpe2018ravi.pdf (1.67 MB)
"Dynamic Load Balancing of Parallel Computational Iterative Routines on Highly Heterogeneous HPC Platforms",
Parallel Processing Letters, vol. 21, issue 2: World Scientific, pp. 195-217, 06/2011.
Download: DLB_PCIR_HHHP-16.pdf (797.9 KB)
"Effective Solving Scientific Problems on Heterogeneous Networks of Computers with mpC",
Journal of Computational Methods in Science and Engineering, vol. 2, issue 1-2: IOS Press, pp. 135-140, 2002.
"Efficient and Accurate Selection of Optimal Collective Communication Algorithms Using Analytical Performance Modeling",
IEEE Access, vol. 9: IEEE, pp. 109355-109373, 07/2021.
Download: Efficient_and_Accurate_Selection_of_Optimal_Collective_Communication_Algorithms_Using_Analytical_Performance_Modeling.pdf (6.95 MB)
"Efficient and reliable network tomography in heterogeneous networks using BitTorrent broadcasts and clustering algorithms",
Scientific Programming, vol. 21, issue 3-4: IOS Press, pp. 79-92, 12/2013.
Download: sci-pro-2013.pdf (702.01 KB)
"Efficient exact algorithms for continuous bi-objective performance-energy optimization of applications with linear energy and monotonically increasing performance profiles on heterogeneous high performance computing platforms",
Concurrency and Computation: Practice and Experience, vol. 35, issue 20: Wiley, pp. 1-19, 09/2023.
Download: Concurrency and Computation - 2022 - Khaleghzadeh - Efficient exact algorithms for continuous bi‐objective.pdf (1.58 MB)