"Accurate and efficient estimation of parameters of heterogeneous communication performance models",
International Journal of High Performance Computing Applications, vol. 23, issue 2, pp. 123-139, 2009.
Download: 123.pdf (504.5 KB)
"Accurate Energy Modelling of Hybrid Parallel Applications on Modern Heterogeneous Computing Platforms using System-Level Measurements",
IEEE Access, vol. 8, pp. 93793 - 93829, 06/2020.
Download: 09094309.pdf (2.89 MB)
"Accurate Heterogeneous Communication Models and a Software Tool for their Efficient Estimation",
International Journal of High Performance Computing Applications, vol. 24, issue 1, pp. 34-48, 2010.
Download: IJHPCA_2010.pdf (160.91 KB)
"Adaptive Parallel Computing on Heterogeneous Networks with mpC",
Parallel Computing, vol. 28, issue 10, pp. 1369-1407, 2002.
Download: AdaptParComp_2002.pdf (290.43 KB)
"Additivity: A Selection Criterion for Performance Events for Reliable Energy Predictive Modeling",
Supercomputing Frontiers and Innovations, vol. 4, issue 4, pp. 50-65, 12/2017.
Abstract
Download: 153-992-1-PB.pdf (666.73 KB)
"An algebraic approach to semantics of programming languages",
Theoretical Computer Science, vol. 135, issue 2: Elsevier, pp. 267-288, 1994.
Download: TCS2004.pdf (1.19 MB)
"An ANSI C superset for vector and superscalar computers and its retargetable compiler",
The Journal of C Language Translation, vol. 5, issue 3, pp. 183-198, 1994.
Download: 1124709634093.pdf (44.03 KB)
"Asymmetric communication models for resource-constrained hierarchical Ethernet networks",
Concurrency and Computation: Practice and Experience, vol. 27, issue 6: Wiley, pp. 1575-1590, 25/04/2015.
Download: cpe3343.pdf (1.69 MB)
"Automatic tuning to performance modelling of matrix polynomials on multicore and multi-GPU systems",
The Journal of Supercomputing, vol. 73, issue 1, pp. 227-239, 01/2017.
Download: JoS-2016-Murilo.pdf (530.26 KB)
"Bi-Objective Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Performance and Energy Through Workload Distribution",
IEEE Transactions on Parallel and Distributed Systems, vol. 32, issue 3: IEEE, pp. 543-560, 03/2021.
Download: tpds-2021-32-3-09207974.pdf (1.58 MB)
"Bi-Objective Optimization of Data-Parallel Applications on Homogeneous Multicore Clusters for Performance and Energy",
IEEE Transactions on Computers, vol. 67, issue 2: IEEE, pp. 160-177, 02/2018.
Download: paperfinal.pdf (1.16 MB)
"A Comparative Study of Methods for Measurement of Energy of Computing",
Energies, vol. 12, issue 11: MDPI, pp. 42, 06/2019.
"A Comparative Study of Techniques for Energy Predictive Modeling Using Performance Monitoring Counters on Modern Multicore CPUs",
IEEE Access, vol. 8: IEEE, pp. 143306 - 143332, 08/2020.
Download: IEEE-Access-09154439.pdf (2.37 MB)
"Compilation of Vector Statements of C[] Language for Architectures with Multilevel Memory Hierarchy",
Programming and Computer Software, vol. 27, issue 3, pp. 111-122, 2001.
Download: CompilOfVectorExpres_2001.pdf (87.7 KB)
"Data distribution for dense factorization on computers with memory heterogeneity",
Parallel Computing, vol. 33, issue 12, pp. 757-779, 12/2007.
Abstract
Download: sdarticle.pdf (714.34 KB)
"Data Partitioning for Multiprocessors with Memory Heterogeneity and Memory Constraints",
Scientific Programming, vol. 13, issue 2: IOS Press, pp. 93-112, 2005.
Download: JSP_data_partitioning_2005.pdf (204.98 KB)
"Data Partitioning on Multicore and Multi-GPU Platforms Using Functional Performance Models",
IEEE Transactions on Computers, vol. 64, issue 9: IEEE, pp. 2506-2518, 09/2015.
Download: 06975085.pdf (2.08 MB)
"Data Partitioning with a Functional Performance Model of Heterogeneous Processors",
International Journal of High Performance Computing Applications, vol. 21, issue 1: Sage, pp. 76-90, 2007.
Download: 76.pdf (497.14 KB)
"Design of self-adaptable data parallel applications on multicore clusters automatically optimized for performance and energy through load distribution",
Concurrency and Computation: Practice and Experience, vol. 31, issue 4: Wiley, 02/2019.
Download: ccpe2018ravi.pdf (1.67 MB)
"Dynamic Load Balancing of Parallel Computational Iterative Routines on Highly Heterogeneous HPC Platforms",
Parallel Processing Letters, vol. 21, issue 2: World Scientific, pp. 195-217, 06/2011.
Download: DLB_PCIR_HHHP-16.pdf (797.9 KB)
"Effective Solving Scientific Problems on Heterogeneous Networks of Computers with mpC",
Journal of Computational Methods in Science and Engineering, vol. 2, issue 1-2: IOS Press, pp. 135-140, 2002.
"Efficient and Accurate Selection of Optimal Collective Communication Algorithms Using Analytical Performance Modeling",
IEEE Access, vol. 9: IEEE, pp. 109355 - 109373, 07/2021.
Download: Efficient_and_Accurate_Selection_of_Optimal_Collective_Communication_Algorithms_Using_Analytical_Performance_Modeling.pdf (6.95 MB)
"Efficient and reliable network tomography in heterogeneous networks using BitTorrent broadcasts and clustering algorithms",
Scientific Programming, vol. 21, issue 3-4: IOS Press, pp. 79-92, 12/2013.
Download: sci-pro-2013.pdf (702.01 KB)
"Energy Predictive Models of Computing: Theory, Practical Implications and Experimental Analysis on Multicore Processors",
IEEE Access, vol. 9: IEEE, pp. 63149 - 63172, 04/2021.
Download: IEEE_Access_2021_Energy_theory.pdf (2.11 MB)
"Exascale Machines Require New Programming Paradigms and Runtimes",
Supercomputing Frontiers and Innovations, vol. 2, issue 2, pp. 6-27, 09/2015.
Download: 44-301-3-PB.pdf (308.72 KB)