"Energy Predictive Models of Computing: Theory, Practical Implications and Experimental Analysis on Multicore Processors",
IEEE Access, vol. 9: IEEE, pp. 63149 - 63172, 04/2021.
Download: IEEE_Access_2021_Energy_theory.pdf (2.11 MB)
"Energy-Efficient Parallel Computing: Challenges to Scaling",
Information, vol. 14, issue 4, pp. 1--29, 04/2023.
Download: information-14-00248.pdf (1.53 MB)
"Exascale Machines Require New Programming Paradigms and Runtimes",
Supercomputing Frontiers and Innovations, vol. 2, issue 2, pp. 6-27, 09/2015.
Download: 44-301-3-PB.pdf (308.72 KB)
"Extending τ -Lop to model concurrent MPI communications in multicore clusters",
Future Generation Computer Systems, vol. 61: Elsevier, pp. 66-82, 08/2016.
Download: fgcs2016.pdf (985.73 KB)
"Extension of ANSI C for vector and superscalar computers",
Programming and Computer Software, vol. 21, issue 1: Kluwer, pp. 17-25, 1995.
"FuPerMod: a software tool for the optimization of data-parallel applications on heterogeneous platforms",
The Journal of Supercomputing, vol. 69, issue 1: Springer US, pp. 61- 69, 2014.
Download: fupermod-jos-2014.pdf (276.83 KB)
"Heterogeneity in parallel and distributed computing",
Journal of Parallel and Distributed Computing, vol. 73, issue 12, pp. 1523-1524, 2013.
Download: jpdc-2013.pdf (152.05 KB)
"Heterogeneous Computing",
Parallel Computing, vol. 31, issue 7: Elsevier, pp. 649-812, 2005.
Download: HC_2005.pdf (61 KB)
"Heterogeneous Distribution of Computations Solving Linear Algebra Problems on Networks of Heterogeneous Computers",
Journal of Parallel and Distributed Computing, vol. 61, issue 4: Academic Press, pp. 520-535, 2001.
Download: SolvinLinearAlgebra_2001.pdf (229.46 KB)
"Heterogeneous Parallel Computing: from Clusters of Workstations to Hierarchical Hybrid Platforms",
Supercomputing Frontiers and Innovations, vol. 1, issue 3, pp. 70-87, 12/2014.
Download: 32-140-2-PB.pdf (747.18 KB)
"HeteroMPI: Towards a Message-Passing Library for Heterogeneous Networks of Computers",
Journal of Parallel and Distributed Computing, vol. 66, issue 2: Elsevier, pp. 197-220, 2006.
Download: JPDC_HMPI_2006.pdf (349.02 KB)
"HeteroPBLAS: A Set of Parallel Basic Linear Algebra Subprograms Optimized for Heterogeneous Computational Clusters",
Scalable Computing: Practice and Experience, vol. 10, issue 2, pp. 201-216, 06/2009.
Download: SCPE_10_2_06.pdf (248.74 KB)
"Hierarchical Approach to Optimization of Parallel Matrix Multiplication on Large-Scale Platforms",
The Journal of Supercomputing, vol. 71, issue 11: Springer, pp. 3991-4014, 11/2015.
Download: JoS 2014 hierarchical matrix multiplication.pdf (1.3 MB)
"A Hierarchical Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous Multi-Accelerator NUMA Nodes",
IEEE Access, vol. 8: IEEE, pp. 7861 - 7876, 01/2020.
Download: 08933138.pdf (3.4 MB)
"Hierarchical Multicore Thread Mapping via Estimation of Remote Communication",
The Journal of Supercomputing, vol. 74, issue 3: Springer, pp. 1321-1340, 03/2018.
"Hierarchical redesign of classic MPI reduction algorithms",
The Journal of Supercomputing, vol. 73, issue 2: Springer, pp. 713-725, 02/2017.
Download: TJS-Hasanov-2016.pdf (593.41 KB)
"Improving the accuracy of energy predictive models for multicore CPUs by combining utilization and performance events model variables",
Journal of Parallel and Distributed Computing, vol. 151: Elsevier, pp. 38-51, 05/2021.
Download: jpdc-2021-151.pdf (1.4 MB)
"A Language and Programming Environment for High-Performance Parallel Computing on Heterogeneous Networks",
Programming and Computer Software, vol. 26, issue 4: Kluwer, pp. 216-236, 2000.
Download: PCS2000.pdf (1.87 MB)
"A Language Approach to High Performance Computing on Heterogeneous Networks",
Parallel and Distributed Computing Practices, vol. 2, issue 3: Nova Science Publishers, pp. 87-96, 1999.
"A Language Approach to High Performance Computing on Heterogeneous Networks",
Progress in Computer Research, vol. 2: Nova Science Publishers, pp. 215-226, 2001.
"Model-Based Estimation of the Communication Cost of Hybrid Data-Parallel Applications on Heterogeneous Clusters",
IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 11: IEEE, pp. 3215-3228, 11/2017.
Download: model-based-estimation-tpds-2017.pdf (1.65 MB); model-based-estimation-tpds-2017-supplement.pdf (871.33 KB)
"Model-based optimization of EULAG kernel on Intel Xeon Phi through load imbalancing",
IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 3: IEEE, pp. 787-797, 03/2017.
Download: TPDS_15.pdf (812.34 KB)
"Model-based selection of optimal MPI broadcast algorithms for multi-core clusters",
Journal of Parallel and Distributed Computing, vol. 165: Elsevier, pp. 1-16, 07/2022.
Download: 1-s2.0-S0743731522000697-main.pdf (988.38 KB)
"mpC: A Multi-Paradigm Programming Language for Massively Parallel Computers",
ACM SIGPLAN Notices, vol. 31, issue 2: ACM, pp. 13-20, 1996.
Download: ACM_SIGPLAN_1996.pdf (846.66 KB)
"MPI Support in the Grid",
Journal of Computing and Informatics, vol. 27, no. 2, pp. 213–222, 2008.