"HeteroPBLAS: A Set of Parallel Basic Linear Algebra Subprograms Optimized for Heterogeneous Computational Clusters",
Scalable Computing: Practice and Experience, vol. 10, issue 2, pp. 201-216, 06/2009.
Download: SCPE_10_2_06.pdf (248.74 KB)
"A Hierarchical Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous Multi-Accelerator NUMA Nodes",
IEEE Access, vol. 8: IEEE, pp. 7861-7876, 01/2020.
Download: 08933138.pdf (3.4 MB)
"Hierarchical Multicore Thread Mapping via Estimation of Remote Communication",
The Journal of Supercomputing, vol. 74, issue 3: Springer, pp. 1321-1340, 03/2018.
"Model-Based Estimation of the Communication Cost of Hybrid Data-Parallel Applications on Heterogeneous Clusters",
IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 11: IEEE, pp. 3215-3228, 11/2017.
Download: model-based-estimation-tpds-2017.pdf (1.65 MB); model-based-estimation-tpds-2017-supplement.pdf (871.33 KB)
"Model-based selection of optimal MPI broadcast algorithms for multi-core clusters",
Journal of Parallel and Distributed Computing, vol. 165: Elsevier, pp. 1-16, 07/2022.
Download: 1-s2.0-S0743731522000697-main.pdf (988.38 KB)
"Multicore processor computing is not energy proportional: An opportunity for bi-objective optimization for energy and performance",
Applied Energy, vol. 268, pp. 18, 06/2020.
Download: paper_r2.pdf (1.38 MB)
"Network-aware optimization of communications for parallel matrix multiplication on hierarchical HPC platforms",
Concurrency and Computation: Practice and Experience, vol. 28, issue 3: Wiley, pp. 802-821, 03/2016.
"New Model-based Methods and Algorithms for Performance and Energy Optimization of Data Parallel Applications on Homogeneous Multicore Clusters",
IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 4: IEEE, pp. 1119-1133, 04/2017.
Download: performance-energy-homo-multicore-clusters.pdf (1.27 MB)
"A novel data partitioning algorithm for dynamic energy optimization on heterogeneous high-performance computing platforms",
Concurrency and Computation: Practice and Experience, vol. 33, issue 21: Wiley, pp. e5928, 07/2020.
Download: CCPE-2020-dynamic-energy.pdf (1.34 MB)
"A Novel Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms",
IEEE Transactions on Parallel and Distributed Systems, vol. 29, issue 10: IEEE, pp. 2176-2190, 10/2018.
Download: paper_r2.pdf (1.49 MB); tpds2018hpoptasuppl.pdf (3.4 MB)
"A Novel Statistical Learning-Based Methodology for Measuring the Goodness of Energy Profiles of Applications Executing on Multicore Computing Platforms",
Energies, vol. 13, issue 15: MDPI, pp. 22, 08/2020.
Download: energies-13-03944.pdf (4.08 MB); supplemental.pdf (188.52 KB)
"On Performance Analysis of Heterogeneous Parallel Algorithms",
Parallel Computing, vol. 30, issue 11, pp. 1195-1216, 2004.
Download: ParCom2004_hetero_perf.pdf (750.84 KB)
"Out-of-core Implementation for Accelerator Kernels on Heterogeneous Clouds",
The Journal of Supercomputing, vol. 74, issue 2, pp. 551-568, 2018.
Download: paper.pdf (762.34 KB)
"Parallel Data Partitioning Algorithms for Optimization of Data-Parallel Applications on Modern Extreme-Scale Multicore Platforms for Performance and Energy",
IEEE Access, vol. 6: IEEE, pp. 69075-69106, 11/2018.
Download: IEEEAccess2018PDPA.pdf (3.04 MB)
"Performance Optimization of Multithreaded 2D Fast Fourier Transform on Multicore Processors Using Load Imbalancing Parallel Computing Method",
IEEE Access, vol. 6: IEEE, pp. 64202-64224, 10/2018.
Download: ACCESS2878271.pdf (2.78 MB)
"A Survey of Communication Performance Models for High-Performance Computing",
ACM Computing Surveys, vol. 51, issue 6: ACM, 01/2019.
"A Survey of Power and Energy Predictive Models in HPC Systems and Applications",
ACM Computing Surveys, vol. 50, issue 3: ACM, 10/2017.
Download: surveypowerenergymodelshpc.pdf (578.85 KB)
"A tool to assess the communication cost of parallel kernels on heterogeneous platforms",
The Journal of Supercomputing, vol. 76: Springer, pp. 4629-4644, 06/2020.
"An Approach to Assessment of Heterogeneous Parallel Algorithms",
Proceedings of the 7th International Conference on Parallel Computing Technologies (PaCT 2003), vol. 2763, Nizhni Novgorod, Russia, pp. 117-129, 15-19 Sept 2003.
Download: PaCT_hetero_analysis.pdf (138.22 KB)
"Automatic Assessment of Computer Programs in eLearning Systems",
The 15th Educational Technology Conference of the Irish Learning Technology Association (ILTA), Dublin, Ireland, 29-30 May, 2014.
"Building the Communication Performance Model of Heterogeneous Clusters Based on a Switched Network",
Proceedings of the 2007 IEEE International Conference on Cluster Computing (Cluster 2007), Austin, Texas, USA, IEEE Computer Society, pp. 568-575, September 17-20, 2007.
Download: CL070209.PDF (151.58 KB)
"Building the Functional Performance Model of a Processor",
Proceedings of the 21st Annual ACM Symposium on Applied Computing (SAC 2006), Dijon, France, ACM, April 23-27, 2006.
Download: SAC_2006.pdf (372.24 KB)
"Classification of Partitioning Problems for Networks of Heterogeneous Computers",
Proceedings of the 5th International Conference on Parallel Processing and Applied Mathematics (PPAM 2003), vol. 3019, Czestochowa, Poland, Springer, pp. 921-929, September 7-10, 2003.
Download: PPAM_classification.pdf (75.86 KB)
"Column-Based Matrix Partitioning for Parallel Matrix Multiplication on Heterogeneous Processors Based on Functional Performance Models",
9th International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar'2011), Bordeaux, France, Lecture Notes in Computer Science 7155, Springer, pp. 450-459, August 29, 2011, 2012.
Download: Matrix_Multiplication_Heterogeneous_full.pdf (310.85 KB)
"Communication Models for Resource Constrained Hierarchical Ethernet Networks",
11th International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar'2013), Aachen, Germany, Lecture Notes in Computer Science 8374, Springer, pp. 259-269, 26 August, 2013.
Download: hetPar13-final-08.pdf (614.88 KB)