Filters: First Letter Of Last Name is R [Clear All Filters]
"Data Partitioning for Multiprocessors with Memory Heterogeneity and Memory Constraints",
Scientific Programming, vol. 13, issue 2: IOS Press, pp. 93-112, 2005.
Download: JSP_data_partitioning_2005.pdf (204.98 KB)
"Data Partitioning on Multicore and Multi-GPU Platforms Using Functional Performance Models",
IEEE Transactions on Computers, vol. 64, issue 9: IEEE, pp. 2506-2518, 09/2015.
Download: 06975085.pdf (2.08 MB)
"Data Partitioning with a Functional Performance Model of Heterogeneous Processors",
International Journal of High Performance Computing Applications, vol. 21, issue 1: Sage, pp. 76-90, 2007.
Download: 76.pdf (497.14 KB)
"Design of self-adaptable data parallel applications on multicore clusters automatically optimized for performance and energy through load distribution",
Concurrency and Computation: Practice and Experience, vol. 31, issue 4: Wiley, 02/2019.
Download: ccpe2018ravi.pdf (1.67 MB)
"Dynamic Load Balancing of Parallel Computational Iterative Routines on Highly Heterogeneous HPC Platforms",
Parallel Processing Letters, vol. 21, issue 2: World Scientific, pp. 195-217, 06/2011.
Download: DLB_PCIR_HHHP-16.pdf (797.9 KB)
"Efficient and reliable network tomography in heterogeneous networks using BitTorrent broadcasts and clustering algorithms",
Scientific Programming, vol. 21, issue 3-4: IOS Press, pp. 79-92, 12/2013.
Download: sci-pro-2013.pdf (702.01 KB)
"Exascale Machines Require New Programming Paradigms and Runtimes",
Supercomputing Frontiers and Innovations, vol. 2, issue 2, pp. 6-27, 09/2015.
Download: 44-301-3-PB.pdf (308.72 KB)
"Extending τ -Lop to model concurrent MPI communications in multicore clusters",
Future Generation Computer Systems, vol. 61: Elsevier, pp. 66-82, 08/2016.
Download: fgcs2016.pdf (985.73 KB)
"FuPerMod: a software tool for the optimization of data-parallel applications on heterogeneous platforms",
The Journal of Supercomputing, vol. 69, issue 1: Springer US, pp. 61- 69, 2014.
Download: fupermod-jos-2014.pdf (276.83 KB)
"Heterogeneous Computing",
Parallel Computing, vol. 31, issue 7: Elsevier, pp. 649-812, 2005.
Download: HC_2005.pdf (61 KB)
"HeteroMPI: Towards a Message-Passing Library for Heterogeneous Networks of Computers",
Journal of Parallel and Distributed Computing, vol. 66, issue 2: Elsevier, pp. 197-220, 2006.
Download: JPDC_HMPI_2006.pdf (349.02 KB)
"HeteroPBLAS: A Set of Parallel Basic Linear Algebra Subprograms Optimized for Heterogeneous Computational Clusters",
Scalable Computing: Practice and Experience, vol. 10, issue 2, pp. 201-216, 06/2009.
Download: SCPE_10_2_06.pdf (248.74 KB)
"A Hierarchical Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous Multi-Accelerator NUMA Nodes",
IEEE Access, vol. 8: IEEE, pp. 7861 - 7876, 01/2020.
Download: 08933138.pdf (3.4 MB)
"Hierarchical Multicore Thread Mapping via Estimation of Remote Communication",
The Journal of Supercomputing, vol. 74, issue 3: Springer, pp. 1321-1340, 03/2018.
"Model-Based Estimation of the Communication Cost of Hybrid Data-Parallel Applications on Heterogeneous Clusters",
IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 11: IEEE, pp. 3215-3228, 11/2017.
Download: model-based-estimation-tpds-2017.pdf (1.65 MB); model-based-estimation-tpds-2017-supplement.pdf (871.33 KB)
"Model-based selection of optimal MPI broadcast algorithms for multi-core clusters",
Journal of Parallel and Distributed Computing, vol. 165: Elsevier, pp. 1-16, 07/2022.
Download: 1-s2.0-S0743731522000697-main.pdf (988.38 KB)
"Multicore processor computing is not energy proportional: An opportunity for bi-objective optimization for energy and performance",
Applied Energy, vol. 268, pp. 18, 06/2020.
Download: paper_r2.pdf (1.38 MB)
"Network-aware optimization of communications for parallel matrix multiplication on hierarchical HPC platforms",
Concurrency and Computation: Practice and Experience, vol. 28, issue 3: Wiley, pp. 802-821, 03/2016.
Abstract
"New Model-based Methods and Algorithms for Performance and Energy Optimization of Data Parallel Applications on Homogeneous Multicore Clusters",
IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 4: IEEE, pp. 1119-1133, 04/2017.
Download: performance-energy-homo-multicore-clusters.pdf (1.27 MB)
"A novel data partitioning algorithm for dynamic energy optimization on heterogeneous high-performance computing platforms",
Concurrency and Computation: Practice and Experience, vol. 33, issue 21: Wiley, pp. e5928, 07/2020.
Download: CCPE-2020-dynamic-energy.pdf (1.34 MB)
"A Novel Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms",
IEEE Transactions on Parallel and Distributed Systems, vol. 29, issue 10: IEEE, pp. 2176-2190, 10/2018.
Download: paper_r2.pdf (1.49 MB); tpds2018hpoptasuppl.pdf (3.4 MB)
"A Novel Statistical Learning-Based Methodology for Measuring the Goodness of Energy Profiles of Applications Executing on Multicore Computing Platforms",
Energies, vol. 13, issue 15: MDPI, pp. 22, 08/2020.
Download: energies-13-03944.pdf (4.08 MB); supplemental.pdf (188.52 KB)
"On Performance Analysis of Heterogeneous Parallel Algorithms",
Parallel Computing, vol. 30, issue 11, pp. 1195-1216, 2004.
Download: ParCom2004_hetero_perf.pdf (750.84 KB)
"Out-of-core Implementation for Accelerator Kernels on Heterogeneous Clouds",
The Journal of Supercomputing, vol. 74, issue 2, pp. 551-568, 2018.
Download: paper.pdf (762.34 KB)
"Parallel Data Partitioning Algorithms for Optimization of Data-Parallel Applications on Modern Extreme-Scale Multicore Platforms for Performance and Energy",
IEEE Access, vol. 6: IEEE, pp. 69075-69106, 11/2018.
Download: IEEEAccess2018PDPA.pdf (3.04 MB)