Filters: Author is Reddy [Clear All Filters]
"Programming models and runtimes",
Ultrascale computing systems: IET, 03/2019.
Download: nesus-book-chap2.pdf (3.84 MB); nesus-book-chap2-summary.pdf (123.29 KB)
"Design of self-adaptable data parallel applications on multicore clusters automatically optimized for performance and energy through load distribution",
Concurrency and Computation: Practice and Experience, vol. 31, issue 4: Wiley, 02/2019.
Download: ccpe2018ravi.pdf (1.67 MB)
"A Survey of Communication Performance Models for High-Performance Computing",
ACM Computing Surveys, vol. 51, issue 6: ACM, 01/2019.
"Performance Optimization of Multithreaded 2D FFT on Multicore Processors: Challenges and Solution Approaches",
IEEE 25th International Conference on High Performance Computing Workshops (HiPCW), Bengaluru, India, IEEE, pp. 8-17, 17-20 Dec, 2018.
Download: paper.pdf (1.33 MB)
"Parallel Data Partitioning Algorithms for Optimization of Data-Parallel Applications on Modern Extreme-Scale Multicore Platforms for Performance and Energy",
IEEE Access, vol. 6: IEEE, pp. 69075-69106, 11/2018.
Download: IEEEAccess2018PDPA.pdf (3.04 MB)
"A Novel Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms",
IEEE Transactions on Parallel and Distributed Systems, vol. 29, issue 10: IEEE, pp. 2176-2190, 10/2018.
Download: paper_r2.pdf (1.49 MB); tpds2018hpoptasuppl.pdf (3.4 MB)
"Performance Optimization of Multithreaded 2D Fast Fourier Transform on Multicore Processors Using Load Imbalancing Parallel Computing Method",
IEEE Access, vol. 6: IEEE, pp. 64202-64224, 10/2018.
Download: ACCESS2878271.pdf (2.78 MB)
"Hierarchical Multicore Thread Mapping via Estimation of Remote Communication",
The Journal of Supercomputing, vol. 74, issue 3: Springer, pp. 1321-1340, 03/2018.
"Bi-Objective Optimization of Data-Parallel Applications on Homogeneous Multicore Clusters for Performance and Energy",
IEEE Transactions on Computers, vol. 67, issue 2: IEEE, pp. 160-177, 02/2018.
Download: paperfinal.pdf (1.16 MB)
"Out-of-core Implementation for Accelerator Kernels on Heterogeneous Clouds",
The Journal of Supercomputing, vol. 74, issue 2, pp. 551-568, 2018.
Download: paper.pdf (762.34 KB)
"Additivity: A Selection Criterion for Performance Events for Reliable Energy Predictive Modeling",
Supercomputing Frontiers and Innovations, vol. 4, issue 4, pp. 50-65, 12/2017.
Abstract
Download: 153-992-1-PB.pdf (666.73 KB)
"A Survey of Power and Energy Predictive Models in HPC Systems and Applications",
ACM Computing Surveys, vol. 50, issue 3: ACM, 10/2017.
Download: surveypowerenergymodelshpc.pdf (578.85 KB)
"New Model-based Methods and Algorithms for Performance and Energy Optimization of Data Parallel Applications on Homogeneous Multicore Clusters",
IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 4: IEEE, pp. 1119-1133, 04/2017.
Download: performance-energy-homo-multicore-clusters.pdf (1.27 MB)
"Design and implementation of self-adaptable parallel algorithms for scientific computing on highly heterogeneous HPC platforms",
arXiv.org, no. arXiv:1109.3074, 09/2011.
Download: 1109.3074.pdf (1.04 MB)
"Experimental Study of Six Different Implementations of Parallel Matrix Multiplication on Heterogeneous Computational Clusters of Multicore Processors",
18th Euromicro Conference on Parallel, Distributed and Network-based Processing (PDP 2010), Pisa, Italy, pp. 263-270, Feb 17-19, 2010.
Download: pdp2010.pdf (639.47 KB)
"Distributed Data Partitioning for Heterogeneous Processors Based on Partial Estimation of their Functional Performance Models",
7th International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2009) , Delft, Netherlands, Lecture Notes in Computer Science, vol. 6043, Springer, pp. 91-101, 25/9/2009, 2010.
Download: heteropar2009-1.pdf (1.21 MB)
"Two-dimensional Matrix Partitioning for Parallel Computing on Heterogeneous Processors Based on their Functional Performance Models",
7th International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2009) , Delft, Netherlands, Lecture Notes in Computer Science, vol. 6043, Springer, pp. 112-121, 25/9/2009, 2010.
Download: heteropar2009-2.pdf (1021.24 KB)
"Parallel Solvers for Dense Linear Systems for Heterogeneous Computational Clusters",
The 23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS 2009), Rome, Italy, IEEE Computer Society, May 25-29, 2009.
Download: pdsec_linear_solvers_hcc_cr.pdf (252.69 KB)
"HeteroPBLAS: A Set of Parallel Basic Linear Algebra Subprograms Optimized for Heterogeneous Computational Clusters",
Scalable Computing: Practice and Experience, vol. 10, issue 2, pp. 201-216, 06/2009.
Download: SCPE_10_2_06.pdf (248.74 KB)
Experimental Study of Six Different Parallel Matrix-Matrix Multiplication Applications for Heterogeneous Computational Clusters of Multicore Processors,
: School of Computer Science and Informatics, University College Dublin, pp. 47, 02/2009.
Abstract
Download: Parallel_matrix_matrix_mutiplication_multicores.pdf (373.08 KB)
"Heterogeneous PBLAS: Optimization of PBLAS for Heterogeneous Computational Clusters",
7th International Symposium on Parallel and Distributed Computing, Krakow, Poland, pp. 73-80, Jul 1-5, 2008.
Abstract
Download: ispdc_rreddy_HeteroPBLAS.pdf (268.49 KB)
"Scalable Dense Factorizations for Heterogeneous Computational Clusters",
7th International Symposium on Parallel and Distributed Computing, Krakow, Poland, pp. 49-56, Jul 1-5, 2008.
Abstract
Download: ispdc_rreddy_scalable_factorizations.pdf (262.74 KB)
Heterogeneous PBLAS: A Set of Parallel Basic Linear Algebra Subprograms for Heterogeneous Computational Clusters,
: School of Computer Science and Informatics, University College Dublin, pp. 19, 04/2008.
Abstract
Download: UCD-CSI-2008-2-HeteroPBLAS.pdf (111.71 KB)
"A Novel Algorithm of Optimal Matrix Partitioning for Parallel Dense Factorization on Heterogeneous Processors",
Proceedings of the 9th International Conference on Parallel Computing Technologies (PaCT 2007), vol. 4671, Pereslavl-Zalessky, Russia, Springer, pp. 261-275, 3-7 September, 2007.
Download: 1182174947076.pdf (305.56 KB)
"Data distribution for dense factorization on computers with memory heterogeneity",
Parallel Computing, vol. 33, issue 12, pp. 757-779, 12/2007.
Abstract
Download: sdarticle.pdf (714.34 KB)