Filters: First Letter Of Last Name is R [Clear All Filters]
"A Novel Algorithm of Optimal Matrix Partitioning for Parallel Dense Factorization on Heterogeneous Processors",
Proceedings of the 9th International Conference on Parallel Computing Technologies (PaCT 2007), vol. 4671, Pereslavl-Zalessky, Russia, Springer, pp. 261-275, 3-7 September, 2007.
Download: 1182174947076.pdf (305.56 KB)
"New Model-based Methods and Algorithms for Performance and Energy Optimization of Data Parallel Applications on Homogeneous Multicore Clusters",
IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 4: IEEE, pp. 1119-1133, 04/2017.
Download: performance-energy-homo-multicore-clusters.pdf (1.27 MB)
"Network-aware optimization of communications for parallel matrix multiplication on hierarchical HPC platforms",
Concurrency and Computation: Practice and Experience, vol. 28, issue 3: Wiley, pp. 802-821, 03/2016.
Abstract
"Multicore processor computing is not energy proportional: An opportunity for bi-objective optimization for energy and performance",
Applied Energy, vol. 268, pp. 18, 06/2020.
Download: paper_r2.pdf (1.38 MB)
"MPIBlib: Benchmarking MPI Communications for Parallel Computing on Homogeneous and Heterogeneous Clusters",
15th European PVM/MPI User's Group Meeting, vol. 5205, Dublin, Ireland, Springer-Verlag Berlin Heidelberg, pp. 227-238, September 7-10, 2008.
Download: 52050227.pdf (341.9 KB)
"Model-based selection of optimal MPI broadcast algorithms for multi-core clusters",
Journal of Parallel and Distributed Computing, vol. 165: Elsevier, pp. 1-16, 07/2022.
Download: 1-s2.0-S0743731522000697-main.pdf (988.38 KB)
"Model-Based Estimation of the Communication Cost of Hybrid Data-Parallel Applications on Heterogeneous Clusters",
IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 11: IEEE, pp. 3215-3228, 11/2017.
Download: model-based-estimation-tpds-2017.pdf (1.65 MB); model-based-estimation-tpds-2017-supplement.pdf (871.33 KB)
"Improving the Accuracy of Energy Predictive Models for Multicore CPUs Using Additivity of Performance Monitoring Counters",
15th International Conference on Parallel Computing Technologies (PaCT-2019), Almaty, Kazakhstan, Lecture Notes in Computer Science 11657, Springer, pp. 51-66, 08/2019.
Download: PaCT2019.pdf (370.4 KB)
"Improvement of the Bandwidth of Cross-Site MPI Communication Using Optical Fiber",
EuroMPI 2011, vol. 6960, Santorini, Greece, Springer, September 18-21, 2011.
Download: eurompi2011-short-paper.pdf (216.17 KB)
"How Pre-multicore Methods and Algorithms Perform in Multicore Era",
High Performance Computing. ISC High Performance 2018. Lecture Notes in Computer Science, vol 11203, Frankfurt, Springer Nature, pp. 527-539, 24-26 June, 2018, 2019.
Download: nesus-isc-paper.pdf (574.34 KB)
"HMPI: Towards a Message-Passing Library for Heterogeneous Networks of Computers",
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), Nice, France, IEEE Computer Society, 22-26 April 2003.
Download: IPDPS2003_HMPI.pdf (144.95 KB)
"HMPI: A Message-Passing Library for Heterogeneous Networks of Computers",
Computer Science Department, Dublin, University College Dublin, pp. 456, 06/2005.
Download: Ravi_thesis.pdf (1.89 MB)
"Hierarchical Multicore Thread Mapping via Estimation of Remote Communication",
The Journal of Supercomputing, vol. 74, issue 3: Springer, pp. 1321-1340, 03/2018.
"A Hierarchical Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous Multi-Accelerator NUMA Nodes",
IEEE Access, vol. 8: IEEE, pp. 7861 - 7876, 01/2020.
Download: 08933138.pdf (3.4 MB)
"HeteroPBLAS: A Set of Parallel Basic Linear Algebra Subprograms Optimized for Heterogeneous Computational Clusters",
Scalable Computing: Practice and Experience, vol. 10, issue 2, pp. 201-216, 06/2009.
Download: SCPE_10_2_06.pdf (248.74 KB)
"HeteroMPI: Towards a Message-Passing Library for Heterogeneous Networks of Computers",
Journal of Parallel and Distributed Computing, vol. 66, issue 2: Elsevier, pp. 197-220, 2006.
Download: JPDC_HMPI_2006.pdf (349.02 KB)
"HeteroMPI + ScaLAPACK: Towards a ScaLAPACK (Dense Linear Solvers) on Heterogeneous Networks of Computers",
Proceedings of the 13th IEEE International Conference on High Performance Computing (HiPC 2006), vol. 4297, Bangalore, India, Springer, pp. 242-253, 18-21 Dec 2006.
Download: 1161251345946.pdf (202.11 KB)
"Heterogeneous PBLAS: Optimization of PBLAS for Heterogeneous Computational Clusters",
7th International Symposium on Parallel and Distributed Computing, Krakow, Poland, pp. 73-80, Jul 1-5, 2008.
Abstract
Download: ispdc_rreddy_HeteroPBLAS.pdf (268.49 KB)
Heterogeneous PBLAS: A Set of Parallel Basic Linear Algebra Subprograms for Heterogeneous Computational Clusters,
: School of Computer Science and Informatics, University College Dublin, pp. 19, 04/2008.
Abstract
Download: UCD-CSI-2008-2-HeteroPBLAS.pdf (111.71 KB)
"Heterogeneous Computing",
Parallel Computing, vol. 31, issue 7: Elsevier, pp. 649-812, 2005.
Download: HC_2005.pdf (61 KB)
"FuPerMod: a software tool for the optimization of data-parallel applications on heterogeneous platforms",
The Journal of Supercomputing, vol. 69, issue 1: Springer US, pp. 61- 69, 2014.
Download: fupermod-jos-2014.pdf (276.83 KB)
"FuPerMod: A Framework for Optimal Data Partitioning for Parallel Scientific Applications on Dedicated Heterogeneous HPC Platforms",
12th International Conference on Parallel Computing Technologies (PaCT-2013), St. Petersburg, Russia, Lecture Notes in Computer Science 7979, Springer, pp. 182-196, 30 Sept - 4 Oct, 2013.
Download: pact2013-fupermod.pdf (367.48 KB)
"Extending τ -Lop to model concurrent MPI communications in multicore clusters",
Future Generation Computer Systems, vol. 61: Elsevier, pp. 66-82, 08/2016.
Download: fgcs2016.pdf (985.73 KB)
Experimental Study of Six Different Parallel Matrix-Matrix Multiplication Applications for Heterogeneous Computational Clusters of Multicore Processors,
: School of Computer Science and Informatics, University College Dublin, pp. 47, 02/2009.
Abstract
Download: Parallel_matrix_matrix_mutiplication_multicores.pdf (373.08 KB)
"Experimental Study of Six Different Implementations of Parallel Matrix Multiplication on Heterogeneous Computational Clusters of Multicore Processors",
18th Euromicro Conference on Parallel, Distributed and Network-based Processing (PDP 2010), Pisa, Italy, pp. 263-270, Feb 17-19, 2010.
Download: pdp2010.pdf (639.47 KB)