Ravi Reddy Manumachu
"Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Dynamic Energy Through Workload Distribution",
Euro-Par 2019 Workshops, Lecture Notes in Computer Science, vol. 11997, Gottingen, Germany, Springer, 08/2019, 2020.
Download: Khaleghzadeh2020_Chapter_OptimizationOfData-ParallelApp.pdf (692.31 KB)
"Accurate Energy Modelling of Hybrid Parallel Applications on Modern Heterogeneous Computing Platforms using System-Level Measurements",
IEEE Access, vol. 8, pp. 93793 - 93829, 06/2020.
Download: 09094309.pdf (2.89 MB)
"Multicore processor computing is not energy proportional: An opportunity for bi-objective optimization for energy and performance",
Applied Energy, vol. 268, pp. 18, 06/2020.
Download: paper_r2.pdf (1.38 MB)
"A Hierarchical Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous Multi-Accelerator NUMA Nodes",
IEEE Access, vol. 8: IEEE, pp. 7861 - 7876, 01/2020.
Download: 08933138.pdf (3.4 MB)
"How Pre-multicore Methods and Algorithms Perform in Multicore Era",
High Performance Computing. ISC High Performance 2018. Lecture Notes in Computer Science, vol 11203, Frankfurt, Springer Nature, pp. 527-539, 24-26 June, 2018, 2019.
Download: nesus-isc-paper.pdf (574.34 KB)
"SummaGen: Parallel Matrix-Matrix Multiplication Based on Non-rectangular Partitions for Heterogeneous HPC Platforms",
28th Heterogeneity in Computing Workshop (HCW 2019), Rio de Janeiro, Brazil, IEEE, 20/05/2019.
Download: hcw2019.pdf (673.25 KB)
"Optimization of Multithreaded Data-parallel Applications on Modern Multicore CPUs For Performance and Energy Using Application-level Decision Variables",
School of Computer Science, Dublin, University College Dublin, pp. 181, 09/2019.
Download: PhD_Thesis_Semyon_Khokhriakov.pdf (8.7 MB); thesis-summary.pdf (621.98 KB)
"Improving the Accuracy of Energy Predictive Models for Multicore CPUs Using Additivity of Performance Monitoring Counters",
15th International Conference on Parallel Computing Technologies (PaCT-2019), Almaty, Kazakhstan, Lecture Notes in Computer Science 11657, Springer, pp. 51-66, 08/2019.
Download: PaCT2019.pdf (370.4 KB)
"A Comparative Study of Methods for Measurement of Energy of Computing",
Energies, vol. 12, issue 11: MDPI, pp. 42, 06/2019.
"Energy aware ultrascale systems",
Ultrascale computing systems: IET, 03/2019.
Download: chap5.pdf (3.2 MB)
"Programming models and runtimes",
Ultrascale computing systems: IET, 03/2019.
Download: nesus-book-chap2.pdf (3.84 MB); nesus-book-chap2-summary.pdf (123.29 KB)
"Design of self-adaptable data parallel applications on multicore clusters automatically optimized for performance and energy through load distribution",
Concurrency and Computation: Practice and Experience, vol. 31, issue 4: Wiley, 02/2019.
Download: ccpe2018ravi.pdf (1.67 MB)
"A Survey of Communication Performance Models for High-Performance Computing",
ACM Computing Surveys, vol. 51, issue 6: ACM, 01/2019.
"Performance Optimization of Multithreaded 2D FFT on Multicore Processors: Challenges and Solution Approaches",
IEEE 25th International Conference on High Performance Computing Workshops (HiPCW), Bengaluru, India, IEEE, pp. 8-17, 17-20 Dec, 2018.
Download: paper.pdf (1.33 MB)
"Parallel Data Partitioning Algorithms for Optimization of Data-Parallel Applications on Modern Extreme-Scale Multicore Platforms for Performance and Energy",
IEEE Access, vol. 6: IEEE, pp. 69075-69106, 11/2018.
Download: IEEEAccess2018PDPA.pdf (3.04 MB)
"A Novel Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms",
IEEE Transactions on Parallel and Distributed Systems, vol. 29, issue 10: IEEE, pp. 2176-2190, 10/2018.
Download: paper_r2.pdf (1.49 MB); tpds2018hpoptasuppl.pdf (3.4 MB)
"Performance Optimization of Multithreaded 2D Fast Fourier Transform on Multicore Processors Using Load Imbalancing Parallel Computing Method",
IEEE Access, vol. 6: IEEE, pp. 64202-64224, 10/2018.
Download: ACCESS2878271.pdf (2.78 MB)
"Hierarchical Multicore Thread Mapping via Estimation of Remote Communication",
The Journal of Supercomputing, vol. 74, issue 3: Springer, pp. 1321-1340, 03/2018.
"Bi-Objective Optimization of Data-Parallel Applications on Homogeneous Multicore Clusters for Performance and Energy",
IEEE Transactions on Computers, vol. 67, issue 2: IEEE, pp. 160-177, 02/2018.
Download: paperfinal.pdf (1.16 MB)
"Out-of-core Implementation for Accelerator Kernels on Heterogeneous Clouds",
The Journal of Supercomputing, vol. 74, issue 2, pp. 551-568, 2018.
Download: paper.pdf (762.34 KB)
"Additivity: A Selection Criterion for Performance Events for Reliable Energy Predictive Modeling",
Supercomputing Frontiers and Innovations, vol. 4, issue 4, pp. 50-65, 12/2017.
Abstract
Download: 153-992-1-PB.pdf (666.73 KB)
"A Survey of Power and Energy Predictive Models in HPC Systems and Applications",
ACM Computing Surveys, vol. 50, issue 3: ACM, 10/2017.
Download: surveypowerenergymodelshpc.pdf (578.85 KB)
"New Model-based Methods and Algorithms for Performance and Energy Optimization of Data Parallel Applications on Homogeneous Multicore Clusters",
IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 4: IEEE, pp. 1119-1133, 04/2017.
Download: performance-energy-homo-multicore-clusters.pdf (1.27 MB)
"Design and implementation of self-adaptable parallel algorithms for scientific computing on highly heterogeneous HPC platforms",
arXiv.org, no. arXiv:1109.3074, 09/2011.
Download: 1109.3074.pdf (1.04 MB)
"Experimental Study of Six Different Implementations of Parallel Matrix Multiplication on Heterogeneous Computational Clusters of Multicore Processors",
18th Euromicro Conference on Parallel, Distributed and Network-based Processing (PDP 2010), Pisa, Italy, pp. 263-270, Feb 17-19, 2010.
Download: pdp2010.pdf (639.47 KB)