Filters: Author is Ravi Reddy Manumachu [Clear All Filters]
"SUARA: A scalable universal allreduce communication algorithm for acceleration of parallel deep learning applications",
Journal of Parallel and Distributed Computing, vol. 183, pp. 15, 01/2024.
Download: jpdc-suara.pdf (2.46 MB)
"OpenH: A Novel Programming Model and API for Developing Portable Parallel Programs on Heterogeneous Hybrid Servers",
IEEE Access, vol. 12, pp. 23666--23694, 02/2024.
Download: OpenH.pdf (2.4 MB)
"On Energy Nonproportionality of CPUs and GPUs",
31st Heterogeneity in Computing Workshop (HCW 2022), Lyon, France, IEEE, pp. 34-44, 30/05/2022.
Download: On_Energy_Nonproportionality_of_CPUs_and_GPUs.pdf (1.03 MB)
"A Novel Algorithm for Bi-objective Performance-Energy Optimization of Applications with Continuous Performance and Linear Energy Profiles on Heterogeneous HPC Platforms",
19th Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2021), Lisbon, Portugal, Lecture Notes in Computer Science, vol. 13098, Springer, pp. 166-178, 31/08/2021, 2022.
Download: Khaleghzadeh2022_Chapter_ANovelAlgorithmForBi-objective.pdf (1.07 MB)
"Improving the accuracy of energy predictive models for multicore CPUs by combining utilization and performance events model variables",
Journal of Parallel and Distributed Computing, vol. 151: Elsevier, pp. 38-51, 05/2021.
Download: jpdc-2021-151.pdf (1.4 MB)
"Energy-Efficient Parallel Computing: Challenges to Scaling",
Information, vol. 14, issue 4, pp. 1--29, 04/2023.
Download: information-14-00248.pdf (1.53 MB)
"Energy Predictive Models of Computing: Theory, Practical Implications and Experimental Analysis on Multicore Processors",
IEEE Access, vol. 9: IEEE, pp. 63149 - 63172, 04/2021.
Download: IEEE_Access_2021_Energy_theory.pdf (2.11 MB)
"Efficient exact algorithms for continuous bi-objective performance-energy optimization of applications with linear energy and monotonically increasing performance profiles on heterogeneous high performance computing platforms",
Concurrency and Computation: Practice and Experience, vol. 35, issue 20: Wiley, pp. 1--19, 09/2023.
Download: Concurrency and Computation - 2022 - Khaleghzadeh - Efficient exact algorithms for continuous bi‐objective.pdf (1.58 MB)
"A Comparative Study of Techniques for Energy Predictive Modeling Using Performance Monitoring Counters on Modern Multicore CPUs",
IEEE Access, vol. 8: IEEE, pp. 143306 - 143332, 08/2020.
Download: IEEE-Access-09154439.pdf (2.37 MB)
"Bi-Objective Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Performance and Energy Through Workload Distribution",
IEEE Transactions on Parallel and Distributed Systems, vol. 32, issue 3: IEEE, pp. 543-560, 03/2021.
Download: tpds-2021-32-3-09207974.pdf (1.58 MB)
"Acceleration of Bi-Objective Optimization of Data-Parallel Applications for Performance and Energy on Heterogeneous Hybrid Platforms",
IEEE Access, vol. 11: IEEE, pp. 27226-27245, 03/2023.
Download: Access-2023-acceleration.pdf (1.28 MB)
"The 27th International Heterogeneity in Computing Workshop and the 16th International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms",
Concurrency and Computation: Practice and Experience, vol. 32, issue 15: Wiley, pp. 3, 03/2020.
Download: cpe.5736.pdf (169.99 KB)