Filters: Author is Alexey Lastovetsky
"Towards Optimal Matrix Partitioning for Data Parallel Computing on a Hybrid Heterogeneous Server",
IEEE Access, vol. 9: IEEE, pp. 17229-17244, 02/2021.
Download: IEEE-Access-09328411.pdf (3.76 MB)
"A tool to assess the communication cost of parallel kernels on heterogeneous platforms",
The Journal of Supercomputing, vol. 76: Springer, pp. 4629–4644, 06/2020.
"A Survey of Communication Performance Models for High-Performance Computing",
ACM Computing Surveys, vol. 51, issue 6: ACM, 01/2019.
"SummaGen: Parallel Matrix-Matrix Multiplication Based on Non-rectangular Partitions for Heterogeneous HPC Platforms",
28th Heterogeneity in Computing Workshop (HCW 2019), Rio de Janeiro, Brazil, IEEE, 20/05/2019.
Download: hcw2019.pdf (673.25 KB)
"SUARA: A scalable universal allreduce communication algorithm for acceleration of parallel deep learning applications",
Journal of Parallel and Distributed Computing, vol. 183, pp. 15, 01/2024.
Download: jpdc-suara.pdf (2.46 MB)
"Programming models and runtimes",
Ultrascale computing systems: IET, 03/2019.
Download: nesus-book-chap2.pdf (3.84 MB); nesus-book-chap2-summary.pdf (123.29 KB)
"Performance Optimization of Multithreaded 2D FFT on Multicore Processors: Challenges and Solution Approaches",
IEEE 25th International Conference on High Performance Computing Workshops (HiPCW), Bengaluru, India, IEEE, pp. 8-17, 17-20 Dec, 2018.
Download: paper.pdf (1.33 MB)
"Performance Optimization of Multithreaded 2D Fast Fourier Transform on Multicore Processors Using Load Imbalancing Parallel Computing Method",
IEEE Access, vol. 6: IEEE, pp. 64202-64224, 10/2018.
Download: ACCESS2878271.pdf (2.78 MB)
"Parallel Data Partitioning Algorithms for Optimization of Data-Parallel Applications on Modern Extreme-Scale Multicore Platforms for Performance and Energy",
IEEE Access, vol. 6: IEEE, pp. 69075-69106, 11/2018.
Download: IEEEAccess2018PDPA.pdf (3.04 MB)
"Optimization of Multithreaded Data-parallel Applications on Modern Multicore CPUs For Performance and Energy Using Application-level Decision Variables",
School of Computer Science, Dublin, University College Dublin, pp. 181, 09/2019.
Download: PhD_Thesis_Semyon_Khokhriakov.pdf (8.7 MB); thesis-summary.pdf (621.98 KB)
"Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Dynamic Energy Through Workload Distribution",
17th Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2019), Göttingen, Germany, Lecture Notes in Computer Science, vol. 11997, Springer, 08/2019, 2020.
Download: Khaleghzadeh2020_Chapter_OptimizationOfData-ParallelApp.pdf (692.31 KB)
"Optimal Matrix Partitioning for Data Parallel Computing on Hybrid Heterogeneous Platforms",
19th International Symposium on Parallel and Distributed Computing (ISPDC), Warsaw, Poland, IEEE, 5-8 July, 2020.
Download: ispdc2020.pdf (367.16 KB)
"OpenH: A Novel Programming Model and API for Developing Portable Parallel Programs on Heterogeneous Hybrid Servers",
IEEE Access, vol. 12, pp. 23666-23694, 02/2024.
Download: OpenH.pdf (2.4 MB)
"On Energy Nonproportionality of CPUs and GPUs",
31st Heterogeneity in Computing Workshop (HCW 2022), Lyon, France, IEEE, pp. 34-44, 30/05/2022.
Download: On_Energy_Nonproportionality_of_CPUs_and_GPUs.pdf (1.03 MB)
"A Novel Statistical Learning-Based Methodology for Measuring the Goodness of Energy Profiles of Applications Executing on Multicore Computing Platforms",
Energies, vol. 13, issue 15: MDPI, pp. 22, 08/2020.
Download: energies-13-03944.pdf (4.08 MB); supplemental.pdf (188.52 KB)
"Novel Data-Partitioning Algorithms for Performance and Energy Optimization of Data-Parallel Applications on Modern Heterogeneous HPC Platforms",
School of Computer Science, Dublin, University College Dublin, pp. 264, 03/2019.
Download: thesis-hamid.pdf (5.9 MB)
"A novel data partitioning algorithm for dynamic energy optimization on heterogeneous high-performance computing platforms",
Concurrency and Computation: Practice and Experience, vol. 33, issue 21: Wiley, pp. e5928, 07/2020.
Download: CCPE-2020-dynamic-energy.pdf (1.34 MB)
"A Novel Algorithm for Bi-objective Performance-Energy Optimization of Applications with Continuous Performance and Linear Energy Profiles on Heterogeneous HPC Platforms",
19th Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2021), Lisbon, Portugal, Lecture Notes in Computer Science, vol. 13098, Springer, pp. 166-178, 31/08/2021, 2022.
Download: Khaleghzadeh2022_Chapter_ANovelAlgorithmForBi-objective.pdf (1.07 MB)
"A New Model-Based Approach to Performance Comparison of MPI Collective Algorithms",
16th International Conference on Parallel Computing Technologies (PaCT 2021), Kaliningrad, Russia, Lecture Notes in Computer Science 12942, Springer, pp. 11-25, 09/2021.
Download: Nuriyev-Lastovetsky2021_Chapter_ANewModel-BasedApproachToPerfo.pdf (623.78 KB)
"Multicore processor computing is not energy proportional: An opportunity for bi-objective optimization for energy and performance",
Applied Energy, vol. 268, pp. 18, 06/2020.
Download: paper_r2.pdf (1.38 MB)
"Model-based selection of optimal MPI broadcast algorithms for multi-core clusters",
Journal of Parallel and Distributed Computing, vol. 165: Elsevier, pp. 1-16, 07/2022.
Download: 1-s2.0-S0743731522000697-main.pdf (988.38 KB)
"Improving the Accuracy of Energy Predictive Models for Multicore CPUs Using Additivity of Performance Monitoring Counters",
15th International Conference on Parallel Computing Technologies (PaCT-2019), Almaty, Kazakhstan, Lecture Notes in Computer Science 11657, Springer, pp. 51-66, 08/2019.
Download: PaCT2019.pdf (370.4 KB)
"Improving the accuracy of energy predictive models for multicore CPUs by combining utilization and performance events model variables",
Journal of Parallel and Distributed Computing, vol. 151: Elsevier, pp. 38-51, 05/2021.
Download: jpdc-2021-151.pdf (1.4 MB)
"How Pre-multicore Methods and Algorithms Perform in Multicore Era",
High Performance Computing. ISC High Performance 2018. Lecture Notes in Computer Science, vol 11203, Frankfurt, Springer Nature, pp. 527-539, 24-26 June, 2018, 2019.
Download: nesus-isc-paper.pdf (574.34 KB)
"A Hierarchical Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous Multi-Accelerator NUMA Nodes",
IEEE Access, vol. 8: IEEE, pp. 7861-7876, 01/2020.
Download: 08933138.pdf (3.4 MB)