"Multicore processor computing is not energy proportional: An opportunity for bi-objective optimization for energy and performance",
Applied Energy, vol. 268, pp. 18, 06/2020.
Download: paper_r2.pdf (1.38 MB)
"Network-aware optimization of communications for parallel matrix multiplication on hierarchical HPC platforms",
Concurrency and Computation: Practice and Experience, vol. 28, issue 3: Wiley, pp. 802-821, 03/2016.
"New Model-based Methods and Algorithms for Performance and Energy Optimization of Data Parallel Applications on Homogeneous Multicore Clusters",
IEEE Transactions on Parallel and Distributed Systems, vol. 28, issue 4: IEEE, pp. 1119-1133, 04/2017.
Download: performance-energy-homo-multicore-clusters.pdf (1.27 MB)
"A novel data partitioning algorithm for dynamic energy optimization on heterogeneous high-performance computing platforms",
Concurrency and Computation: Practice and Experience, vol. 33, issue 21: Wiley, pp. e5928, 07/2020.
Download: CCPE-2020-dynamic-energy.pdf (1.34 MB)
"A Novel Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms",
IEEE Transactions on Parallel and Distributed Systems, vol. 29, issue 10: IEEE, pp. 2176-2190, 10/2018.
Download: paper_r2.pdf (1.49 MB); tpds2018hpoptasuppl.pdf (3.4 MB)
"A Novel Statistical Learning-Based Methodology for Measuring the Goodness of Energy Profiles of Applications Executing on Multicore Computing Platforms",
Energies, vol. 13, issue 15: MDPI, pp. 22, 08/2020.
Download: energies-13-03944.pdf (4.08 MB); supplemental.pdf (188.52 KB)
"On Performance Analysis of Heterogeneous Parallel Algorithms",
Parallel Computing, vol. 30, issue 11, pp. 1195-1216, 2004.
Download: ParCom2004_hetero_perf.pdf (750.84 KB)
"OpenH: A Novel Programming Model and API for Developing Portable Parallel Programs on Heterogeneous Hybrid Servers",
IEEE Access, vol. 12, pp. 23666-23694, 02/2024.
Download: OpenH.pdf (2.4 MB)
"Out-of-core Implementation for Accelerator Kernels on Heterogeneous Clouds",
The Journal of Supercomputing, vol. 74, issue 2, pp. 551-568, 2018.
Download: paper.pdf (762.34 KB)
"An Overview of Heterogeneous High Performance and Grid Computing",
Engineering the Grid: Status and Perspective: American Scientific Publishers, February 2006.
Download: ASP_Overview2006.pdf (199.93 KB)
"Parallel Computing on Heterogeneous Networks: Challenges and Responses",
Problems of Programming, vol. 10, issue 2-3, pp. 251-260, 2004.
Download: 34 - Lastovetsky.pdf (95.7 KB)
"Parallel Data Partitioning Algorithms for Optimization of Data-Parallel Applications on Modern Extreme-Scale Multicore Platforms for Performance and Energy",
IEEE Access, vol. 6: IEEE, pp. 69075-69106, 11/2018.
Download: IEEEAccess2018PDPA.pdf (3.04 MB)
"A Parallel Language and Its Programming System for Heterogeneous Networks",
Concurrency: Practice and Experience, vol. 12, issue 13: Wiley, pp. 1317-1343, 2000.
Download: ParLangandItsProgrSystem_2000.pdf (518.3 KB)
"Parallel Processing of Remotely Sensed Hyperspectral Images On Heterogeneous Networks of Workstations Using HeteroMPI",
International Journal of High Performance Computing Applications, vol. 22, issue 4, pp. 386-407, 2008.
Download: 386.pdf (1.16 MB)
"Parallel Testing of Distributed Software",
Information and Software Technology, vol. 47, issue 10: Elsevier, pp. 657-662, 2005.
Download: ParTestSoft_2005.pdf (49.1 KB)
"Performance Analysis and Improvement of an Oceanography Application on a Supercomputer and on a Grid",
HPC-Europa++ reports from 2008 'Science and Supercomputing in Europe', pp. 217–219, 2008.
"Performance Optimization of Multithreaded 2D Fast Fourier Transform on Multicore Processors Using Load Imbalancing Parallel Computing Method",
IEEE Access, vol. 6: IEEE, pp. 64202-64224, 10/2018.
Download: ACCESS2878271.pdf (2.78 MB)
"Portable efficiency of software for parallel architectures",
Fundamental and Applied Mathematics, vol. 4, issue 3, pp. 947-974, 1998.
Download: fpm336.pdf (1.52 MB)
"Porting the OPATM-BFM Application to a Grid e-Infrastructure – Optimization of Communication and I/O Patterns",
Computational Methods in Science and Technology, vol. 15, no. 1, pp. 9–19, 2009.
"Recent Advances in Matrix Partitioning for Parallel Computing on Heterogeneous Platforms",
IEEE Transactions on Parallel and Distributed Systems, vol. 30, issue 1: IEEE, pp. 218-229, 01/2019.
Download: recent-advances-matrix.pdf (2.77 MB)
"Refined Description of the C[] Language",
Programming and Computer Software, vol. 28, issue 6, pp. 333-341, 2002.
Download: RefDescr_2002.pdf (54.55 KB)
"SmartGridRPC: The new RPC model for high performance Grid computing",
Concurrency and Computation: Practice and Experience, vol. 22, issue 18, pp. 2467-2487, 2010.
Download: smartgridrpc_ccpe_2010.pdf (1.1 MB)
"SUARA: A scalable universal allreduce communication algorithm for acceleration of parallel deep learning applications",
Journal of Parallel and Distributed Computing, vol. 183, pp. 15, 01/2024.
Download: jpdc-suara.pdf (2.46 MB)
"A Survey of Communication Performance Models for High-Performance Computing",
ACM Computing Surveys, vol. 51, issue 6: ACM, 01/2019.
"A Survey of Power and Energy Predictive Models in HPC Systems and Applications",
ACM Computing Surveys, vol. 50, issue 3: ACM, 10/2017.
Download: surveypowerenergymodelshpc.pdf (578.85 KB)