fupermod: Functional Performance Models of heterogeneous processors
Partitioner
Partitions 1D workloads. Takes conf_file and the total problem size as input and reads in the performance model data files. Using one of the seven algorithms listed below, it partitions the workload so that the load is balanced. Outputs a text file, partition.dist, listing hostname, device rank and distribution, for later input into routine (or fupermod_1dto2d_mxm_dist).
Algorithms:
- Partial functional performance model
- Full functional performance model
- Constant performance model 1 (small kernel)
- Constant performance model 2 (homogeneous kernel)
- Homogeneous distribution
- Manual distribution
- Geometric partitioning
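
To illustrate the two simplest ideas in the list, here is a minimal Python sketch, not fupermod's own code. It assumes that a homogeneous distribution means an even split, and that a constant performance model reduces each device to a single speed value, so the workload is split in proportion to those speeds. The function names partition_homogeneous and partition_constant are hypothetical and are not part of the fupermod API. How fupermod actually obtains the speeds (small kernel versus homogeneous kernel) is not shown.

```python
from fractions import Fraction


def partition_homogeneous(total, n):
    """Split `total` work units as evenly as possible across `n` devices."""
    base, extra = divmod(total, n)
    # The first `extra` devices take one additional unit each.
    return [base + (1 if i < extra else 0) for i in range(n)]


def partition_constant(total, speeds):
    """Split `total` work units in proportion to constant device speeds.

    `speeds` holds one positive number per device (an assumed input;
    this sketch does not read fupermod model data files).
    """
    # Exact rational arithmetic avoids floating-point rounding surprises.
    exact = [Fraction(s) for s in speeds]
    speed_sum = sum(exact)
    # Round each proportional share down; the floors never exceed `total`.
    shares = [int(total * s / speed_sum) for s in exact]
    # Hand out the remaining units one at a time, fastest devices first.
    leftover = total - sum(shares)
    by_speed = sorted(range(len(exact)), key=lambda i: exact[i], reverse=True)
    for i in by_speed[:leftover]:
        shares[i] += 1
    return shares
```

For example, partition_constant(100, [1, 3]) gives [25, 75], and partition_homogeneous(10, 3) gives [4, 3, 3]. The functional performance model algorithms differ in that speed is a function of problem size rather than a single number, so they cannot be reduced to one proportional split like this.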