Difference between revisions of "UTK multicores + GPU"
From HCL
Zhongziming (talk | contribs) |
Zhongziming (talk | contribs) |
||
Line 23: | Line 23: | ||
$ mpirun -rf rankfile -app appfile_fpm | $ mpirun -rf rankfile -app appfile_fpm | ||
− | + | ** example of a rankfile: | |
rank 0=ig.icl.utk.edu slot=0:0 | rank 0=ig.icl.utk.edu slot=0:0 | ||
rank 1=ig.icl.utk.edu slot=0:1 | rank 1=ig.icl.utk.edu slot=0:1 | ||
− | + | ... | |
− | + | ||
− | + | ** example of a appfile for building functional permanence model (FPM): | |
+ | # GPU | ||
+ | # e.g. Linking against cublas, and fupermod is configured under cublas_config | ||
+ | # suboption g=0 means device 0 is selected for computing | ||
+ | |||
+ | -host localhost -np 1 $HOME/fupermod/cublas_config/tools/builder -l $HOME/fupermod/cublas_config/routines/mxm/.libs/libmxm_col.so -o k=640,g=0 -U10000 -s10 | ||
+ | |||
+ | #--------------------------------------------------------------------------------------------------------------------------------------------------------------- | ||
+ | |||
+ | # CPU | ||
+ | # e.g. Linking against acml, and fupermod is configured under acml_config | ||
+ | |||
+ | -host localhost -np 47 $HOME/fupermod/acml_config/tools/builder -l $HOME/fupermod/acml_config/routines/mxm/.libs/libmxm_col.so -o k=640 -U10000 -s10 |
Revision as of 10:47, 12 July 2012
List of machines
http://icl.cs.utk.edu/iclhelp/custom/index.html?lid=97&slid=180
Display a list of available GPUs
$ nvidia-smi -L
Using Fupermod on hybrid multicore/GPUs node
- Compiling : Create two separate directories for configuration with selected CPU cblas (e.g. gsl, acml, mkl)and GPU cblas (e.g. cublas).
- For example: Using acml blas for CPU and cublas for GPU computing
cd fupermod/
mkdir acml_config cd acml_config ./configure --with-cblas=acml
mkdir cuda_config cd cuda_config ./configure --with-cblas=cuda
- Building performance model:
- rankfile is for processing binding, and appfile tells mpi what programs to launch
$ mpirun -rf rankfile -app appfile_fpm
- example of a rankfile:
rank 0=ig.icl.utk.edu slot=0:0 rank 1=ig.icl.utk.edu slot=0:1 ...
- example of a appfile for building functional permanence model (FPM):
# GPU # e.g. Linking against cublas, and fupermod is configured under cublas_config # suboption g=0 means device 0 is selected for computing
-host localhost -np 1 $HOME/fupermod/cublas_config/tools/builder -l $HOME/fupermod/cublas_config/routines/mxm/.libs/libmxm_col.so -o k=640,g=0 -U10000 -s10
#---------------------------------------------------------------------------------------------------------------------------------------------------------------
# CPU # e.g. Linking against acml, and fupermod is configured under acml_config
-host localhost -np 47 $HOME/fupermod/acml_config/tools/builder -l $HOME/fupermod/acml_config/routines/mxm/.libs/libmxm_col.so -o k=640 -U10000 -s10