Grid5000
From HCL
https://www.grid5000.fr/mediawiki/index.php/Grid5000:Home
Usage
- Select sites and clusters for experiments, using information on the Grid5000 network
- Access is provided via access nodes: access.SITE.grid5000.fr. As soon as you are on one of the sites, you can directly ssh frontend node of any other site: frontend.SITE2.
- There is no access to Internet from computing nodes (external IPs should be registered on proxy), therefore, download/update your stuff at the access nodes. Several revision control clients are available.
- Each site has a separate NFS, therefore, to run an application on several sites at once, you need to copy it scp, sftp, rsync between access or frontend nodes.
- Jobs are run from the frondend nodes, using a PBS]-like system OAR. Basic commands:
- oarstat - queue status
- oarsub - job submission
- oardel - job removal
Interactive job on deployed images:
$ oarsub -I -t deploy -l [/cluster=N/]nodes=N,walltime=HH[:MM[:SS]] [-p 'PROPERTY="VALUE"']
Batch job on installed images:
$ oarsub BATCH_FILE -t allow_classic_ssh -l [/cluster=N/]nodes=N,walltime=HH[:MM[:SS]] [-p 'PROPERTY="VALUE"']
- The image to deploy can be created with help of a Systemimager-like system Kadeploy. Loading an image:
$ kadeploy3 -e PATH_TO_IMAGE_DESC -f $OAR_FILE_NODES
A Linux distribution lenny-x64-nfs-2.1 with mc, subversion, autotools, doxygen, MPICH2, GSL, Boost, R, gnuplot, graphviz, X11, evince is available at Orsay /home/nancy/alastovetsky/grid5000.
- Running MPI applications is described here