Difference between revisions of "Grid5000"

From HCL
Jump to: navigation, search
Line 4: Line 4:
 
* Select sites and clusters for experiments, using information on the [https://www.grid5000.fr/mediawiki/index.php/Grid5000:Network#Grid.275000_Sites Grid5000 network]
 
* Select sites and clusters for experiments, using information on the [https://www.grid5000.fr/mediawiki/index.php/Grid5000:Network#Grid.275000_Sites Grid5000 network]
 
* Access is provided via access nodes: '''access.SITE.grid5000.fr'''. As soon as you are on one of the sites, you can directly ssh frontend node of any other site: '''frontend.SITE2'''.
 
* Access is provided via access nodes: '''access.SITE.grid5000.fr'''. As soon as you are on one of the sites, you can directly ssh frontend node of any other site: '''frontend.SITE2'''.
* There is no access to Internet from computing nodes (external IPs should be registered on proxy), therefore, download/update your stuff at the access nodes. Some revision control clients are available.
+
* There is no access to Internet from computing nodes (external IPs should be registered on proxy), therefore, download/update your stuff at the access nodes. Several revision control clients are available.
 
* Each site has a separate NFS, therefore, to run an application on several sites at once, you need to copy it '''scp, sftp, rsync''' between access or frontend nodes.
 
* Each site has a separate NFS, therefore, to run an application on several sites at once, you need to copy it '''scp, sftp, rsync''' between access or frontend nodes.
 
* Jobs are run from the frondend nodes, using a [http://en.wikipedia.org/wiki/OpenPBS PBS]]-like system [https://www.grid5000.fr/mediawiki/index.php/Cluster_experiment-OAR2 OAR]. Basic commands:
 
* Jobs are run from the frondend nodes, using a [http://en.wikipedia.org/wiki/OpenPBS PBS]]-like system [https://www.grid5000.fr/mediawiki/index.php/Cluster_experiment-OAR2 OAR]. Basic commands:

Revision as of 17:53, 21 June 2010

https://www.grid5000.fr/mediawiki/index.php/Grid5000:Home

Usage

  • Select sites and clusters for experiments, using information on the Grid5000 network
  • Access is provided via access nodes: access.SITE.grid5000.fr. As soon as you are on one of the sites, you can directly ssh frontend node of any other site: frontend.SITE2.
  • There is no access to Internet from computing nodes (external IPs should be registered on proxy), therefore, download/update your stuff at the access nodes. Several revision control clients are available.
  • Each site has a separate NFS, therefore, to run an application on several sites at once, you need to copy it scp, sftp, rsync between access or frontend nodes.
  • Jobs are run from the frondend nodes, using a PBS]-like system OAR. Basic commands:
    • oarstat - queue status
    • oarsub - job submission
    • oardel - job removal

Interactive job on deployed images:

$ oarsub -I -t deploy -l [/cluster=N/]nodes=N,walltime=HH[:MM[:SS]] [-p 'PROPERTY="VALUE"']

Batch job on installed images:

$ oarsub BATCH_FILE -t allow_classic_ssh -l [/cluster=N/]nodes=N,walltime=HH[:MM[:SS]] [-p 'PROPERTY="VALUE"']
  • The image to deploy can be created with help of a Systemimager-like system Kadeploy. A Linux distribution lenny-x64-nfs-2.1 with mc, subversion, autotools, doxygen, MPICH2, GSL, Boost, R, gnuplot, graphviz, X11, evince is available at Orsay /home/nancy/alastovetsky/grid5000.
  • Running MPI applications is described here