Difference between revisions of "HCL Cluster Specifications"
From HCL
					
										
					
					|  (→Cluster Specifications) | |||
| (9 intermediate revisions by 3 users not shown) | |||
| Line 1: | Line 1: | ||
| + | ==Cluster Specifications== | ||
| + | |||
| {|  border="1" cellspacing="1" cellpadding="5"   | {|  border="1" cellspacing="1" cellpadding="5"   | ||
| | Name | | Name | ||
| Line 48: | Line 50: | ||
| | 1 – 2 | | 1 – 2 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl01 (NIC1) | + | |  ALIGN=LEFT | [[Hcl01]] (NIC1) | 
| |  ALIGN=LEFT | Dell Poweredge 750 | |  ALIGN=LEFT | Dell Poweredge 750 | ||
| |  ALIGN=LEFT | 192.168.21.3 | |  ALIGN=LEFT | 192.168.21.3 | ||
| Line 72: | Line 74: | ||
| | 3 | | 3 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl02 (NIC1) | + | |  ALIGN=LEFT | [[Hcl02]] (NIC1) | 
| |  ALIGN=LEFT | Dell Poweredge 750 | |  ALIGN=LEFT | Dell Poweredge 750 | ||
| |  ALIGN=LEFT | 192.168.21.4 | |  ALIGN=LEFT | 192.168.21.4 | ||
| Line 78: | Line 80: | ||
| |  ALIGN=LEFT | 800MHz | |  ALIGN=LEFT | 800MHz | ||
| |  ALIGN=LEFT | 1MB | |  ALIGN=LEFT | 1MB | ||
| − | |  ALIGN=LEFT | 1GB | + | |  ALIGN=LEFT | 1GB (4x256) | 
| |  ALIGN=LEFT | 80GB SATA | |  ALIGN=LEFT | 80GB SATA | ||
| |  ALIGN=LEFT | 250GB SATA | |  ALIGN=LEFT | 250GB SATA | ||
| Line 96: | Line 98: | ||
| | 4 | | 4 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl03 (NIC1) | + | |  ALIGN=LEFT | [[Hcl03]] (NIC1) | 
| |  ALIGN=LEFT | Dell Poweredge 750 | |  ALIGN=LEFT | Dell Poweredge 750 | ||
| |  ALIGN=LEFT | 192.168.21.5 | |  ALIGN=LEFT | 192.168.21.5 | ||
| Line 102: | Line 104: | ||
| |  ALIGN=LEFT | 800MHz | |  ALIGN=LEFT | 800MHz | ||
| |  ALIGN=LEFT | 1MB | |  ALIGN=LEFT | 1MB | ||
| − | |  ALIGN=LEFT | 1GB | + | |  ALIGN=LEFT | 1GB (2x512) | 
| |  ALIGN=LEFT | 80GB SATA | |  ALIGN=LEFT | 80GB SATA | ||
| |  ALIGN=LEFT | N/A | |  ALIGN=LEFT | N/A | ||
| Line 120: | Line 122: | ||
| | 5 | | 5 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl04 (NIC1) | + | |  ALIGN=LEFT | [[Hcl04]] (NIC1) | 
| |  ALIGN=LEFT | Dell Poweredge 750 | |  ALIGN=LEFT | Dell Poweredge 750 | ||
| |  ALIGN=LEFT | 192.168.21.6 | |  ALIGN=LEFT | 192.168.21.6 | ||
| Line 144: | Line 146: | ||
| | 6 | | 6 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl05 (NIC1) | + | |  ALIGN=LEFT | [[Hcl05]] (NIC1) | 
| |  ALIGN=LEFT | Dell Poweredge SC1425 | |  ALIGN=LEFT | Dell Poweredge SC1425 | ||
| |  ALIGN=LEFT | 192.168.21.7 | |  ALIGN=LEFT | 192.168.21.7 | ||
| Line 168: | Line 170: | ||
| | 7 | | 7 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl06 (NIC1) | + | |  ALIGN=LEFT | [[Hcl06]] (NIC1) | 
| |  ALIGN=LEFT | Dell Poweredge SC1425 | |  ALIGN=LEFT | Dell Poweredge SC1425 | ||
| |  ALIGN=LEFT | 192.168.21.8 | |  ALIGN=LEFT | 192.168.21.8 | ||
| − | |  ALIGN=LEFT | 3. | + | |  ALIGN=LEFT | 3.0 Xeon | 
| |  ALIGN=LEFT | 800MHz | |  ALIGN=LEFT | 800MHz | ||
| |  ALIGN=LEFT | 2MB | |  ALIGN=LEFT | 2MB | ||
| Line 192: | Line 194: | ||
| | 8 | | 8 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl07 (NIC1) | + | |  ALIGN=LEFT | [[Hcl07]](NIC1) | 
| |  ALIGN=LEFT | Dell Poweredge 750 | |  ALIGN=LEFT | Dell Poweredge 750 | ||
| |  ALIGN=LEFT | 192.168.21.9 | |  ALIGN=LEFT | 192.168.21.9 | ||
| Line 198: | Line 200: | ||
| |  ALIGN=LEFT | 800MHz | |  ALIGN=LEFT | 800MHz | ||
| |  ALIGN=LEFT | 1MB | |  ALIGN=LEFT | 1MB | ||
| − | |  ALIGN=LEFT | 256MB | + | |  ALIGN=LEFT | 256MB (1x256) | 
| |  ALIGN=LEFT | 80GB SATA | |  ALIGN=LEFT | 80GB SATA | ||
| |  ALIGN=LEFT | N/A | |  ALIGN=LEFT | N/A | ||
| Line 216: | Line 218: | ||
| | 9 | | 9 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl08 (NIC1) | + | |  ALIGN=LEFT | [[Hcl08]] (NIC1) | 
| |  ALIGN=LEFT | Dell Poweredge 750 | |  ALIGN=LEFT | Dell Poweredge 750 | ||
| |  ALIGN=LEFT | 192.168.21.10 | |  ALIGN=LEFT | 192.168.21.10 | ||
| Line 222: | Line 224: | ||
| |  ALIGN=LEFT | 800MHz | |  ALIGN=LEFT | 800MHz | ||
| |  ALIGN=LEFT | 1MB | |  ALIGN=LEFT | 1MB | ||
| − | |  ALIGN=LEFT | 256MB | + | |  ALIGN=LEFT | 256MB (1x256) | 
| |  ALIGN=LEFT | 80GB SATA | |  ALIGN=LEFT | 80GB SATA | ||
| |  ALIGN=LEFT | N/A | |  ALIGN=LEFT | N/A | ||
| Line 240: | Line 242: | ||
| | 10 | | 10 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl09 (NIC1) | + | |  ALIGN=LEFT | [[Hcl09]] (NIC1) | 
| |  ALIGN=LEFT | IBM E-server 326 | |  ALIGN=LEFT | IBM E-server 326 | ||
| |  ALIGN=LEFT | 192.168.21.11 | |  ALIGN=LEFT | 192.168.21.11 | ||
| Line 264: | Line 266: | ||
| | 11 | | 11 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl10 (NIC1) | + | |  ALIGN=LEFT | [[Hcl10]] (NIC1) | 
| |  ALIGN=LEFT | IBM E-server 326 | |  ALIGN=LEFT | IBM E-server 326 | ||
| |  ALIGN=LEFT | 192.168.21.12 | |  ALIGN=LEFT | 192.168.21.12 | ||
| Line 288: | Line 290: | ||
| | 12 | | 12 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl11 (NIC1) | + | |  ALIGN=LEFT | [[Hcl11]] (NIC1) | 
| |  ALIGN=LEFT | IBM X-Series 306 | |  ALIGN=LEFT | IBM X-Series 306 | ||
| |  ALIGN=LEFT | 192.168.21.13 | |  ALIGN=LEFT | 192.168.21.13 | ||
| Line 312: | Line 314: | ||
| | 13 | | 13 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl12 (NIC1) | + | |  ALIGN=LEFT | [[Hcl12]] (NIC1) | 
| |  ALIGN=LEFT | HP Proliant DL 320 G3 | |  ALIGN=LEFT | HP Proliant DL 320 G3 | ||
| |  ALIGN=LEFT | 192.168.21.14 | |  ALIGN=LEFT | 192.168.21.14 | ||
| Line 336: | Line 338: | ||
| | 14 | | 14 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl13 (NIC1) | + | |  ALIGN=LEFT | [[Hcl13]] (NIC1) | 
| |  ALIGN=LEFT | HP Proliant DL 320 G3 | |  ALIGN=LEFT | HP Proliant DL 320 G3 | ||
| |  ALIGN=LEFT | 192.168.21.15 | |  ALIGN=LEFT | 192.168.21.15 | ||
| Line 360: | Line 362: | ||
| | 15 | | 15 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl14 (NIC1) | + | |  ALIGN=LEFT | [[Hcl14]] (NIC1) | 
| |  ALIGN=LEFT | HP Proliant DL 140 G2 | |  ALIGN=LEFT | HP Proliant DL 140 G2 | ||
| |  ALIGN=LEFT | 192.168.21.16 | |  ALIGN=LEFT | 192.168.21.16 | ||
| Line 384: | Line 386: | ||
| | 16 | | 16 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl15 (NIC1) | + | |  ALIGN=LEFT | [[Hcl15]] (NIC1) | 
| |  ALIGN=LEFT | HP Proliant DL 140 G2 | |  ALIGN=LEFT | HP Proliant DL 140 G2 | ||
| |  ALIGN=LEFT | 192.168.21.17 | |  ALIGN=LEFT | 192.168.21.17 | ||
| Line 408: | Line 410: | ||
| | 17 | | 17 | ||
| |-   | |-   | ||
| − | |  ALIGN=LEFT | Hcl16 (NIC1) | + | |  ALIGN=LEFT | [[Hcl16]] (NIC1) | 
| |  ALIGN=LEFT | HP Proliant DL 140 G2 | |  ALIGN=LEFT | HP Proliant DL 140 G2 | ||
| |  ALIGN=LEFT | 192.168.21.18 | |  ALIGN=LEFT | 192.168.21.18 | ||
| Line 414: | Line 416: | ||
| |  ALIGN=LEFT | 800MHz | |  ALIGN=LEFT | 800MHz | ||
| |  ALIGN=LEFT | 2MB | |  ALIGN=LEFT | 2MB | ||
| − | |  ALIGN=LEFT |  | + | |  ALIGN=LEFT | 384MB | 
| |  ALIGN=LEFT | 80GB SATA | |  ALIGN=LEFT | 80GB SATA | ||
| |  ALIGN=LEFT | N/A | |  ALIGN=LEFT | N/A | ||
| Line 432: | Line 434: | ||
| | 18 | | 18 | ||
| |} | |} | ||
| + | |||
| + | ==Cluster Benchmarks== | ||
| + | ===Stream=== | ||
| + | ====Cluster Performance==== | ||
| + |  ---------------------------------------------- | ||
| + |  Double precision appears to have 16 digits of accuracy | ||
| + |  Assuming 8 bytes per DOUBLE PRECISION word | ||
| + |  ---------------------------------------------- | ||
| + |  Number of processors =           18 | ||
| + |  Array size =    2000000 | ||
| + |  Offset     =          0 | ||
| + |  The total memory requirement is     824.0 MB (     45.8MB/task) | ||
| + |  You are running each test  10 times | ||
| + |  -- | ||
| + |  The *best* time for each test is used | ||
| + |  *EXCLUDING* the first and last iterations | ||
| + |  ---------------------------------------------------- | ||
| + |  Your clock granularity appears to be less than one microsecond | ||
| + |  Your clock granularity/precision appears to be      1 microseconds | ||
| + |  ---------------------------------------------------- | ||
| + |  Function     Rate (MB/s)  Avg time   Min time  Max time | ||
| + |  Copy:      24589.1750      0.0235      0.0234      0.0237 | ||
| + |  Scale:     24493.9786      0.0237      0.0235      0.0245 | ||
| + |  Add:       27594.1797      0.0314      0.0313      0.0315 | ||
| + |  Triad:     27695.7938      0.0313      0.0312      0.0315 | ||
| + |  ----------------------------------------------- | ||
| + |  Solution Validates! | ||
| + |  ----------------------------------------------- | ||
| + | |||
| + | ====Individual Node Performance==== | ||
| + | [[Image:stream_bench_results.png]] | ||
| + | |||
| + | hcl01.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 1 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity appears to be less than one microsecond. | ||
| + |  Each test below will take on the order of 8386 microseconds. | ||
| + |     (= 8386 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        2615.4617       0.0125       0.0122       0.0132 | ||
| + |  Scale:       2609.2783       0.0125       0.0123       0.0133 | ||
| + |  Add:         3046.4707       0.0161       0.0158       0.0168 | ||
| + |  Triad:       3064.7322       0.0160       0.0157       0.0166 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
| + | hcl02.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 1 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity appears to be less than one microsecond. | ||
| + |  Each test below will take on the order of 7790 microseconds. | ||
| + |     (= 7790 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        2740.6840       0.0117       0.0117       0.0120 | ||
| + |  Scale:       2745.3930       0.0117       0.0117       0.0117 | ||
| + |  Add:         3063.1599       0.0157       0.0157       0.0157 | ||
| + |  Triad:       3075.3572       0.0156       0.0156       0.0159 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
| + | hcl03.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 1 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity appears to be less than one microsecond. | ||
| + |  Each test below will take on the order of 8382 microseconds. | ||
| + |     (= 8382 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        2788.6728       0.0115       0.0115       0.0115 | ||
| + |  Scale:       2722.0144       0.0118       0.0118       0.0121 | ||
| + |  Add:         3266.6166       0.0148       0.0147       0.0150 | ||
| + |  Triad:       3287.4249       0.0146       0.0146       0.0147 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
| + | hcl04.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 1 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity appears to be less than one microsecond. | ||
| + |  Each test below will take on the order of 8378 microseconds. | ||
| + |     (= 8378 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        2815.4071       0.0114       0.0114       0.0114 | ||
| + |  Scale:       2751.5270       0.0116       0.0116       0.0117 | ||
| + |  Add:         3260.1979       0.0148       0.0147       0.0150 | ||
| + |  Triad:       3274.2183       0.0147       0.0147       0.0150 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
| + | hcl05.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 1 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity appears to be less than one microsecond. | ||
| + |  Each test below will take on the order of 14916 microseconds. | ||
| + |     (= 14916 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        1581.8068       0.0203       0.0202       0.0204 | ||
| + |  Scale:       1557.1081       0.0207       0.0206       0.0215 | ||
| + |  Add:         1807.7015       0.0266       0.0266       0.0266 | ||
| + |  Triad:       1832.9646       0.0263       0.0262       0.0265 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
| + | hcl06.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 1 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity appears to be less than one microsecond. | ||
| + |  Each test below will take on the order of 14824 microseconds. | ||
| + |     (= 14824 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        1558.3835       0.0206       0.0205       0.0206 | ||
| + |  Scale:       1550.3951       0.0207       0.0206       0.0209 | ||
| + |  Add:         1863.4239       0.0258       0.0258       0.0259 | ||
| + |  Triad:       1885.3916       0.0255       0.0255       0.0259 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
| + | hcl07.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 1 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity appears to be less than one microsecond. | ||
| + |  Each test below will take on the order of 13642 microseconds. | ||
| + |     (= 13642 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        1759.4014       0.0183       0.0182       0.0186 | ||
| + |  Scale:       1740.2731       0.0184       0.0184       0.0185 | ||
| + |  Add:         2036.4962       0.0236       0.0236       0.0238 | ||
| + |  Triad:       2045.5920       0.0235       0.0235       0.0235 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
| + | hcl08.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 1 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity appears to be less than one microsecond. | ||
| + |  Each test below will take on the order of 13476 microseconds. | ||
| + |     (= 13476 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        1809.6454       0.0177       0.0177       0.0177 | ||
| + |  Scale:       1784.8271       0.0179       0.0179       0.0179 | ||
| + |  Add:         2085.3320       0.0231       0.0230       0.0232 | ||
| + |  Triad:       2095.7094       0.0231       0.0229       0.0239 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
| + | hcl09.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 2 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity/precision appears to be 2 microseconds. | ||
| + |  Each test below will take on the order of 7890 microseconds. | ||
| + |     (= 3945 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        2838.6204       0.0114       0.0113       0.0115 | ||
| + |  Scale:       2774.6396       0.0116       0.0115       0.0117 | ||
| + |  Add:         3144.0396       0.0155       0.0153       0.0156 | ||
| + |  Triad:       3160.7905       0.0153       0.0152       0.0156 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
| + | hcl10.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 2 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity/precision appears to be 2 microseconds. | ||
| + |  Each test below will take on the order of 7959 microseconds. | ||
| + |     (= 3979 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        2864.2732       0.0113       0.0112       0.0114 | ||
| + |  Scale:       2784.2952       0.0116       0.0115       0.0118 | ||
| + |  Add:         3162.2553       0.0153       0.0152       0.0155 | ||
| + |  Triad:       3223.8430       0.0151       0.0149       0.0153 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
| + | hcl11.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 1 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity/precision appears to be 1 microseconds. | ||
| + |  Each test below will take on the order of 13082 microseconds. | ||
| + |     (= 13082 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        1842.8984       0.0174       0.0174       0.0174 | ||
| + |  Scale:       1818.1816       0.0177       0.0176       0.0179 | ||
| + |  Add:         2109.8918       0.0228       0.0227       0.0231 | ||
| + |  Triad:       2119.0161       0.0227       0.0227       0.0227 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
| + | hcl12.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 1 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity appears to be less than one microsecond. | ||
| + |  Each test below will take on the order of 12621 microseconds. | ||
| + |     (= 12621 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        1805.0657       0.0178       0.0177       0.0179 | ||
| + |  Scale:       1795.5385       0.0179       0.0178       0.0182 | ||
| + |  Add:         2089.9636       0.0230       0.0230       0.0230 | ||
| + |  Triad:       2097.5313       0.0229       0.0229       0.0231 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
| + | hcl13.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 1 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity appears to be less than one microsecond. | ||
| + |  Each test below will take on the order of 12203 microseconds. | ||
| + |     (= 12203 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        1768.9387       0.0181       0.0181       0.0182 | ||
| + |  Scale:       1765.9100       0.0182       0.0181       0.0184 | ||
| + |  Add:         2000.5875       0.0241       0.0240       0.0244 | ||
| + |  Triad:       2001.0897       0.0240       0.0240       0.0241 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
| + | hcl14.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 1 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity appears to be less than one microsecond. | ||
| + |  Each test below will take on the order of 9576 microseconds. | ||
| + |     (= 9576 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        2254.3285       0.0142       0.0142       0.0142 | ||
| + |  Scale:       2291.6040       0.0140       0.0140       0.0140 | ||
| + |  Add:         2779.5354       0.0173       0.0173       0.0173 | ||
| + |  Triad:       2804.8951       0.0177       0.0171       0.0215 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
| + | hcl15.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 1 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity appears to be less than one microsecond. | ||
| + |  Each test below will take on the order of 9788 microseconds. | ||
| + |     (= 9788 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        2223.0043       0.0146       0.0144       0.0161 | ||
| + |  Scale:       2264.6703       0.0142       0.0141       0.0142 | ||
| + |  Add:         2740.5171       0.0176       0.0175       0.0178 | ||
| + |  Triad:       2762.2584       0.0174       0.0174       0.0174 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
| + | hcl16.heterogeneous.ucd.ie | ||
| + |  ------------------------------------------------------------- | ||
| + |  STREAM version $Revision: 5.9 $ | ||
| + |  ------------------------------------------------------------- | ||
| + |  This system uses 8 bytes per DOUBLE PRECISION word. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Array size = 2000000, Offset = 0 | ||
| + |  Total memory required = 45.8 MB. | ||
| + |  Each test is run 10 times, but only | ||
| + |  the *best* time for each is used. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Number of Threads requested = 1 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Printing one line per active thread.... | ||
| + |  ------------------------------------------------------------- | ||
| + |  Your clock granularity appears to be less than one microsecond. | ||
| + |  Each test below will take on the order of 9590 microseconds. | ||
| + |     (= 9590 clock ticks) | ||
| + |  Increase the size of the arrays if this shows that | ||
| + |  you are not getting at least 20 clock ticks per test. | ||
| + |  ------------------------------------------------------------- | ||
| + |  WARNING -- The above is only a rough guideline. | ||
| + |  For best results, please be sure you know the | ||
| + |  precision of your system timer. | ||
| + |  ------------------------------------------------------------- | ||
| + |  Function      Rate (MB/s)   Avg time     Min time     Max time | ||
| + |  Copy:        2285.3882       0.0140       0.0140       0.0143 | ||
| + |  Scale:       2294.5687       0.0140       0.0139       0.0140 | ||
| + |  Add:         2800.9568       0.0172       0.0171       0.0172 | ||
| + |  Triad:       2823.2048       0.0170       0.0170       0.0171 | ||
| + |  ------------------------------------------------------------- | ||
| + |  Solution Validates | ||
| + |  ------------------------------------------------------------- | ||
Latest revision as of 17:38, 13 June 2011
Contents
Cluster Specifications
| Name | Make/Model | IP | Processor | Front Side Bus | L2 Cache | RAM | HDD 1 | HDD 2 | NIC | Rack | 
| hclswitch1 | Cisco Catalyst 3560G | 192.168.21.252 | N/A | N/A | N/A | N/A | N/A | N/A | 24 x Gigabit | 42 | 
| hclswitch2 | Cisco Catalyst 3560G | 192.168.21.253 | N/A | N/A | N/A | N/A | N/A | N/A | 24 x Gigabit | 41 | 
| N/A | APC Smart UPS 1500 | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | 1 – 2 | 
| Hcl01 (NIC1) | Dell Poweredge 750 | 192.168.21.3 | 3.4 Xeon | 800MHz | 1MB | 1GB | 80GB SATA | 250GB SATA | 2 x Gigabit | 3 | 
| Hcl01 (NIC2) |  | 192.168.21.103 |  |  |  |  |  |  |  | 3 | 
| Hcl02 (NIC1) | Dell Poweredge 750 | 192.168.21.4 | 3.4 Xeon | 800MHz | 1MB | 1GB (4x256) | 80GB SATA | 250GB SATA | 2 x Gigabit | 4 | 
| Hcl02 (NIC2) |  | 192.168.21.104 |  |  |  |  |  |  |  | 4 | 
| Hcl03 (NIC1) | Dell Poweredge 750 | 192.168.21.5 | 3.4 Xeon | 800MHz | 1MB | 1GB (2x512) | 80GB SATA | N/A | 2 x Gigabit | 5 | 
| Hcl03 (NIC2) |  | 192.168.21.105 |  |  |  |  |  |  |  | 5 | 
| Hcl04 (NIC1) | Dell Poweredge 750 | 192.168.21.6 | 3.4 Xeon | 800MHz | 1MB | 1GB | 80GB SATA | N/A | 2 x Gigabit | 6 | 
| Hcl04 (NIC2) |  | 192.168.21.106 |  |  |  |  |  |  |  | 6 | 
| Hcl05 (NIC1) | Dell Poweredge SC1425 | 192.168.21.7 | 3.6 Xeon | 800MHz | 2MB | 256MB | 80GB SATA | N/A | 2 x Gigabit | 7 | 
| Hcl05 (NIC2) |  | 192.168.21.107 |  |  |  |  |  |  |  | 7 | 
| Hcl06 (NIC1) | Dell Poweredge SC1425 | 192.168.21.8 | 3.0 Xeon | 800MHz | 2MB | 256MB | 80GB SATA | N/A | 2 x Gigabit | 8 | 
| Hcl06 (NIC2) |  | 192.168.21.108 |  |  |  |  |  |  |  | 8 | 
| Hcl07(NIC1) | Dell Poweredge 750 | 192.168.21.9 | 3.4 Xeon | 800MHz | 1MB | 256MB (1x256) | 80GB SATA | N/A | 2 x Gigabit | 9 | 
| Hcl07 (NIC2) |  | 192.168.21.109 |  |  |  |  |  |  |  | 9 | 
| Hcl08 (NIC1) | Dell Poweredge 750 | 192.168.21.10 | 3.4 Xeon | 800MHz | 1MB | 256MB (1x256) | 80GB SATA | N/A | 2 x Gigabit | 10 | 
| Hcl08 (NIC2) |  | 192.168.21.110 |  |  |  |  |  |  |  | 10 | 
| Hcl09 (NIC1) | IBM E-server 326 | 192.168.21.11 | 1.8 AMD Opteron | 1GHz | 1MB | 1GB | 80GB SATA | N/A | 2 x Gigabit | 11 | 
| Hcl09 (NIC2) |  | 192.168.21.111 |  |  |  |  |  |  |  | 11 | 
| Hcl10 (NIC1) | IBM E-server 326 | 192.168.21.12 | 1.8 AMD Opteron | 1GHz | 1MB | 1GB | 80GB SATA | N/A | 2 x Gigabit | 12 | 
| Hcl10 (NIC2) |  | 192.168.21.112 |  |  |  |  |  |  |  | 12 | 
| Hcl11 (NIC1) | IBM X-Series 306 | 192.168.21.13 | 3.2 P4 | 800MHz | 1MB | 512MB | 80GB SATA | N/A | 2 x Gigabit | 13 | 
| Hcl11 (NIC2) |  | 192.168.21.113 |  |  |  |  |  |  |  | 13 | 
| Hcl12 (NIC1) | HP Proliant DL 320 G3 | 192.168.21.14 | 3.4 P4 | 800MHz | 1MB | 512MB | 80GB SATA | N/A | 2 x Gigabit | 14 | 
| Hcl12 (NIC2) |  | 192.168.21.114 |  |  |  |  |  |  |  | 14 | 
| Hcl13 (NIC1) | HP Proliant DL 320 G3 | 192.168.21.15 | 2.9 Celeron | 533MHz | 256KB | 1GB | 80GB SATA | N/A | 2 x Gigabit | 15 | 
| Hcl13 (NIC2) |  | 192.168.21.115 |  |  |  |  |  |  |  | 15 | 
| Hcl14 (NIC1) | HP Proliant DL 140 G2 | 192.168.21.16 | 3.4 Xeon | 800MHz | 1MB | 1GB | 80GB SATA | N/A | 2 x Gigabit | 16 | 
| Hcl14 (NIC2) |  | 192.168.21.116 |  |  |  |  |  |  |  | 16 | 
| Hcl15 (NIC1) | HP Proliant DL 140 G2 | 192.168.21.17 | 2.8 Xeon | 800MHz | 1MB | 1GB | 80GB SATA | N/A | 2 x Gigabit | 17 | 
| Hcl15 (NIC2) |  | 192.168.21.117 |  |  |  |  |  |  |  | 17 | 
| Hcl16 (NIC1) | HP Proliant DL 140 G2 | 192.168.21.18 | 3.6 Xeon | 800MHz | 2MB | 384MB | 80GB SATA | N/A | 2 x Gigabit | 18 | 
| Hcl16 (NIC2) |  | 192.168.21.118 |  |  |  |  |  |  |  | 18 | 
Cluster Benchmarks
Stream
Cluster Performance
---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Number of processors = 18 Array size = 2000000 Offset = 0 The total memory requirement is 824.0 MB ( 45.8MB/task) You are running each test 10 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 24589.1750 0.0235 0.0234 0.0237 Scale: 24493.9786 0.0237 0.0235 0.0245 Add: 27594.1797 0.0314 0.0313 0.0315 Triad: 27695.7938 0.0313 0.0312 0.0315 ----------------------------------------------- Solution Validates! -----------------------------------------------
Individual Node Performance
hcl01.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 1 ------------------------------------------------------------- Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity appears to be less than one microsecond. Each test below will take on the order of 8386 microseconds. (= 8386 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 2615.4617 0.0125 0.0122 0.0132 Scale: 2609.2783 0.0125 0.0123 0.0133 Add: 3046.4707 0.0161 0.0158 0.0168 Triad: 3064.7322 0.0160 0.0157 0.0166 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------
hcl02.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 1 ------------------------------------------------------------- Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity appears to be less than one microsecond. Each test below will take on the order of 7790 microseconds. (= 7790 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 2740.6840 0.0117 0.0117 0.0120 Scale: 2745.3930 0.0117 0.0117 0.0117 Add: 3063.1599 0.0157 0.0157 0.0157 Triad: 3075.3572 0.0156 0.0156 0.0159 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------
hcl03.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 1 ------------------------------------------------------------- Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity appears to be less than one microsecond. Each test below will take on the order of 8382 microseconds. (= 8382 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 2788.6728 0.0115 0.0115 0.0115 Scale: 2722.0144 0.0118 0.0118 0.0121 Add: 3266.6166 0.0148 0.0147 0.0150 Triad: 3287.4249 0.0146 0.0146 0.0147 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------
hcl04.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 1 ------------------------------------------------------------- Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity appears to be less than one microsecond. Each test below will take on the order of 8378 microseconds. (= 8378 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 2815.4071 0.0114 0.0114 0.0114 Scale: 2751.5270 0.0116 0.0116 0.0117 Add: 3260.1979 0.0148 0.0147 0.0150 Triad: 3274.2183 0.0147 0.0147 0.0150 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------
hcl05.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 1 ------------------------------------------------------------- Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity appears to be less than one microsecond. Each test below will take on the order of 14916 microseconds. (= 14916 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 1581.8068 0.0203 0.0202 0.0204 Scale: 1557.1081 0.0207 0.0206 0.0215 Add: 1807.7015 0.0266 0.0266 0.0266 Triad: 1832.9646 0.0263 0.0262 0.0265 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------
hcl06.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 1 ------------------------------------------------------------- Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity appears to be less than one microsecond. Each test below will take on the order of 14824 microseconds. (= 14824 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 1558.3835 0.0206 0.0205 0.0206 Scale: 1550.3951 0.0207 0.0206 0.0209 Add: 1863.4239 0.0258 0.0258 0.0259 Triad: 1885.3916 0.0255 0.0255 0.0259 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------
hcl07.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 1 ------------------------------------------------------------- Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity appears to be less than one microsecond. Each test below will take on the order of 13642 microseconds. (= 13642 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 1759.4014 0.0183 0.0182 0.0186 Scale: 1740.2731 0.0184 0.0184 0.0185 Add: 2036.4962 0.0236 0.0236 0.0238 Triad: 2045.5920 0.0235 0.0235 0.0235 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------
hcl08.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 1 ------------------------------------------------------------- Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity appears to be less than one microsecond. Each test below will take on the order of 13476 microseconds. (= 13476 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 1809.6454 0.0177 0.0177 0.0177 Scale: 1784.8271 0.0179 0.0179 0.0179 Add: 2085.3320 0.0231 0.0230 0.0232 Triad: 2095.7094 0.0231 0.0229 0.0239 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------
hcl09.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 2 ------------------------------------------------------------- Printing one line per active thread.... Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity/precision appears to be 2 microseconds. Each test below will take on the order of 7890 microseconds. (= 3945 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 2838.6204 0.0114 0.0113 0.0115 Scale: 2774.6396 0.0116 0.0115 0.0117 Add: 3144.0396 0.0155 0.0153 0.0156 Triad: 3160.7905 0.0153 0.0152 0.0156 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------
hcl10.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 2 ------------------------------------------------------------- Printing one line per active thread.... Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity/precision appears to be 2 microseconds. Each test below will take on the order of 7959 microseconds. (= 3979 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 2864.2732 0.0113 0.0112 0.0114 Scale: 2784.2952 0.0116 0.0115 0.0118 Add: 3162.2553 0.0153 0.0152 0.0155 Triad: 3223.8430 0.0151 0.0149 0.0153 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------
hcl11.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 1 ------------------------------------------------------------- Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity/precision appears to be 1 microseconds. Each test below will take on the order of 13082 microseconds. (= 13082 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 1842.8984 0.0174 0.0174 0.0174 Scale: 1818.1816 0.0177 0.0176 0.0179 Add: 2109.8918 0.0228 0.0227 0.0231 Triad: 2119.0161 0.0227 0.0227 0.0227 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------
hcl12.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 1 ------------------------------------------------------------- Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity appears to be less than one microsecond. Each test below will take on the order of 12621 microseconds. (= 12621 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 1805.0657 0.0178 0.0177 0.0179 Scale: 1795.5385 0.0179 0.0178 0.0182 Add: 2089.9636 0.0230 0.0230 0.0230 Triad: 2097.5313 0.0229 0.0229 0.0231 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------
hcl13.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 1 ------------------------------------------------------------- Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity appears to be less than one microsecond. Each test below will take on the order of 12203 microseconds. (= 12203 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 1768.9387 0.0181 0.0181 0.0182 Scale: 1765.9100 0.0182 0.0181 0.0184 Add: 2000.5875 0.0241 0.0240 0.0244 Triad: 2001.0897 0.0240 0.0240 0.0241 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------
hcl14.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 1 ------------------------------------------------------------- Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity appears to be less than one microsecond. Each test below will take on the order of 9576 microseconds. (= 9576 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 2254.3285 0.0142 0.0142 0.0142 Scale: 2291.6040 0.0140 0.0140 0.0140 Add: 2779.5354 0.0173 0.0173 0.0173 Triad: 2804.8951 0.0177 0.0171 0.0215 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------
hcl15.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 1 ------------------------------------------------------------- Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity appears to be less than one microsecond. Each test below will take on the order of 9788 microseconds. (= 9788 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 2223.0043 0.0146 0.0144 0.0161 Scale: 2264.6703 0.0142 0.0141 0.0142 Add: 2740.5171 0.0176 0.0175 0.0178 Triad: 2762.2584 0.0174 0.0174 0.0174 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------
hcl16.heterogeneous.ucd.ie
------------------------------------------------------------- STREAM version $Revision: 5.9 $ ------------------------------------------------------------- This system uses 8 bytes per DOUBLE PRECISION word. ------------------------------------------------------------- Array size = 2000000, Offset = 0 Total memory required = 45.8 MB. Each test is run 10 times, but only the *best* time for each is used. ------------------------------------------------------------- Number of Threads requested = 1 ------------------------------------------------------------- Printing one line per active thread.... ------------------------------------------------------------- Your clock granularity appears to be less than one microsecond. Each test below will take on the order of 9590 microseconds. (= 9590 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 2285.3882 0.0140 0.0140 0.0143 Scale: 2294.5687 0.0140 0.0139 0.0140 Add: 2800.9568 0.0172 0.0171 0.0172 Triad: 2823.2048 0.0170 0.0170 0.0171 ------------------------------------------------------------- Solution Validates -------------------------------------------------------------

