Commit ed2add73 authored by Ondrej Tichacek's avatar Ondrej Tichacek
Browse files

Update file README.md

parent 75d03b39
Loading
Loading
Loading
Loading
+9 −36
Original line number Diff line number Diff line
@@ -72,6 +72,15 @@ srun gmx_mpi mdrun -deffnm ${NAME} -nsteps -1 -v -maxh 1

## Tests 1 (SCALING over network)

Tests 1.1 - 1.4 test performance scaling over multple nodes over the low-latency network. The following results were obtained for the current cluster (2 x Intel(R) Xeon(R) Gold 6140 CPU @ 2.30GHz per node).

| Cores (Nodes)   |  36 (1) |  72 (2) | 144 (4) | 216 (6) | 288 (8) | 360 (10) | 432 (12) | 504 (14) | 576 (16)| 648 (18)| 720 (20)|
| ------ | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ns/day |  13 |  25 |  48 |  71 |  90 | 104 | 130 | 152 | 167 | 175 | 188 |

** Minimal required performance** for passing the test:
For each job, the results for the same number of nodes with the number of cores per node restricted to 36 should be at least higher than 80% of the value obtained with our older aurum cluster, in the table above.

**The following description/conditions apply to tests 1.1 through 1.4**
- **Short description of the test**: Full (36 cores max) occupation of the computational nodes using 1 or more nodes per job. There might be few unallocated nodes if the number of computational nodes is not a multiple of number of computational nodes per job in each task.
- **Storage for tasks:** This test must try all 5 different NFS shares in storage1, storage2, scratch1,  scratch2, cryo2.
@@ -89,16 +98,7 @@ srun gmx_mpi mdrun -deffnm ${NAME} -nsteps -1 -v -maxh 1
- **Number of computational nodes per job in each task:**  $N^{\text{scaling\_test\_cpu}}_{\text{task,job}}$ = [1, 2, 4, 6, 8, 10, 12, 14, 16, 18 and 20] Nodes.
- **Total duration test:** $\sim 1 \text{ hour per task} \times N^{\text{scaling\_test\_cpu}}_{\text{task,job}} \times 5 \text{ storages}$
- **Number of simultaneous jobs in each task:** $J_{cpu}^{max}$
- **Current performance in aurum for each job:**

2 x Intel(R) Xeon(R) Gold 6140 CPU @ 2.30GHz per node

| Cores (Nodes)   |  36 (1) |  72 (2) | 144 (4) | 216 (6) | 288 (8) | 360 (10) | 432 (12) | 504 (14) | 576 (16)| 648 (18)| 720 (20)|
| ------ | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ns/day |  13 |  25 |  48 |  71 |  90 | 104 | 130 | 152 | 167 | 175 | 188 |

- **Minimal performance for each job:**
The results for the same number of nodes with the number of cores per node restricted to 36 should be at least higher than 80% of the value obtained with our older aurum cluster.

### 1.2 Tests scaling in the bigmem nodes
- **Test name:**: scaling_test_mem
@@ -108,16 +108,7 @@ The results for the same number of nodes with the number of cores per node restr
- **Number of computational nodes per job in each task:**  $N^{\text{scaling\_test\_men}}_{\text{task,job}}$ = [1, 2] Nodes.
- **Total duration test:** $\sim 1 \text{ hour per task} \times N^{\text{scaling\_test\_men}}_{\text{task,job}} \times 5 \text{ storages}$
- **Number of simultaneous jobs in each task:** $J_{mem}^{max}$
- **Current performance in aurum for each job:**

2 x Intel(R) Xeon(R) Gold 6140 CPU @ 2.30GHz per node

| Cores (Nodes)   |  36 (1) |  72 (2) |
| ------ | --- | --- |
| ns/day |  13 |  25 |

- **Minimal performance for each job:**
The results for the same number of nodes with the number of cores per node restricted to 36 should be at least higher than 80% of the value obtained with our older aurum cluster.


## 1.3 Tests scaling in the gpu nodes
@@ -129,16 +120,7 @@ The results for the same number of nodes with the number of cores per node restr
- **Number of cores per node:** Only 36 cores can be used for each node in this test 
- **Total duration test:** $\sim 1 \text{ hour per task} \times N^{\text{scaling\_test\_gpu}}_{\text{task,job}} \times 5 \text{ storages}$
- **Number of simultaneous jobs in each task:** $J_{gpu}^{max}$
- **Current performance in aurum for each job:**

2 x Intel(R) Xeon(R) Gold 6140 CPU @ 2.30GHz per node

| Cores (Nodes)   |  36 (1) |  72 (2) | 144 (4) | 216 (6) | 288 (8) | 360 (10) | 432 (12) | 504 (14) | 576 (16)| 648 (18)| 720 (20)|
| ------ | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ns/day |  13 |  25 |  48 |  71 |  90 | 104 | 130 | 152 | 167 | 175 | 188 |

- **Minimal performance for each job:**
The results for the same number of nodes with the number of cores per node restricted to 36 should be at least higher than 80% of the value obtained with our older aurum cluster.



@@ -150,16 +132,7 @@ The results for the same number of nodes with the number of cores per node restr
- **Number of computational nodes per job in each task:**  $N^{\text{scaling\_test\_biggpu}}_{\text{task,job}}$ = [1, 2, 4, 6, 8] Nodes.
- **Total duration test:** $\sim 1 \text{ hour per task} \times N^{\text{scaling\_test\_biggpu}}_{\text{task,job}} \times 5 \text{ storages}$
- **Number of simultaneous jobs in each task:** $J_{biggpu}^{max}$
- **Current performance in aurum for each job:**

2 x Intel(R) Xeon(R) Gold 6140 CPU @ 2.30GHz per node

| Cores (Nodes)   |  36 (1) |  72 (2) | 144 (4) | 216 (6) | 288 (8) |
| ------          | ---     | ---     | ---     | ---     | ---     |
| ns/day          |  13     |  25     |      48 |      71 |      90 |

- **Minimal performance for each job:**
The results for the same number of nodes with the number of cores per node restricted to 36 should be at least higher than 80% of the value obtained with our older aurum cluster.


## 2. Test endurance cluster - not internode calculations