vault backup: 2023-10-04 12:33:31
This commit is contained in:
parent
47bd74f5ee
commit
ce64639d15
2 changed files with 27 additions and 15 deletions
|
@ -61,21 +61,33 @@ Ganhos de 4 vezes mais.
|
|||
conclusion: 20 GFlop/s
|
||||
|
||||
#### b)
|
||||
peak with vectorization: 20 GFlop/s
|
||||
peak without vectorization: 5 GFlop/s
|
||||
memory bandwith limitation: 20 GFlop/s
|
||||
real achievable performance:
|
||||
peak with vectorization: continuous 20 GFlop/s
|
||||
peak without vectorization: continuous 5 GFlop/s
|
||||
memory bandwith limitation: ***see alinea d)***
|
||||
real achievable performance:***see alinea c)***
|
||||
measured performance:
|
||||
|
||||
#### c)
|
||||
#### d)
|
||||
memory bandwith limitation
|
||||
|
||||
| GFlop/s | Flop/Byte |
|
||||
| ------- | --------- |
|
||||
| 0.125 | |
|
||||
| 0.25 | |
|
||||
| 0.125 | 2.5 |
|
||||
| 0.25 | 5 |
|
||||
| 0.5 | 10 |
|
||||
| 1 | 20 |
|
||||
| 2 | |
|
||||
| 4 | |
|
||||
| 8 | |
|
||||
| 2 | 40 |
|
||||
| 4 | 80 |
|
||||
| 8 | 160 |
|
||||
|
||||
#### c)
|
||||
2 FOP (operações vírgula flutuante) -> 2 doubles (16B)
|
||||
1 operation/8B -> 0.125
|
||||
|
||||
| GFlop/s | Flop/Byte |
|
||||
| ------- | --------- |
|
||||
| 0.125 | 2.5 |
|
||||
|
||||
|
||||
#### d)
|
||||
AVX -> 256b -> 4 doubles
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue