vault backup: 2023-10-04 12:13:31
This commit is contained in:
parent
e829078584
commit
6f6741a669
3 changed files with 19 additions and 10 deletions
|
@ -9,7 +9,7 @@ B -> consecutive elements in a collumn
|
|||
Não vai ser vetorizável.
|
||||
|
||||
|
||||
#### b)
|
||||
#### b) Enable vectorization
|
||||
result of change cycles to i, k , j :
|
||||
A -> same element
|
||||
C -> consecutive elements in a row -> consecutive access in the vector
|
||||
|
@ -29,8 +29,8 @@ Without vectorization:
|
|||
|
||||
With vectorization:
|
||||
![[Pasted image 20231004115135.png]]
|
||||
|
||||
#### c)
|
||||
Estimated: ( n^3 / 2 )* 8
|
||||
#### c)Measure and analyze results
|
||||
|
||||
| N | Version | Time | CPI | \#I |
|
||||
| --- | -------- | ---- | --- | --- |
|
||||
|
@ -43,3 +43,12 @@ With vectorization:
|
|||
>srun --partition=cpar perf stat -e cycles,instructions ./a.out
|
||||
|
||||
|
||||
#### d) Vectorization fine-tuning
|
||||
Ganhos de 4 vezes mais.
|
||||
|
||||
|
||||
|
||||
|
||||
## Ex. 3
|
||||
#### a)
|
||||
2 operações em FP
|
Loading…
Add table
Add a link
Reference in a new issue