diff options
author | Jussi Kivilinna <jussi.kivilinna@iki.fi> | 2013-06-08 09:00:59 (GMT) |
---|---|---|
committer | Herbert Xu <herbert@gondor.apana.org.au> | 2013-06-21 06:44:23 (GMT) |
commit | acfffdb803b6ab5d520bf9a816bfb155ba3a263d (patch) | |
tree | 571852276bf31e0e273c2d8a223998e06ef5df3f /arch/xtensa | |
parent | 046174d766ad40148ecf3564755055fd91ed2569 (diff) | |
download | linux-acfffdb803b6ab5d520bf9a816bfb155ba3a263d.tar.xz |
crypto: camellia-aesni-avx2 - tune assembly code for more performance
Add implementation tuned for more performance on real hardware. Changes are
mostly around the part mixing 128-bit extract and insert instructions and
AES-NI instructions. Also 'vpbroadcastb' instructions have been change to
'vpshufb with zero mask'.
Tests on Intel Core i5-4570:
tcrypt ECB results, old-AVX2 vs new-AVX2:
size 128bit key 256bit key
enc dec enc dec
256 1.00x 1.00x 1.00x 1.00x
1k 1.08x 1.09x 1.05x 1.06x
8k 1.06x 1.06x 1.06x 1.06x
tcrypt ECB results, AVX vs new-AVX2:
size 128bit key 256bit key
enc dec enc dec
256 1.00x 1.00x 1.00x 1.00x
1k 1.51x 1.50x 1.52x 1.50x
8k 1.47x 1.48x 1.48x 1.48x
Signed-off-by: Jussi Kivilinna <jussi.kivilinna@iki.fi>
Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>
Diffstat (limited to 'arch/xtensa')
0 files changed, 0 insertions, 0 deletions