diff options
| author | Eric Biggers <ebiggers@google.com> | 2025-04-04 21:09:30 -0700 |
|---|---|---|
| committer | Herbert Xu <herbert@gondor.apana.org.au> | 2025-04-07 13:22:28 +0800 |
| commit | 570ef50a15d7caa4d63d66f1e28d967065733f84 (patch) | |
| tree | 75eb0d5b88ba399d7945ac7144d2de96ecd2642d /tools/lib/python | |
| parent | bc23fe6dc172b09778ce3d75a9157decd153f4ef (diff) | |
crypto: x86/aes-xts - optimize _compute_first_set_of_tweaks for AVX-512
Optimize the AVX-512 version of _compute_first_set_of_tweaks by using
vectorized shifts to compute the first vector of tweak blocks, and by
using byte-aligned shifts when multiplying by x^8.
AES-XTS performance on AMD Ryzen 9 9950X (Zen 5) improves by about 2%
for 4096-byte messages or 6% for 512-byte messages. AES-XTS performance
on Intel Sapphire Rapids improves by about 1% for 4096-byte messages or
3% for 512-byte messages. Code size decreases by 75 bytes which
outweighs the increase in rodata size of 16 bytes.
Signed-off-by: Eric Biggers <ebiggers@google.com>
Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>
Diffstat (limited to 'tools/lib/python')
0 files changed, 0 insertions, 0 deletions
