Search found 2 matches
- 2023-12-25, 7:35:06
- Forum: Agner's CPU blog
- Topic: Is using BSF instruction instead of using GNU C __builtin_ctz inefficient?
- Replies: 1
- Views: 29272
Is using BSF instruction instead of using GNU C __builtin_ctz inefficient?
I posted this question on stackoverflow: https://stackoverflow.com/questions/77707238/looking-for-an-efficient-function-to-find-an-index-of-max-element-in-simd-vector. The answer claims, among others, that using "legacy BSF instruction (slow on AMD), instead of using GNU C __builtin_ctz on non-MSVC"...
- 2023-12-23, 6:33:45
- Forum: Agner's CPU blog
- Topic: Efficiency of array<Vec32uc, 8> vs. ContainerV<Vec32uc, 8>
- Replies: 1
- Views: 21203
Efficiency of array<Vec32uc, 8> vs. ContainerV<Vec32uc, 8>
Is there an obvious performance penalty in using array<Vec32uc, 8> instead of ContainerV<Vec32uc, 8>? One reason for this choice is https://godbolt.org/ not having vector_containers.h available.