Noticed on #96882 - SSE/AVX code tends to result in a lot of bitcasts as the `__m128i / __m256i / __m512i` types are always treated as `vXi64`. CC @davemgreen