llvm.bitreverse for 2- and 4-bit scalars is lowered to shl, lshr, and, and or operations on the 2/4-bit types. E2E testing is done in https://github.com/intel/llvm/pull/13359; all tests pass except on GPU, where GEN12, DG2, and PVC fail.
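For reviewers, the shift-and-mask idea behind this lowering can be sketched in plain C++. This is an illustration only, not the code the lowering emits: the helper names `bitreverse2` and `bitreverse4` are hypothetical, and the narrow values are assumed to live in the low bits of a `std::uint8_t`, since C++ has no native 2- or 4-bit integer types.

```cpp
#include <cstdint>

// Hypothetical helper: reverse the low 2 bits of x (upper bits are ignored).
constexpr std::uint8_t bitreverse2(std::uint8_t x) {
  x &= 0x3;
  // Move bit 0 up and bit 1 down, then mask back to 2 bits.
  return static_cast<std::uint8_t>(((x << 1) | (x >> 1)) & 0x3);
}

// Hypothetical helper: reverse the low 4 bits of x (upper bits are ignored).
constexpr std::uint8_t bitreverse4(std::uint8_t x) {
  x &= 0xF;
  // Step 1: swap adjacent bits (0 <-> 1, 2 <-> 3).
  x = static_cast<std::uint8_t>(((x & 0x5) << 1) | ((x >> 1) & 0x5));
  // Step 2: swap the two 2-bit halves.
  x = static_cast<std::uint8_t>(((x & 0x3) << 2) | ((x >> 2) & 0x3));
  return x;
}

// Compile-time sanity checks.
static_assert(bitreverse2(0b01) == 0b10, "2-bit reversal");
static_assert(bitreverse4(0b0001) == 0b1000, "4-bit reversal");
static_assert(bitreverse4(0b1011) == 0b1101, "4-bit reversal");
```

Only left shifts, logical right shifts, and, and or are used, which matches the set of operations named in the description above.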