PSRLW - Packed Shift Right Logical Word

PSRLW xmm1, imm8    (S2    the number of bits is specified by imm8.
__m128i _mm_srli_epi16(__m128i m, int count)    the number of bits is specified by count.
PSRLW xmm1, xmm2/m128    (S2    the number of bits is specified by the lower QWORD of xmm2/m128.
__m128i _mm_srl_epi16(__m128i m, __m128i count)    the number of bits is specified by the lower QWORD of count

For each WORD, shift the bits to the right by the specified number of bits. Emptied upper bits are zeroed.
VPSRLW xmm1, xmm2, imm8    (V1    the number of bits is specified by imm8.
__m128i _mm_srli_epi16(__m128i m, int count)    the number of bits is specified by count.
VPSRLW xmm1, xmm2, xmm3/m128    (V1    the number of bits is specified by the lower QWORD of xmm3/m128.
__m128i _mm_srl_epi16(__m128i m, __m128i count)    the number of bits is specified by the lower QWORD of count.
VPSRLW xmm1{k1}{z}, xmm2/m128, imm8    (V5+BW+VL    the number of bits is specified by imm8.
__m128i _mm_mask_srli_epi16(__m128i s, __mmask8 k, __m128i m, unsigned int count)    the number of bits is specified by count.
__m128i _mm_maskz_srli_epi16(__mmask8 k, __m128i m, unsigned int count)    the number of bits is specified by count.
VPSRLW xmm1{k1}{z}, xmm2, xmm3/m128    (V5+BW+VL    the number of bits is specified by the lower QWORD of xmm3/m128.
__m128i _mm_mask_srl_epi16(__m128i s, __mmask8 k, __m128i m, __m128i count)    the number of bits is specified by the lower QWORD of count.
__m128i _mm_maskz_srl_epi16(__mmask8 k, __m128i m, __m128i count)    the number of bits is specified by the lower QWORD of count.

For each WORD, shift the bits to the right by the specified number of bits. Emptied upper bits are zeroed.
VPSRLW ymm1, ymm2, imm8    (V2    the number of bits is specified by imm8.
__m256i _mm256_srli_epi16(__m256i m, int count)    the number of bits is specified by count.
VPSRLW ymm1, ymm2, xmm3/m128    (V2    the number of bits is specified by the lower QWORD of xmm3/m128.
__m256i _mm256_srl_epi16(__m256i m, __m128i count)    the number of bits is specified by the lower QWORD of count.
VPSRLW ymm1{k1}{z}, ymm2/m256, imm8    (V5+BW+VL    the number of bits is specified by imm8.
__m256i _mm256_mask_srli_epi16(__m256i s, __mmask16 k, __m256i m, unsigned int count)    the number of bits is specified by count.
__m256i _mm256_maskz_srli_epi16(__mmask16 k, __m256i m, unsigned int count)    the number of bits is specified by count.
VPSRLW ymm1{k1}{z}, ymm2, xmm3/m128    (V5+BW+VL    the number of bits is specified by the lower QWORD of xmm3/m128.
__m256i _mm256_mask_srl_epi16(__m256i s, __mmask16 k, __m256i m, __m128i count)    the number of bits is specified by the lower QWORD of count.
__m256i _mm256_maskz_srl_epi16(__mmask16 k, __m256i m, __m128i count)    the number of bits is specified by the lower QWORD of count.

For each WORD, shift the bits to the right by the specified number of bits. Emptied upper bits are zeroed.
VPSRLW zmm1{k1}{z}, zmm2/m512, imm8    (V5+BW    the number of bits is specified by imm8.
__m512i _mm512_srli_epi16(__m512i m, unsigned int count)    the number of bits is specified by count.
__m512i _mm512_mask_srli_epi16(__m512i s, __mmask32 k, __m512i m, unsigned int count)    the number of bits is specified by count.
__m512i _mm512_maskz_srli_epi16(__mmask32 k, __m512i m, unsigned int count)    the number of bits is specified by count.
VPSRLW zmm1{k1}{z}, zmm2, xmm3/m128    (V5+BW    the number of bits is specified by the lower QWORD of xmm3/m128.
__m512i _mm512_srl_epi16(__m512i m, __m128i count)    the number of bits is specified by the lower QWORD of count.
__m512i _mm512_mask_srl_epi16(__m512i s, __mmask32 k, __m512i m, __m128i count)    the number of bits is specified by the lower QWORD of count.
__m512i _mm512_maskz_srl_epi16(__mmask32 k, __m512i m, __m128i count)    the number of bits is specified by the lower QWORD of count.

For each WORD, shift the bits to the right by the specified number of bits. Emptied upper bits are zeroed.

x86/x64 SIMD Instruction List  Feedback