_mm_mask_cvtne2ps_pbh
Classification
AVX-512, Convert, CPUID Test: AVX512_BF16
Header File
Instruction
VCVTNE2PS2BF16 xmm {k}, xmm, xmm
Synopsis
_mm_mask_cvtne2ps_pbh(__m128bh src, __mmask8 k, __m128 a, __m128 b);
Description
Convert packed single-precision (32-bit) floating-point elements in two vectors "a" and "b" to packed BF16 (16-bit) floating-point elements, and store the results in single vector "dst" using writemask "k" (elements are copied from "src" when the corresponding mask bit is not set).
Operation
FOR j := 0 to 7
IF k[j]
IF j < 4
t := b.fp32[j]
ELSE
t := a.fp32[j-4]
FI
dst.word[j] := Convert_FP32_To_BF16(t)
ELSE
dst.word[j] := src.word[j]
FI
ENDFOR
dst[MAX:128] := 0