_mm256_mask_cvtne2ps_pbh
Classification
AVX-512, Convert, CPUID Test: AVX512_BF16
Header File
#include <immintrin.h>
Instruction
VCVTNE2PS2BF16 ymm {k}, ymm, ymm
Synopsis
__m256bh _mm256_mask_cvtne2ps_pbh(__m256bh src, __mmask16 k, __m256 a, __m256 b);
Description
Convert packed single-precision (32-bit) floating-point elements in two vectors "a" and "b" to packed BF16 (16-bit) floating-point elements, and store the results in a single vector "dst" using writemask "k" (elements are copied from "src" when the corresponding mask bit is not set). The elements of "b" fill the lower eight elements of "dst" and the elements of "a" fill the upper eight.
Operation
FOR j := 0 to 15
    IF k[j]
        IF j < 8
            t := b.fp32[j]
        ELSE
            t := a.fp32[j-8]
        FI
        dst.word[j] := Convert_FP32_To_BF16(t)
    ELSE
        dst.word[j] := src.word[j]
    FI
ENDFOR
dst[MAX:256] := 0
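
The operation above can be sketched as a portable scalar model in C, which may help when checking results on a machine without AVX512_BF16. This is an illustrative sketch, not the hardware definition: the names fp32_to_bf16_model and mask_cvtne2ps_pbh_model are hypothetical, vectors are represented as plain arrays, and the body of the conversion helper is an assumption about Convert_FP32_To_BF16 (round to nearest even, denormal inputs treated as signed zero, NaNs forced quiet), which this entry does not spell out.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical scalar model of Convert_FP32_To_BF16.
 * Assumed behavior: round to nearest even, denormal inputs become
 * signed zero, NaN inputs are returned as quiet NaNs. */
static uint16_t fp32_to_bf16_model(float f)
{
    uint32_t x;
    memcpy(&x, &f, sizeof x);              /* reinterpret the float bits */

    uint32_t exp = x & 0x7F800000u;
    uint32_t man = x & 0x007FFFFFu;

    if (exp == 0)                          /* zero or denormal */
        return (uint16_t)((x >> 16) & 0x8000u);

    if (exp == 0x7F800000u) {              /* infinity or NaN */
        uint16_t hi = (uint16_t)(x >> 16);
        if (man != 0)
            hi |= 0x0040u;                 /* set the quiet bit for NaN */
        return hi;
    }

    /* Normal value: add a bias so that truncation rounds to nearest even. */
    uint32_t lsb = (x >> 16) & 1u;
    x += 0x7FFFu + lsb;
    return (uint16_t)(x >> 16);
}

/* Hypothetical scalar model of the masked two-source conversion.
 * dst and src hold 16 BF16 words; a and b hold 8 floats each.
 * Elements 0..7 of dst come from b, elements 8..15 from a. */
static void mask_cvtne2ps_pbh_model(uint16_t dst[16], const uint16_t src[16],
                                    uint16_t k,
                                    const float a[8], const float b[8])
{
    for (int j = 0; j < 16; j++) {
        if ((k >> j) & 1u) {
            float t = (j < 8) ? b[j] : a[j - 8];
            dst[j] = fp32_to_bf16_model(t);
        } else {
            dst[j] = src[j];               /* mask bit clear: keep src */
        }
    }
}
```

With k = 0x0101, for example, only dst element 0 (converted from b[0]) and dst element 8 (converted from a[0]) are written; the other fourteen elements are copied from src.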