_mm256_maskz_fmaddsub_ph
Classification
AVX-512, Arithmetic, CPUID Test: AVX512_FP16 + AVX512VL
Header File
#include <immintrin.h>
Instruction
VFMADDSUB132PH ymm {z}, ymm, ymm
Synopsis
__m256h _mm256_maskz_fmaddsub_ph(__mmask16 k, __m256h a, __m256h b, __m256h c);
Description
Multiply packed half-precision (16-bit) floating-point elements in "a" and "b", alternately add and subtract packed elements in "c" to/from the intermediate result (subtract in even-indexed elements, add in odd-indexed elements), and store the results in "dst" using zeromask "k" (elements are zeroed out when the corresponding mask bit is not set).
Operation
FOR j := 0 to 15
    IF k[j]
        IF ((j & 1) == 0)
            dst.fp16[j] := (a.fp16[j] * b.fp16[j]) - c.fp16[j]
        ELSE
            dst.fp16[j] := (a.fp16[j] * b.fp16[j]) + c.fp16[j]
        FI
    ELSE
        dst.fp16[j] := 0
    FI
ENDFOR
dst[MAX:256] := 0
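The Operation pseudocode above can be sketched as a portable scalar model in C. This is only an illustration, not the intrinsic itself: the function name maskz_fmaddsub_ref is hypothetical, and float stands in for the 16-bit element type, so results are not rounded the way the hardware's half-precision operation rounds them. It needs no AVX512_FP16 support to compile or run.

```c
#include <stdint.h>

/* Hypothetical scalar reference for the 16-lane operation.
 * Assumption: float replaces fp16, so rounding differs from the
 * hardware instruction. */
void maskz_fmaddsub_ref(float dst[16], uint16_t k,
                        const float a[16], const float b[16],
                        const float c[16])
{
    for (int j = 0; j < 16; j++) {
        if ((k >> j) & 1u) {
            float prod = a[j] * b[j];
            /* Even-indexed lanes subtract c, odd-indexed lanes add c. */
            dst[j] = ((j & 1) == 0) ? prod - c[j] : prod + c[j];
        } else {
            /* Zeromask: lane is cleared when its mask bit is not set. */
            dst[j] = 0.0f;
        }
    }
}
```

For instance, with every lane of a, b, and c set to 2, 3, and 1, and k = 0x0007, lanes 0 to 2 become 5, 7, 5, and lanes 3 to 15 are zeroed.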