_mm_mask3_fmsub_ph
Classification
AVX-512, Arithmetic, CPUID Test: AVX512_FP16
Header File
Instruction
VFMSUB132PH xmm {k}, xmm, xmm
Synopsis
_mm_mask3_fmsub_ph(__m128h a, __m128h b, __m128h c, __mmask8 k);
Description
Multiply packed half-precision (16-bit) floating-point elements in "a" and "b", subtract packed elements in "c" from the intermediate result, and store the results in "dst" using writemask "k" (elements are copied from "c" when the corresponding mask bit is not set).
Operation
FOR j := 0 to 7
IF k[j]
dst.fp16[j] := (a.fp16[j] * b.fp16[j]) - c.fp16[j]
ELSE
dst.fp16[j] := c.fp16[j]
FI
ENDFOR
dst[MAX:128] := 0