_mm_mask3_fnmsub_round_sh
Classification
AVX-512, Arithmetic, CPUID Test: AVX512_FP16
Header File
Instruction
VFNMSUB132SH xmm {k}, xmm, xmm {er}
Synopsis
_mm_mask3_fnmsub_round_sh(__m128h a, __m128h b, __m128h c, __mmask8 k, const int rounding);
Description
Multiply the lower half-precision (16-bit) floating-point elements in "a" and "b", and subtract the lower element in "c" from the negated intermediate result. Store the result in the lower element of "dst" using writemask "k" (the element is copied from "c" when mask bit 0 is not set), and copy the upper 7 packed elements from "c" to the upper elements of "dst".
[round_note]
Operation
IF k[0]
dst.fp16[0] := -(a.fp16[0] * b.fp16[0]) - c.fp16[0]
ELSE
dst.fp16[0] := c.fp16[0]
FI
dst[127:16] := c[127:16]
dst[MAX:128] := 0