_mm_mask_fmaddsub_ph
Classification
AVX-512, Arithmetic, CPUID Test: AVX512_FP16 + AVX512VL
Header File
immintrin.h
Instruction
VFMADDSUB132PH xmm {k}, xmm, xmm
Synopsis
__m128h _mm_mask_fmaddsub_ph(__m128h a, __mmask8 k, __m128h b, __m128h c);
Description
Multiply packed half-precision (16-bit) floating-point elements in "a" and "b", then alternately subtract and add the packed elements in "c" from/to the intermediate result (subtract for even-indexed elements, add for odd-indexed elements), and store the results in "dst" using writemask "k" (elements are copied from "a" when the corresponding mask bit is not set).
Operation
FOR j := 0 to 7
	IF k[j]
		IF ((j & 1) == 0)
			dst.fp16[j] := (a.fp16[j] * b.fp16[j]) - c.fp16[j]
		ELSE
			dst.fp16[j] := (a.fp16[j] * b.fp16[j]) + c.fp16[j]
		FI
	ELSE
		dst.fp16[j] := a.fp16[j]
	FI
ENDFOR
dst[MAX:128] := 0
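
The pseudocode above can be sketched as a scalar reference model in C. This is only an illustration of the per-element logic, not the intrinsic itself: the function name mask_fmaddsub_ref is hypothetical, and float stands in for the 16-bit elements so that the sketch runs without AVX512_FP16 hardware. Rounding can therefore differ from the real instruction, which operates on half-precision values.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical scalar model of the masked fmaddsub operation on 8 elements.
 * Assumption: float replaces the 16-bit element type, so results match the
 * instruction only when every value is exactly representable in both formats. */
void mask_fmaddsub_ref(float dst[8], const float a[8], uint8_t k,
                       const float b[8], const float c[8])
{
    assert(dst != NULL && a != NULL && b != NULL && c != NULL);

    for (size_t j = 0; j < 8; j++) {
        if ((k >> j) & 1u) {
            float prod = a[j] * b[j];
            /* Even-indexed elements subtract c, odd-indexed elements add c. */
            dst[j] = ((j & 1u) == 0) ? prod - c[j] : prod + c[j];
        } else {
            /* Mask bit clear: the element is copied from a. */
            dst[j] = a[j];
        }
    }
}
```

For example, with a = {1, 2, 3, 4, 5, 6, 7, 8}, every element of b equal to 2, every element of c equal to 1, and k = 0x0F, the model produces {1, 5, 5, 9, 5, 6, 7, 8}: the low four elements are computed and the high four are copied from a.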