_mm256_mask_fmsubadd_ph
Classification
AVX-512, Arithmetic, CPUID Test: AVX512_FP16
Header File
immintrin.h
Instruction
VFMSUBADD132PH ymm {k}, ymm, ymm
Synopsis
 _mm256_mask_fmsubadd_ph(__m256h a, __mmask16 k, __m256h b, __m256h c);
Description
Multiply packed half-precision (16-bit) floating-point elements in "a" and "b", alternatively subtract and add packed elements in "c" to/from the intermediate result, and store the results in "dst" using writemask "k" (elements are copied from "a" when the corresponding mask bit is not set).
Operation
FOR j := 0 to 15
	IF k[j]
		IF ((j & 1) == 0)
			dst.fp16[j] := (a.fp16[j] * b.fp16[j]) + c.fp16[j]
		ELSE
			dst.fp16[j] := (a.fp16[j] * b.fp16[j]) - c.fp16[j]
		FI
	ELSE
		dst.fp16[j] := a.fp16[j]
	FI
ENDFOR
dst[MAX:256] := 0