_mm256_mask3_fnmadd_ph
Classification
AVX-512, Arithmetic, CPUID Test: AVX512_FP16
Header File
Instruction
VFNMADD132PH ymm {k}, ymm, ymm
Synopsis
_mm256_mask3_fnmadd_ph(__m256h a, __m256h b, __m256h c, __mmask16 k);
Description
Multiply packed half-precision (16-bit) floating-point elements in "a" and "b", add the negated intermediate result to packed elements in "c", and store the results in "dst" using writemask "k" (elements are copied from "c" when the corresponding mask bit is not set).
Operation
FOR j := 0 to 15
IF k[j]
dst.fp16[j] := -(a.fp16[j] * b.fp16[j]) + c.fp16[j]
ELSE
dst.fp16[j] := c.fp16[j]
FI
ENDFOR
dst[MAX:256] := 0