_mm256_mask3_fmadd_ps
Classification
AVX-512, Arithmetic, CPUID Test: AVX512F + AVX512VL
Header File
immintrin.h
Instruction
VFMADD132PS ymm {k}, ymm, ymm
Synopsis
__m256 _mm256_mask3_fmadd_ps(__m256 a, __m256 b, __m256 c, __mmask8 k);
Description
Multiply packed single-precision (32-bit) floating-point elements in "a" and "b", add the intermediate result to packed elements in "c", and store the results in "dst" using writemask "k" (elements are copied from "c" when the corresponding mask bit is not set).
Operation
FOR j := 0 to 7
    i := j*32
    IF k[j]
        dst[i+31:i] := (a[i+31:i] * b[i+31:i]) + c[i+31:i]
    ELSE
        dst[i+31:i] := c[i+31:i]
    FI
ENDFOR
dst[MAX:256] := 0
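Example
The operation above can be sketched in C as follows. This is a minimal illustration, not part of the reference: the helper names mask3_fmadd_ps_ref and mask3_fmadd_ps_8 are hypothetical, and the intrinsic path is compiled only when the compiler defines both __AVX512F__ and __AVX512VL__ (for example with -mavx512f -mavx512vl on GCC or Clang). Otherwise the wrapper falls back to the scalar model.

```c
#include <stdint.h>
#if defined(__AVX512F__) && defined(__AVX512VL__)
#include <immintrin.h>
#endif

/* Scalar model of the Operation pseudocode (hypothetical helper).
 * Bit j of k selects lane j. Unselected lanes are copied from c.
 * Note: a[j] * b[j] + c[j] may round twice here, whereas a hardware
 * FMA rounds once, so the last bit can differ for some inputs. */
void mask3_fmadd_ps_ref(const float a[8], const float b[8],
                        const float c[8], uint8_t k, float dst[8])
{
    for (int j = 0; j < 8; ++j) {
        if ((k >> j) & 1u)
            dst[j] = a[j] * b[j] + c[j];
        else
            dst[j] = c[j];
    }
}

/* Hypothetical wrapper: uses the intrinsic when AVX512F and AVX512VL
 * are enabled at compile time, and the scalar model otherwise. */
void mask3_fmadd_ps_8(const float a[8], const float b[8],
                      const float c[8], uint8_t k, float dst[8])
{
#if defined(__AVX512F__) && defined(__AVX512VL__)
    __m256 va = _mm256_loadu_ps(a);   /* unaligned loads of 8 floats */
    __m256 vb = _mm256_loadu_ps(b);
    __m256 vc = _mm256_loadu_ps(c);
    __m256 r  = _mm256_mask3_fmadd_ps(va, vb, vc, (__mmask8)k);
    _mm256_storeu_ps(dst, r);
#else
    mask3_fmadd_ps_ref(a, b, c, k, dst);
#endif
}
```

With a = {1, ..., 8}, every element of b equal to 2, every element of c equal to 10, and k = 0x0F, lanes 0 to 3 receive a*b + c (12, 14, 16, 18) and lanes 4 to 7 keep the value 10 from c. Because unselected lanes come from "c", this form suits accumulator updates where "c" holds the running sum.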