_mm256_fmaddsub_ps
Classification
AVX_ALL, Arithmetic, CPUID Test: FMA
Header File
Instruction
VFMADDSUB132PS ymm, ymm, ymm
Synopsis
_mm256_fmaddsub_ps(__m256 a, __m256 b, __m256 c);
Description
Multiply packed single-precision (32-bit) floating-point elements in "a" and "b", alternatively add and subtract packed elements in "c" to/from the intermediate result, and store the results in "dst".
Operation
FOR j := 0 to 7
i := j*32
IF ((j & 1) == 0)
dst[i+31:i] := (a[i+31:i] * b[i+31:i]) - c[i+31:i]
ELSE
dst[i+31:i] := (a[i+31:i] * b[i+31:i]) + c[i+31:i]
FI
ENDFOR
dst[MAX:256] := 0