_mm256_hadd_ps
Classification
AVX_ALL, Arithmetic, CPUID Test: AVX
Header File
#include <immintrin.h>
Instruction
VHADDPS ymm, ymm, ymm
Synopsis
__m256 _mm256_hadd_ps(__m256 a, __m256 b);
Description
Horizontally add adjacent pairs of single-precision (32-bit) floating-point elements in "a" and "b", and pack the results in "dst".
Operation
dst[31:0] := a[63:32] + a[31:0]
dst[63:32] := a[127:96] + a[95:64]
dst[95:64] := b[63:32] + b[31:0]
dst[127:96] := b[127:96] + b[95:64]
dst[159:128] := a[191:160] + a[159:128]
dst[191:160] := a[255:224] + a[223:192]
dst[223:192] := b[191:160] + b[159:128]
dst[255:224] := b[255:224] + b[223:192]
dst[MAX:256] := 0
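Example
The operation above can be sketched in C as follows. This is a minimal illustration, not part of the reference entry: hadd_ps_ref and hadd_ps_avx are hypothetical helper names, and element i of each float array is assumed to correspond to bits [32*i+31 : 32*i] of the vector. Note that each 128-bit lane is processed independently, so sums from "a" and "b" alternate in pairs rather than all sums of "a" preceding all sums of "b". The intrinsic version is compiled only when AVX is enabled (for example with -mavx on GCC or Clang).

```c
#include <stdio.h>
#ifdef __AVX__
#include <immintrin.h>
#endif

/* Hypothetical scalar model of the Operation pseudocode.
   a, b, dst hold 8 floats each; element 0 is bits [31:0]. */
void hadd_ps_ref(const float a[8], const float b[8], float dst[8])
{
    for (int lane = 0; lane < 2; ++lane) {
        const int o = lane * 4;            /* first element of this 128-bit lane */
        dst[o + 0] = a[o + 1] + a[o + 0];  /* low pair of a  */
        dst[o + 1] = a[o + 3] + a[o + 2];  /* high pair of a */
        dst[o + 2] = b[o + 1] + b[o + 0];  /* low pair of b  */
        dst[o + 3] = b[o + 3] + b[o + 2];  /* high pair of b */
    }
}

#ifdef __AVX__
/* Same computation using the intrinsic, with unaligned loads and stores
   so the caller's arrays need no special alignment. */
void hadd_ps_avx(const float a[8], const float b[8], float dst[8])
{
    __m256 va = _mm256_loadu_ps(a);
    __m256 vb = _mm256_loadu_ps(b);
    _mm256_storeu_ps(dst, _mm256_hadd_ps(va, vb));
}
#endif
```

With a = {1, 2, 3, 4, 5, 6, 7, 8} and b = {10, 20, 30, 40, 50, 60, 70, 80}, both helpers produce dst = {3, 7, 30, 70, 11, 15, 110, 150}.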