_mm256_dpwssds_epi32
Classification
AVX-512, Arithmetic, CPUID Test: AVX512_VNNI
Header File
Instruction
VPDPWSSDS ymm, ymm, ymm
Synopsis
_mm256_dpwssds_epi32(__m256i src, __m256i a, __m256i b);
Description
Multiply groups of 2 adjacent pairs of signed 16-bit integers in "a" with corresponding 16-bit integers in "b", producing 2 intermediate signed 32-bit results. Sum these 2 results with the corresponding 32-bit integer in "src" using signed saturation, and store the packed 32-bit results in "dst".
Operation
FOR j := 0 to 7
tmp1.dword := SignExtend32(a.word[2*j]) * SignExtend32(b.word[2*j])
tmp2.dword := SignExtend32(a.word[2*j+1]) * SignExtend32(b.word[2*j+1])
dst.dword[j] := Saturate32(src.dword[j] + tmp1 + tmp2)
ENDFOR
dst[MAX:256] := 0