_mm_mask_dpwssd_epi32
Classification
AVX-512, Arithmetic, CPUID Test: AVX512_VNNI
Header File
immintrin.h
Instruction
VPDPWSSD xmm {k}, xmm, xmm
Synopsis
 _mm_mask_dpwssd_epi32(__m128i src, __mmask8 k, __m128i a, __m128i b);
Description
Multiply groups of 2 adjacent pairs of signed 16-bit integers in "a" with corresponding 16-bit integers in "b", producing 2 intermediate signed 32-bit results. Sum these 2 results with the corresponding 32-bit integer in "src", and store the packed 32-bit results in "dst" using writemask "k" (elements are copied from "src" when the corresponding mask bit is not set).
Operation
FOR j := 0 to 3
	IF k[j]
		tmp1.dword := SignExtend32(a.word[2*j]) * SignExtend32(b.word[2*j])
		tmp2.dword := SignExtend32(a.word[2*j+1]) * SignExtend32(b.word[2*j+1])
		dst.dword[j] := src.dword[j] + tmp1 + tmp2
	ELSE
		dst.dword[j] := src.dword[j]
	FI
ENDFOR
dst[MAX:128] := 0