_mm256_mask_maddubs_epi16
Classification
AVX-512, Arithmetic, CPUID Test: AVX512BW
Header File
immintrin.h
Instruction
VPMADDUBSW ymm {k}, ymm, ymm
Synopsis
 _mm256_mask_maddubs_epi16(__m256i src, __mmask16 k, __m256i a, __m256i b);
Description
Multiply packed unsigned 8-bit integers in "a" by packed signed 8-bit integers in "b", producing intermediate signed 16-bit integers. Horizontally add adjacent pairs of intermediate signed 16-bit integers, and pack the saturated results in "dst" using writemask "k" (elements are copied from "src" when the corresponding mask bit is not set).
Operation
FOR j := 0 to 15
	i := j*16
	IF k[j]
		dst[i+15:i] := Saturate16( a[i+15:i+8]*b[i+15:i+8] + a[i+7:i]*b[i+7:i] )
	ELSE
		dst[i+15:i] := src[i+15:i]
	FI
ENDFOR
dst[MAX:256] := 0