_mm256_mask_madd_epi16
Classification
AVX-512, Arithmetic, CPUID Test: AVX512BW and AVX512VL
Header File
immintrin.h
Instruction
VPMADDWD ymm {k}, ymm, ymm
Synopsis
__m256i _mm256_mask_madd_epi16(__m256i src, __mmask8 k, __m256i a, __m256i b);
Description
Multiply packed signed 16-bit integers in "a" and "b", producing intermediate signed 32-bit integers. Horizontally add adjacent pairs of intermediate 32-bit integers, and pack the results in "dst" using writemask "k" (elements are copied from "src" when the corresponding mask bit is not set).
Operation
FOR j := 0 to 7
    i := j*32
    IF k[j]
        dst[i+31:i] := SignExtend32(a[i+31:i+16]*b[i+31:i+16]) + SignExtend32(a[i+15:i]*b[i+15:i])
    ELSE
        dst[i+31:i] := src[i+31:i]
    FI
ENDFOR
dst[MAX:256] := 0
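
Example
The Operation pseudocode above can be modelled in portable scalar C, which is handy for checking results on machines without AVX-512. This is a minimal sketch, not Intel's implementation. The helper name mask_madd_epi16_ref and its array-based signature are hypothetical and appear in no Intel header. It assumes lane 0 is the lowest-addressed element and that bit j of k controls 32-bit lane j.

```c
#include <stdint.h>

/* Hypothetical reference helper (not an Intel API): scalar model of the
   Operation pseudocode for _mm256_mask_madd_epi16.
   a, b : 16 signed 16-bit lanes each
   src  : 8 signed 32-bit lanes copied through where the mask bit is clear
   k    : 8-bit writemask, bit j selects 32-bit lane j
   dst  : 8 signed 32-bit result lanes */
static void mask_madd_epi16_ref(int32_t dst[8], const int32_t src[8],
                                uint8_t k, const int16_t a[16],
                                const int16_t b[16])
{
    for (int j = 0; j < 8; ++j) {
        if ((k >> j) & 1) {
            /* Each 16x16-bit product fits in a signed 32-bit integer. */
            int32_t lo = (int32_t)a[2 * j]     * (int32_t)b[2 * j];
            int32_t hi = (int32_t)a[2 * j + 1] * (int32_t)b[2 * j + 1];
            /* Add in unsigned arithmetic so the single overflowing case
               (all four inputs equal to -32768) wraps to 32 bits instead
               of triggering signed-overflow undefined behavior. The
               conversion back to int32_t is implementation-defined in C
               but wraps on common two's-complement compilers. */
            dst[j] = (int32_t)((uint32_t)lo + (uint32_t)hi);
        } else {
            /* Mask bit clear: keep the corresponding element of src. */
            dst[j] = src[j];
        }
    }
}
```

With a[n] = n+1, alternating b = {1, -1, 1, -1, ...}, and k = 0x0F, each of lanes 0 to 3 evaluates to (2j+1) - (2j+2) = -1, while lanes 4 to 7 are copied from src unchanged. On hardware with AVX512BW and AVX512VL, the intrinsic itself should produce the same lane values when given the same inputs loaded into __m256i registers.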