_mm256_maskz_add_ph
Classification
AVX-512, Arithmetic, CPUID Test: AVX512_FP16
Header File
Instruction
VADDPH ymm {z}, ymm, ymm
Synopsis
_mm256_maskz_add_ph(__mmask16 k, __m256h a, __m256h b);
Description
Add packed half-precision (16-bit) floating-point elements in "a" and "b", and store the results in "dst" using zeromask "k" (elements are zeroed out when the corresponding mask bit is not set).
Operation
FOR j := 0 TO 15
IF k[j]
dst.fp16[j] := a.fp16[j] + b.fp16[j]
ELSE
dst.fp16[j] := 0
FI
ENDFOR
dst[MAX:256] := 0