_mm_maskz_rcp28_round_ss
Classification
AVX-512, Elementary Math Functions, CPUID Test: AVX512ER
Header File
#include <immintrin.h>
Instruction
VRCP28SS xmm {z}, xmm, xmm {sae}
Synopsis
__m128 _mm_maskz_rcp28_round_ss(__mmask8 k, __m128 a, __m128 b, int sae);
Description
Compute the approximate reciprocal of the lower single-precision (32-bit) floating-point element in "b", store the result in the lower element of "dst" using zeromask "k" (the element is zeroed out when mask bit 0 is not set), and copy the upper 3 packed elements from "a" to the upper elements of "dst". The maximum relative error for this approximation is less than 2^-28. Exceptions can be suppressed by passing _MM_FROUND_NO_EXC in the "sae" parameter.
Operation
IF k[0]
	dst[31:0] := (1.0 / b[31:0])
ELSE
	dst[31:0] := 0
FI
dst[127:32] := a[127:32]
dst[MAX:128] := 0
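Example
The Operation pseudocode above can be sketched as a portable scalar model in C. This is only an illustration, not the intrinsic itself: the function name rcp28_maskz_ss_model and the use of plain float[4] arrays in place of __m128 are assumptions made for this example, and the model uses an exact 1.0f / x division where the hardware returns an approximation with relative error below 2^-28. It does not model the "sae" parameter or exception behavior.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Hypothetical scalar reference model of the Operation pseudocode.
 * dst, a and b stand in for the four 32-bit lanes of an __m128 value;
 * k stands in for the __mmask8 zeromask (only bit 0 is consulted).
 * The reciprocal is computed with an exact division, which is an
 * assumption of this sketch and not what the instruction does.
 */
static void rcp28_maskz_ss_model(float dst[4], uint8_t k,
                                 const float a[4], const float b[4])
{
    assert(dst && a && b);

    /* Compute the low lane first so that dst may alias a or b. */
    float lo = (k & 1u) ? (1.0f / b[0]) : 0.0f;

    /* dst[31:0]: reciprocal if mask bit 0 is set, otherwise zero. */
    dst[0] = lo;

    /* dst[127:32] := a[127:32] */
    dst[1] = a[1];
    dst[2] = a[2];
    dst[3] = a[3];
}
```

With a = {10, 20, 30, 40} and b = {4, 5, 6, 7}, calling the model with k = 1 gives a low lane of 0.25 and upper lanes 20, 30, 40; with k = 0 the low lane is zeroed and the upper lanes are unchanged. On hardware that reports AVX512ER, the corresponding call would be _mm_maskz_rcp28_round_ss(k, a, b, _MM_FROUND_NO_EXC), with a and b as __m128 values.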