_mm512_mask_rsqrt28_round_ps
Classification
AVX-512, Elementary Math Functions, CPUID Test: AVX512ER
Header File
Instruction
VRSQRT28PS zmm {k}, zmm {sae}
Synopsis
_mm512_mask_rsqrt28_round_ps(__m512 src, __mmask16 k, __m512 a, int sae);
Description
Compute the approximate reciprocal square root of packed single-precision (32-bit) floating-point elements in "a", store the results in "dst" using writemask "k" (elements are copied from "src" when the corresponding mask bit is not set). The maximum relative error for this approximation is less than 2^-28. [sae_note]
Operation
FOR j := 0 to 15
i := j*32
IF k[j]
dst[i+31:i] := (1.0 / SQRT(a[i+31:i]))
ELSE
dst[i+31:i] := src[i+31:i]
FI
ENDFOR