_mm256_mask_rsqrt14_ps
Classification
AVX-512, Arithmetic, CPUID Test: AVX512F
Header File
Instruction
VRSQRT14PS ymm {k}, ymm
Synopsis
_mm256_mask_rsqrt14_ps(__m256 src, __mmask8 k, __m256 a);
Description
Compute the approximate reciprocal square root of packed single-precision (32-bit) floating-point elements in "a", and store the results in "dst" using writemask "k" (elements are copied from "src" when the corresponding mask bit is not set). The maximum relative error for this approximation is less than 2^-14.
Operation
FOR j := 0 to 7
i := j*32
IF k[j]
dst[i+31:i] := (1.0 / SQRT(a[i+31:i]))
ELSE
dst[i+31:i] := src[i+31:i]
FI
ENDFOR
dst[MAX:256] := 0