_mm_mask_rcp_sh
Classification
AVX-512, Elementary Math Functions, CPUID Test: AVX512_FP16
Header File
Instruction
VRCPSH xmm {k}, xmm, xmm
Synopsis
_mm_mask_rcp_sh(__m128h src, __mmask8 k, __m128h a, __m128h b);
Description
Compute the approximate reciprocal of the lower half-precision (16-bit) floating-point element in "a", store the result in the lower element of "dst" using writemask "k" (the element is copied from "src" when mask bit 0 is not set), and copy the upper 7 packed elements from "a" to the upper elements of "dst". The maximum relative error for this approximation is less than 1.5*2^-12.
Operation
IF k[0]
dst.fp16[0] := (1.0 / b.fp16[0])
ELSE
dst.fp16[0] := src.fp16[0]
FI
dst[127:16] := a[127:16]
dst[MAX:128] := 0