_mm_fmsub_round_sh
Classification
AVX-512, Arithmetic, CPUID Test: AVX512_FP16
Header File
immintrin.h
Instruction
VFMSUB132SH xmm, xmm, xmm {er}
Synopsis
 _mm_fmsub_round_sh(__m128h a, __m128h b, __m128h c, const int rounding);
Description
Multiply the lower half-precision (16-bit) floating-point elements in "a" and "b", and subtract the lower element in "c" from the intermediate result. Store the result in the lower element of "dst", and copy the upper 7 packed elements from "a" to the upper elements of "dst". [round_note]
Operation
dst.fp16[0] := (a.fp16[0] * b.fp16[0]) - c.fp16[0]
dst[127:16] := a[127:16]
dst[MAX:128] := 0