_mm512_mask_loadunpackhi_ps
Classification
KNC, Load, CPUID Test: KNCNI
Header File
Instruction
VLOADUNPACKHPS zmm {k}, m512
Synopsis
__m512 _mm512_mask_loadunpackhi_ps(__m512 src, __mmask16 k, void const* mt);
Description
Loads the high-64-byte-aligned portion of the doubleword stream starting at element-aligned address mt-64 and expands it into packed single-precision (32-bit) floating-point elements in "dst". The initial values of "dst" are copied from "src". Only those doublewords that occur at or after the first 64-byte-aligned address following (mt-64) are loaded. Elements in the resulting vector that do not map to those doublewords are taken from "src". Elements are loaded from memory according to element selector "k" (elements are skipped when the corresponding mask bit is not set).
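As a rough illustration of the boundary rule described above, the following C sketch counts how many of the 16 stream elements lie at or after the first 64-byte boundary following mt-64. The helper name hi_element_count is hypothetical (it is not part of any Intel header), and the sketch assumes all 16 mask bits are set and that mt is 4-byte aligned.

```c
#include <stdint.h>

/* Hypothetical helper, not an Intel API: number of 32-bit stream elements
   (out of the 16 starting at mt-64) that lie at or after the first
   64-byte boundary following mt-64. Assumes a full mask (k == 0xFFFF)
   and an element-aligned (4-byte aligned) mt. */
static unsigned hi_element_count(uintptr_t mt)
{
    uintptr_t addr = mt - 64;            /* start of the doubleword stream */
    unsigned r = (unsigned)(addr % 64);  /* byte offset inside its 64-byte line */
    return r / 4;                        /* 0 when mt-64 is already 64-byte aligned */
}
```

For example, with mt 12 bytes past a 64-byte boundary the helper returns 3, meaning only the three highest elements of the result would come from memory; with a 64-byte-aligned mt it returns 0 and the result equals "src".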
Operation
dst[511:0] := src[511:0]
loadOffset := 0
foundNext64BytesBoundary := false
addr := mt-64
FOR j := 0 to 15
    IF k[j]
        IF foundNext64BytesBoundary == false
            IF ((addr + (loadOffset + 1)*4) % 64) == 0
                foundNext64BytesBoundary := true
            FI
        ELSE
            i := j*32
            tmp[31:0] := MEM[addr + loadOffset*4]
            dst[i+31:i] := tmp[31:0]
        FI
        loadOffset := loadOffset + 1
    FI
ENDFOR
dst[MAX:512] := 0
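The Operation pseudocode can be modeled in portable scalar C, which may help when checking expected results on hardware without KNC support. This is a minimal sketch, not Intel's implementation: the function name emu_mask_loadunpackhi_ps and the use of plain float arrays in place of __m512 are assumptions made for illustration, and the boundary test is read as ((addr + (loadOffset + 1)*4) mod 64) == 0.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical scalar model of the masked high-unpack load (not an Intel API).
   dst and src hold 16 floats each, k is the 16-bit write mask, and mt is the
   element-aligned address that would be passed to the intrinsic. */
static void emu_mask_loadunpackhi_ps(float dst[16], const float src[16],
                                     uint16_t k, const void *mt)
{
    uintptr_t addr = (uintptr_t)mt - 64;   /* start of the doubleword stream */
    unsigned load_offset = 0;              /* index of the next stream element */
    int found_boundary = 0;

    memcpy(dst, src, 16 * sizeof(float));  /* dst := src */

    for (int j = 0; j < 16; ++j) {
        if (!((k >> j) & 1u))
            continue;                      /* masked-off lane keeps its src value */
        if (!found_boundary) {
            /* Does the NEXT stream element start on a 64-byte boundary? */
            if ((addr + (load_offset + 1) * 4u) % 64u == 0)
                found_boundary = 1;
        } else {
            /* Past the boundary: copy one 32-bit element into lane j. */
            memcpy(&dst[j], (const void *)(addr + load_offset * 4u),
                   sizeof(float));
        }
        ++load_offset;                     /* advances only for selected lanes */
    }
}
```

In this model, memory is read only for stream elements located after the boundary has been found, so lanes that map to addresses below the boundary retain their "src" values. The bits above 511 that the pseudocode zeroes have no counterpart in a 16-float array and are not modeled.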