_mm_mask_expandloadu_ps
Classification
Header File
Instruction
VEXPANDPS xmm {k}, m128
Synopsis
__m128 _mm_mask_expandloadu_ps(__m128 src, __mmask8 k, void const* mem_addr);
Description
Load contiguous single-precision (32-bit) floating-point elements from unaligned memory at "mem_addr", one for each bit set in the low 4 bits of mask "k", and expand them in order into the elements of "dst" whose mask bit is set. Elements of "dst" whose mask bit is not set are copied from "src".
Operation
m := 0
FOR j := 0 to 3
	i := j*32
	IF k[j]
		dst[i+31:i] := MEM[mem_addr+m+31:mem_addr+m]
		m := m + 32
	ELSE
		dst[i+31:i] := src[i+31:i]
	FI
ENDFOR
dst[MAX:128] := 0
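
The operation above can be sketched as a scalar reference model in portable C. The function name expandloadu_ps_model and its array-based signature are hypothetical and chosen only for illustration; they are not part of any Intel header. The model works on plain float arrays instead of __m128 so that it compiles without AVX-512 support, and it does not model the zeroing of the upper bits dst[MAX:128].

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical scalar model of _mm_mask_expandloadu_ps (illustration only).
 * dst and src stand in for the four 32-bit lanes of a __m128.
 * Only the low 4 bits of k are examined, matching the j = 0..3 loop. */
static void expandloadu_ps_model(float dst[4], const float src[4],
                                 uint8_t k, const float *mem_addr)
{
    assert(dst != NULL && src != NULL);

    size_t m = 0; /* index of the next unread contiguous element in memory */
    for (int j = 0; j < 4; ++j) {
        if ((k >> j) & 1u) {
            /* Mask bit set: take the next element from memory. */
            dst[j] = mem_addr[m];
            ++m;
        } else {
            /* Mask bit clear: keep the corresponding lane of src. */
            dst[j] = src[j];
        }
    }
}
```

For example, with src = {-1, -2, -3, -4}, k = 0x0A (bits 1 and 3 set) and memory holding {10, 20}, the model writes {-1, 10, -3, 20} to dst. Only as many elements are read from memory as there are set bits in the low 4 bits of k.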