_mm256_mask_expandloadu_ps
Classification
Header File
#include <immintrin.h>
Instruction
VEXPANDPS ymm {k}, m256
Synopsis
__m256 _mm256_mask_expandloadu_ps(__m256 src, __mmask8 k, void const* mem_addr);
Description
Load contiguous single-precision (32-bit) floating-point elements from unaligned memory at "mem_addr" and expand them, in order, into the elements of "dst" whose corresponding bit in mask "k" is set. Elements of "dst" whose mask bit is not set are copied from "src".
Operation
m := 0
FOR j := 0 to 7
    i := j*32
    IF k[j]
        dst[i+31:i] := MEM[mem_addr+m+31:mem_addr+m]
        m := m + 32
    ELSE
        dst[i+31:i] := src[i+31:i]
    FI
ENDFOR
dst[MAX:256] := 0
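The operation above can be sketched as a portable scalar model in C, which may help when checking results on a machine without AVX-512 support. This is only an illustration: the function name expandloadu_ps_ref and its array-based signature are hypothetical and are not part of immintrin.h. It assumes the eight lanes are held in plain float arrays, and it does not model the zeroing of the upper register bits.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical scalar reference model of the Operation pseudocode.
 * dst and src stand in for the eight 32-bit lanes of a __m256;
 * k plays the role of the __mmask8 writemask. */
static void expandloadu_ps_ref(float dst[8], const float src[8],
                               uint8_t k, const void *mem_addr)
{
    const unsigned char *p = (const unsigned char *)mem_addr;
    size_t m = 0; /* byte offset of the next unread element in memory */

    assert(dst != NULL && src != NULL);

    for (int j = 0; j < 8; ++j) {
        if ((k >> j) & 1u) {
            /* Active lane: take the next contiguous float from memory.
             * memcpy keeps the read valid for unaligned addresses. */
            memcpy(&dst[j], p + m, sizeof(float));
            m += sizeof(float);
        } else {
            /* Inactive lane: pass the value through from src. */
            dst[j] = src[j];
        }
    }
}
```

With k = 0x52 (bits 1, 4 and 6 set) and three floats in memory, the model places those three values in lanes 1, 4 and 6 and leaves the other lanes equal to src. As in the pseudocode, the memory offset advances only for set mask bits, so the model reads one float per set bit of k.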