_mm_mask_expandloadu_ps
Classification
AVX-512, Load, CPUID Test: AVX512F, AVX512VL
Header File
immintrin.h
Instruction
VEXPANDPS xmm {k}, m128
Synopsis
__m128 _mm_mask_expandloadu_ps(__m128 src, __mmask8 k, void const* mem_addr);
Description
Load contiguous single-precision (32-bit) floating-point elements from unaligned memory at "mem_addr" and expand them, in order, into the elements of "dst" whose corresponding bit in writemask "k" is set. Elements of "dst" whose mask bit is not set are copied from "src". Only as many elements are read from memory as there are set bits among the low 4 bits of "k".
Operation
m := 0
FOR j := 0 to 3
	i := j*32
	IF k[j]
		dst[i+31:i] := MEM[mem_addr+m+31:mem_addr+m]
		m := m + 32
	ELSE
		dst[i+31:i] := src[i+31:i]
	FI
ENDFOR
dst[MAX:128] := 0
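Example
The pseudocode above can be sketched as a portable scalar model in C. This is only an illustration of the lane-by-lane behavior, not the intrinsic itself: the function name expandloadu_ps_model is hypothetical, the mask is taken as a plain uint8_t in place of __mmask8, and the 128-bit vectors are represented as arrays of 4 floats.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical scalar model of the 4-lane masked expand-load.
   dst and src each hold 4 floats; only the low 4 bits of k are used. */
static void expandloadu_ps_model(float dst[4], const float src[4],
                                 uint8_t k, const void *mem_addr)
{
    const unsigned char *p = (const unsigned char *)mem_addr;
    size_t m = 0; /* byte offset of the next unread element in memory */

    for (int j = 0; j < 4; j++) {
        if ((k >> j) & 1u) {
            /* Active lane: take the next contiguous element from memory.
               memcpy makes no alignment assumption about mem_addr. */
            memcpy(&dst[j], p + m, sizeof(float));
            m += sizeof(float);
        } else {
            /* Inactive lane: pass the element through from src. */
            dst[j] = src[j];
        }
    }
}
```

For instance, with k = 0x5 (lanes 0 and 2 active), memory holding {10, 20}, and src = {-1, -2, -3, -4}, the model produces dst = {10, -2, 20, -4}. Only two floats are read from memory, matching the number of set mask bits.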