_mm256_mask_expandloadu_pd
Classification
AVX-512, Load, CPUID Test: AVX512F
Header File
immintrin.h
Instruction
VEXPANDPD ymm {k}, m256
Synopsis
 _mm256_mask_expandloadu_pd(__m256d src, __mmask8 k, void const* mem_addr);
Description
Load contiguous active double-precision (64-bit) floating-point elements from unaligned memory at "mem_addr" (those with their respective bit set in mask "k"), and store the results in "dst" using writemask "k" (elements are copied from "src" when the corresponding mask bit is not set).
Operation
m := 0
FOR j := 0 to 3
	i := j*64
	IF k[j]
		dst[i+63:i] := MEM[mem_addr+m+63:mem_addr+m]
		m := m + 64
	ELSE
		dst[i+63:i] := src[i+63:i]
	FI
ENDFOR
dst[MAX:256] := 0