_mm512_mask_i64gather_ps
Classification
AVX-512, Load, CPUID Test: AVX512F
Header File
immintrin.h
Instruction
VGATHERQPS ymm {k}, vm64z
Synopsis
__m256 _mm512_mask_i64gather_ps(__m256 src, __mmask8 k, __m512i vindex, void const* base_addr, int scale);
Description
Gather single-precision (32-bit) floating-point elements from memory using 64-bit indices. Each 32-bit element is loaded from the address formed by adding the corresponding 64-bit element of "vindex", multiplied by "scale", to "base_addr". Gathered elements are merged into "dst" (the returned vector) using writemask "k": where a mask bit is not set, the element is copied from "src" and no load is performed for it. "scale" must be 1, 2, 4 or 8 and has to be a compile-time constant.
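As an illustration of the description above, the following is a minimal usage sketch, not a definitive implementation. The helper names have_avx512f and gather_masked_demo and the parameters table, idx and fallback are hypothetical and exist only for this example. It assumes a GCC- or Clang-compatible compiler on x86-64, since it relies on the target function attribute and on __builtin_cpu_supports to keep the AVX-512 code behind a runtime check.

```c
#include <immintrin.h>
#include <stdint.h>

/* Hypothetical helper: returns 1 if the CPU reports AVX512F, else 0.
   __builtin_cpu_supports is a GCC/Clang builtin, not part of immintrin.h. */
static int have_avx512f(void)
{
    return __builtin_cpu_supports("avx512f") ? 1 : 0;
}

/* Hypothetical helper: gathers table[idx[j]] into out[j] for every j whose
   bit is set in k; all other lanes receive fallback[j].
   Call it only after have_avx512f() has returned 1. */
__attribute__((target("avx512f")))
static void gather_masked_demo(const float *table, const int64_t idx[8],
                               __mmask8 k, const float fallback[8],
                               float out[8])
{
    __m512i vindex = _mm512_loadu_si512((const void *)idx); /* eight 64-bit indices */
    __m256  src    = _mm256_loadu_ps(fallback);             /* values for masked-off lanes */

    /* scale = 4 because the indices count float elements (4 bytes each). */
    __m256 r = _mm512_mask_i64gather_ps(src, k, vindex, table, 4);

    _mm256_storeu_ps(out, r);
}
```

With k = 0x05 only lanes 0 and 2 are loaded from table; the remaining six lanes keep the values passed in fallback.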
Operation
FOR j := 0 to 7
	i := j*32
	m := j*64
	IF k[j]
		addr := base_addr + vindex[m+63:m] * ZeroExtend64(scale) * 8
		dst[i+31:i] := MEM[addr+31:addr]
	ELSE
		dst[i+31:i] := src[i+31:i]
	FI
ENDFOR
dst[MAX:256] := 0
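The pseudocode above can be mirrored by a portable scalar model, which may be useful for checking results on machines without AVX-512. This is a sketch under stated assumptions: the function name mask_i64gather_ps_ref is hypothetical, the vectors are represented as plain 8-element arrays, and the trailing "* 8" in the pseudocode is read as a byte-to-bit conversion for the MEM[] bit-range notation, so the model works with byte addresses instead.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical scalar reference model of the operation.
   dst, src : eight floats each (stand-ins for the __m256 values)
   k        : 8-bit writemask, bit j controls lane j
   vindex   : eight signed 64-bit indices (stand-in for the __m512i value)
   scale    : byte multiplier applied to each index; 1, 2, 4 or 8 */
static void mask_i64gather_ps_ref(float dst[8], const float src[8], uint8_t k,
                                  const int64_t vindex[8],
                                  const void *base_addr, int scale)
{
    assert(scale == 1 || scale == 2 || scale == 4 || scale == 8);

    const unsigned char *base = (const unsigned char *)base_addr;

    for (int j = 0; j < 8; j++) {
        if ((k >> j) & 1) {
            /* Byte address = base_addr + index * scale; memcpy avoids
               alignment and aliasing problems for arbitrary offsets. */
            memcpy(&dst[j], base + vindex[j] * scale, sizeof(float));
        } else {
            /* Mask bit clear: keep the element from src, touch no memory. */
            dst[j] = src[j];
        }
    }
}
```

The final pseudocode line, dst[MAX:256] := 0, has no counterpart here because the model only represents the 256 bits that the intrinsic returns.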