From: Markus Wittmann
Date: Tue, 22 May 2018 08:24:08 +0000 (+0200)
Subject: squashed commits from internal repository
X-Git-Url: http://git.rrze.uni-erlangen.de/gitweb/?p=LbmBenchmarkKernelsPublic.git;a=commitdiff_plain;h=9e0051cb083e4d8575cbd9f4a41d11552358e151

squashed commits from internal repository

- doc: skylakesp2 results now with AVX512 intrinsics
- add AVX512 support for single precision intrinsics without gather/scatter

  The AVX512 intrinsics are divided into "pure" AVX512 (load/store/compute)
  and AVX512-GATHER, which includes gather/scatter. This enables us to
  support at least AVX512 single precision intrinsics for all kernels which
  do not require gather/scatter support.
- fix test.sh: re-enabled dp tests
- fix pull-split-nt: adjusted tmp array size to different vector lengths
---

diff --git a/doc/images/benchmark-skylakesp2-dp.png b/doc/images/benchmark-skylakesp2-dp.png
index 974cbcb..5488063 100644
Binary files a/doc/images/benchmark-skylakesp2-dp.png and b/doc/images/benchmark-skylakesp2-dp.png differ
diff --git a/doc/images/benchmark-skylakesp2-sp.png b/doc/images/benchmark-skylakesp2-sp.png
index 583b053..d369e63 100644
Binary files a/doc/images/benchmark-skylakesp2-sp.png and b/doc/images/benchmark-skylakesp2-sp.png differ
diff --git a/doc/main.html b/doc/main.html
index dfd45ec..89f4676 100644
--- a/doc/main.html
+++ b/doc/main.html
@@ -592,7 +592,7 @@ make clean-all
 OPENMP on, off on
-OpenMP, i.,e.. threading support.
+OpenMP, i.e. threading support.
 PRECISION dp, sp
@@ -637,6 +637,22 @@ make clean-all
+ADJ_LIST_MEM_TYPE
+HBM
+
+
+Determines memory location of adjacency list array, DRAM or HBM.
+
+PDF_MEM_TYPE
+HBM
+
+
+Determines memory location of PDF array, DRAM or HBM.
+
 SOFTWARE_PREFETCH_LOOKAHEAD_L1 int >= 0 0
@@ -1153,7 +1169,6 @@ which mimics the kernels memory access pattern and the kernel's loop balance

Skylake, Intel Xeon Gold 6148

-NOTE: currently we only use AVX2 intrinsics.