<tr><td>OPENMP</td>
<td>on, off</td>
<td>on</td>
-<td>OpenMP, i.,e.. threading support.</td>
+<td>OpenMP, i.e. threading support.</td>
</tr>
<tr><td>PRECISION</td>
<td>dp, sp</td>
</tr>
</thead>
<tbody valign="top">
+<tr><td>ADJ_LIST_MEM_TYPE</td>
+<td>HBM</td>
+<td><ul class="first last simple">
+<li></li>
+</ul>
+</td>
+<td>Determines the memory location of the adjacency list array: DRAM or HBM.</td>
+</tr>
+<tr><td>PDF_MEM_TYPE</td>
+<td>HBM</td>
+<td><ul class="first last simple">
+<li></li>
+</ul>
+</td>
+<td>Determines the memory location of the PDF array: DRAM or HBM.</td>
+</tr>
<tr><td>SOFTWARE_PREFETCH_LOOKAHEAD_L1</td>
<td>int >= 0</td>
<td>0</td>
</li>
</ul>
<p><strong>Skylake, Intel Xeon Gold 6148</strong></p>
-<p>NOTE: currently we only use AVX2 intrinsics.</p>
<ul class="simple">
<li>Skylake server architecture, AVX2, AVX512, 2 FMA units</li>
<li>20 cores, 2.4 GHz</li>
</tr>
<tr><td><img alt="perf_meggie_sp" src="images/benchmark-meggie-sp.png" style="width: 1000.0px; height: 250.0px;" /></td>
</tr>
-<tr><td>Skylake, Intel Xeon Gold 6148, Double Precision, <strong>NOTE: currently we only use AVX2 intrinsics.</strong></td>
+<tr><td>Skylake, Intel Xeon Gold 6148, Double Precision</td>
</tr>
<tr><td><img alt="perf_skylakesp2_dp" src="images/benchmark-skylakesp2-dp.png" style="width: 1000.0px; height: 250.0px;" /></td>
</tr>
-<tr><td>Skylake, Intel Xeon Gold 6148, Single Precision, <strong>NOTE: currently we only use AVX2 intrinsics.</strong></td>
+<tr><td>Skylake, Intel Xeon Gold 6148, Single Precision</td>
</tr>
<tr><td><img alt="perf_skylakesp2_sp" src="images/benchmark-skylakesp2-sp.png" style="width: 1000.0px; height: 250.0px;" /></td>
</tr>
</div>
<div class="section" id="acknowledgements">
<h1><a class="toc-backref" href="#id27">8 Acknowledgements</a></h1>
+<p>If you use the benchmark kernels, please cite:</p>
+<p>M. Wittmann, V. Haag, T. Zeiser, H. Köstler, and G. Wellein: Lattice Boltzmann
+Benchmark Kernels as a Testbed for Performance Analysis, (2018), Computers &
+Fluids, Special Issue DSFD2017. doi:10.1016/j.compfluid.2018.03.030.</p>
+<p>Bibtex entry:</p>
+<pre class="literal-block">
+@article{wittmann-2018,
+ author = {M. Wittmann and V. Haag and T. Zeiser and H. K\"ostler and G. Wellein},
+ title = {Lattice {B}oltzmann benchmark kernels as a testbed for performance analysis},
+ journal = {Computers \& Fluids},
+ year = {2018},
+ issn = {0045-7930},
+ doi = {10.1016/j.compfluid.2018.03.030},
+}
+</pre>
<p>This work was funded by BMBF, grant no. 01IH15003A (project SKAMPY).</p>
<p>This work was funded by KONWHIR project OMI4PAPS.</p>
</div>
Commun. ACM, 52(4):65-76, Apr 2009. doi:10.1145/1498765.1498785</td></tr>
</tbody>
</table>
-<p>Document was generated at 2018-05-10 14:10.</p>
+<p>Document was generated at 2018-06-06 10:38.</p>
</div>
</div>
</body>