Markus Wittmann [Tue, 15 May 2018 11:57:54 +0000 (13:57 +0200)]
merge with kernels from MH's master thesis
TODOs:
- Likwid marker in gather kernels should use the X_LIKWID interface.
- Old kernels should set loop balance in KernelData structure.
- AVX512 intrinsics for single precision must be implemented.
- This affects all kernels using intrinsics
- NOTE: gather kernels need special attention for gather intrinsics.
- Extend documentation by new kernels.
Markus Wittmann [Wed, 10 Jan 2018 13:25:18 +0000 (14:25 +0100)]
add single precision, add aa-vec-sl-soa kernel, updated doc
- Binaries now have a -dp or -sp suffix, depending on whether they have been
compiled for double or single precision.
- New full array kernel aa-vec-sl-soa added. It uses only one loop over the
lattice.
- Documentation has been updated, including how to build single precision
binaries and performance graphs on various architectures.
Markus Wittmann [Tue, 21 Nov 2017 15:13:06 +0000 (16:13 +0100)]
update README
Markus Wittmann [Tue, 21 Nov 2017 14:46:25 +0000 (15:46 +0100)]
update README and doc
Markus Wittmann [Thu, 2 Nov 2017 14:54:11 +0000 (15:54 +0100)]
bulk commit
- add AA pattern full array kernels
- add padding for list kernels
- transposed loops
Markus Wittmann [Thu, 26 Oct 2017 07:43:56 +0000 (09:43 +0200)]
add HTML documentation
Markus Wittmann [Wed, 6 Sep 2017 07:51:34 +0000 (09:51 +0200)]
add acknowledgements in doc and README
Markus Wittmann [Mon, 10 Jul 2017 10:03:37 +0000 (12:03 +0200)]
fix test.sh
test.sh depended on the module infrastructure of RRZE. To run the tests, a
valid config must now be specified.
Markus Wittmann [Mon, 10 Jul 2017 09:38:45 +0000 (11:38 +0200)]
version 0.1
Markus Wittmann [Mon, 10 Jul 2017 09:19:40 +0000 (11:19 +0200)]
initial commit