SVE vectorization
To continue my vectorization spree, here is the version with ARM SVE instructions. Tested in QEMU 5.2. Compiler support is still a bit wonky -- GCC 10 generates very bloated assembly and produces incorrect code for non-native vector sizes, while Clang 11 misses some obvious optimizations.
Edited by Michael Kuron
Merge request reports
Activity
Filter activity
mentioned in commit b1522533
mentioned in merge request !234 (merged)
mentioned in merge request walberla/walberla!448 (merged)
Please register or sign in to reply