GPU Indexing Schemes and Launch Configurations
Frederik Hennig authored
docs/source/backend/gpu_codegen.md (new file, mode 0 → 100644, +92 −0)
Closes #119 (closed)
This MR contributes:

- Add `codegen.functions.Lambda`, a simple wrapper around a backend expression tree that can be exported to the user and evaluated by the runtime system
- `Linear3D` and `Blockwise4D`, available through the config (they correspond to the pystencils 1.3.x `block` and `line` indexing options)
- [...] `Linear3D`
- `GpuThreadsRange`; the thread range is no longer computed by the CUDA and SYCL platforms
- `codegen.gpu_indexing.GpuLaunchConfiguration`, which is evaluated by the JIT in the context of the kernel launch
- `GpuIndexing` class, which acts as a factory for launch configurations and thread indexing objects depending on the user-provided configuration
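
To make the idea of a launch configuration that is evaluated at kernel-launch time more concrete, here is a minimal Python sketch. It is not the pystencils API: the function names `linear3d_launch` and `blockwise4d_launch`, their parameters, and the default block size are hypothetical. It also assumes that `Linear3D` behaves like the 1.3.x `block` indexing (a 3D grid of 3D blocks covering up to three iteration-space dimensions) and that `Blockwise4D` behaves like the 1.3.x `line` indexing (one 1D block per line along the fastest coordinate, with block indices covering up to three slower dimensions).

```python
# Hypothetical sketch, NOT the pystencils implementation.
# Extents are given fastest coordinate first and are only known at launch time.

def linear3d_launch(extent, block_size=(128, 2, 1)):
    """Assumed Linear3D-style mapping: 3D blocks on a 3D grid.

    The grid size is the ceiling of extent / block_size per dimension,
    so the kernel still needs a bounds check for the surplus threads.
    """
    if len(extent) > 3:
        raise ValueError("this sketch supports at most 3 dimensions")
    extent = tuple(extent) + (1,) * (3 - len(extent))  # pad to 3D
    grid = tuple(-(-n // b) for n, b in zip(extent, block_size))  # ceil division
    return tuple(block_size), grid


def blockwise4d_launch(extent):
    """Assumed Blockwise4D-style mapping: 1D blocks on a 3D grid.

    The fastest coordinate maps to the thread index within a block,
    the (up to three) slower coordinates map to the block indices.
    """
    if len(extent) > 4:
        raise ValueError("this sketch supports at most 4 dimensions")
    extent = tuple(extent) + (1,) * (4 - len(extent))  # pad to 4D
    block = (extent[0], 1, 1)
    grid = extent[1:4]
    return block, grid


if __name__ == "__main__":
    print(linear3d_launch((100, 50, 10), block_size=(32, 4, 2)))
    print(blockwise4d_launch((64, 16, 8, 2)))
```

Because both functions take the iteration-space extents as arguments, they can only be evaluated once the actual array sizes are known, which mirrors the description of the launch configuration being evaluated in the context of the kernel launch. Note that in the blockwise sketch the fastest extent becomes the block size directly, so it is limited by the device's maximum number of threads per block.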