Port tier1 to GPU. #30
Reference: TOPIO/BigWhoop#30
Tier 1 is currently the most time-consuming step. Porting it to the GPU would allow the encoding and decoding of individual code blocks to run in parallel, following the existing OpenMP threading hooks.
GPU kernels for encode_codeblock(s) and decode_codeblock(s) would be implemented as __global__ kernels launched from the CPU, replacing the loops in tier1_encode and tier1_decode.
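
A rough sketch of how the encode side could look, to make the proposal concrete. Everything here is an assumption for illustration: the struct `codeblock_sketch`, the functions `encode_codeblock_sketch`, `encode_codeblocks_kernel`, `grid_size` and `tier1_encode_gpu_sketch`, and the choice of 128 threads per block are hypothetical and not taken from the code base. The per-block body is a placeholder checksum, not the actual bit-plane coder.

```cuda
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical stand-in for the per-code-block state; the real struct
// in the code base will differ.
struct codeblock_sketch {
    const int *coeffs;    // device pointer to this block's coefficients
    int        n;         // number of coefficients in the block
    unsigned   checksum;  // placeholder for the encoded output
};

// Placeholder for the per-block work of encode_codeblock. It only sums
// absolute coefficient values; the real Tier 1 coder would go here.
__device__ void encode_codeblock_sketch(codeblock_sketch *cb)
{
    unsigned acc = 0;
    for (int i = 0; i < cb->n; ++i)
        acc += (unsigned)abs(cb->coeffs[i]);
    cb->checksum = acc;
}

// One thread per code block, mirroring one iteration of the CPU loop.
__global__ void encode_codeblocks_kernel(codeblock_sketch *blocks, int n_blocks)
{
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    if (idx < n_blocks)  // guard the partially filled last thread block
        encode_codeblock_sketch(&blocks[idx]);
}

// Number of thread blocks needed to cover n items (ceiling division).
inline int grid_size(int n, int threads_per_block)
{
    return (n + threads_per_block - 1) / threads_per_block;
}

// Host-side replacement for the loop over code blocks.
// Assumes d_blocks and the coefficient buffers are already on the device.
cudaError_t tier1_encode_gpu_sketch(codeblock_sketch *d_blocks, int n_blocks)
{
    const int threads = 128;  // assumed launch size, not tuned
    if (n_blocks <= 0)
        return cudaSuccess;

    encode_codeblocks_kernel<<<grid_size(n_blocks, threads), threads>>>(d_blocks, n_blocks);

    cudaError_t err = cudaGetLastError();  // catches launch-configuration errors
    if (err != cudaSuccess)
        return err;
    return cudaDeviceSynchronize();        // wait for all code blocks to finish
}
```

The decode side would presumably mirror this with a second kernel. Mapping one thread to one code block is the most direct translation of the loop, but threads in a warp may diverge heavily if code blocks differ in size or content, so one thread block per code block might be worth comparing.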