kernels/tests/regression/flash_attention
Hansung Kim 4448f31fdc fence: Fix moving fence to start of loop
For unknown reasons, guarding the fence with a tid == 0 branch triggers a
TL source ID re-used assertion. As a workaround, call the fence from all
threads/warps. At least all threads in a warp will coalesce into one
request.
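
The two variants the message contrasts can be sketched as a small host-side
model. This is only an illustration under stated assumptions: fence_stub,
loop_body_guarded, loop_body_unguarded and run_iteration are hypothetical
names, not identifiers from this repository, and the stub counts calls
instead of issuing a real fence. The serial loop does not model warp-level
coalescing, so the unguarded count below is per thread, not per request.

```c
#include <assert.h>

/* Hypothetical stand-in for the kernel's fence; it only counts calls. */
static int fence_calls = 0;

static void fence_stub(void) { fence_calls++; }

/* Guarded variant: only thread 0 issues the fence.
 * Per the commit message, this form hit the assertion on the real target. */
static void loop_body_guarded(int tid) {
    if (tid == 0) {
        fence_stub();
    }
    /* ... rest of the loop iteration would go here ... */
}

/* Workaround variant: every thread issues the fence at the start of the loop. */
static void loop_body_unguarded(int tid) {
    (void)tid; /* thread ID is deliberately unused */
    fence_stub();
    /* ... rest of the loop iteration would go here ... */
}

/* Run one loop iteration for num_threads simulated threads (serially)
 * and return how many times the fence was called. */
static int run_iteration(void (*body)(int), int num_threads) {
    fence_calls = 0;
    for (int tid = 0; tid < num_threads; tid++) {
        body(tid);
    }
    return fence_calls;
}

/* Small self-check of the model. */
static void self_check(void) {
    assert(run_iteration(loop_body_guarded, 8) == 1);
    assert(run_iteration(loop_body_unguarded, 8) == 8);
}
```

In this model the unguarded variant costs one fence call per thread. The
commit message's point is that on the actual hardware the calls from threads
in the same warp coalesce into one request, so the overhead is smaller than
the raw call count suggests.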
2024-11-09 22:04:45 -08:00