Files
kernels/tests/regression/flash_attention/kernel.cpp
Hansung Kim e809d25305 flash: Fix rowsum and write fake exp
GEMM part is disabled for faster debugging, the kernel reads the result
of A*B directly from input binary.
2024-08-15 16:32:21 -07:00

10 KiB