Files
kernels/tests/regression/flash_attention/kernel.gemmini.cpp
Hansung Kim fcd8b0b892 flash: Disable rescale flag check
GEMM-II finishes much earlier than softmax for this to be a problem.
2024-11-09 20:37:58 -08:00

31 KiB