Commit Graph

12 Commits

Author SHA1 Message Date
Hansung Kim
95e3e96c6c tensor: Change B in-memory layout to column-major 2024-08-12 15:22:07 -07:00
Hansung Kim
5d5a6fbad2 sgemm_tcore: Template-ize kernel code 2024-07-29 20:11:51 -07:00
Hansung Kim
5f342914bd sgemm_tcore: Support fp16 input generation in host code 2024-07-29 17:18:35 -07:00
Hansung Kim
ca7fd84a83 sgemm_tcore: Split util functions to a header file 2024-06-11 19:06:22 -07:00
Hansung Kim
856596cbb3 sgemm_tcore: Write reference C before sim 2024-06-07 10:39:11 -07:00
Hansung Kim
d8944db369 sgemm_tcore: Double-buffer over K-dimension
TODO: Not completely parameterized with DOUBLE_BUFFER yet.
2024-06-04 18:23:27 -07:00
Hansung Kim
bc7bd1a1dd sgemm_tcore: Write reference C matrix to file 2024-05-25 22:47:15 -07:00
Hansung Kim
b892c22f00 sgemm_tcore: Reflect WMITER/WNITER in threadblock size 2024-05-16 23:31:52 -07:00
Hansung Kim
8f64fae7a7 sgemm_tcore: Addr gen for local_k; add SIMT-only for reference 2024-05-16 14:11:09 -07:00
Hansung Kim
5de8e7c33a sgemm_tg: Fix device address to use ELF operands 2024-05-13 23:09:57 -07:00
Hansung Kim
7775830814 Hardcode chipyard device addresses 2024-05-07 16:30:30 -07:00
joshua
5bd25985c6 i kinda forgot most of changes 2024-05-04 23:01:47 -07:00