Hansung Kim
|
95e3e96c6c
|
tensor: Change B in-memory layout to column-major
|
2024-08-12 15:22:07 -07:00 |
|
Hansung Kim
|
5d5a6fbad2
|
sgemm_tcore: Template-ize kernel code
|
2024-07-29 20:11:51 -07:00 |
|
Hansung Kim
|
5f342914bd
|
sgemm_tcore: Support fp16 input generation in host code
|
2024-07-29 17:18:35 -07:00 |
|
Hansung Kim
|
ca7fd84a83
|
sgemm_tcore: Split util functions to a header file
|
2024-06-11 19:06:22 -07:00 |
|
Hansung Kim
|
856596cbb3
|
sgemm_tcore: Write reference C before sim
|
2024-06-07 10:39:11 -07:00 |
|
Hansung Kim
|
d8944db369
|
sgemm_tcore: Double-buffer over K-dimension
TODO: Not completely parameterized with DOUBLE_BUFFER yet.
|
2024-06-04 18:23:27 -07:00 |
|
Hansung Kim
|
bc7bd1a1dd
|
sgemm_tcore: Write reference C matrix to file
|
2024-05-25 22:47:15 -07:00 |
|
Hansung Kim
|
b892c22f00
|
sgemm_tcore: Reflect WMITER/WNITER in threadblock size
|
2024-05-16 23:31:52 -07:00 |
|
Hansung Kim
|
8f64fae7a7
|
sgemm_tcore: Addr gen for local_k; add SIMT-only for reference
|
2024-05-16 14:11:09 -07:00 |
|
Hansung Kim
|
5de8e7c33a
|
sgemm_tg: Fix device address to use ELF operands
|
2024-05-13 23:09:57 -07:00 |
|
Hansung Kim
|
7775830814
|
Hardcode chipyard device addresses
|
2024-05-07 16:30:30 -07:00 |
|
joshua
|
5bd25985c6
|
i kinda forgot most of changes
|
2024-05-04 23:01:47 -07:00 |
|