Commit Graph

2690 Commits

Author SHA1 Message Date
Richard Yan
8d71815809 Merge branch 'ae' into ae-hopper 2025-01-30 23:40:48 -08:00
Richard Yan
f8c51669c1 fix toolchain env sh 2025-01-30 21:17:12 -08:00
Richard Yan
63f476eb83 Merge branch 'ae' into ae-hopper 2025-01-30 15:34:58 -08:00
Richard Yan
17a9d31be5 fix dma invocation 2025-01-30 15:33:58 -08:00
Hansung Kim
0711f5f7a3 Merge branch 'ae' into ae-hopper 2025-01-30 13:24:50 -08:00
Hansung Kim
238b942133 Add missing library remake 2025-01-30 13:24:23 -08:00
Hansung Kim
97227577b5 Merge branch 'ae' into ae-hopper 2025-01-30 01:48:09 -08:00
Hansung Kim
2c1ac4e938 Do git pull to make sure up-to-date 2025-01-30 01:47:35 -08:00
Richard Yan
3cd6aacc17 Merge branch 'ae' into ae-hopper 2025-01-30 01:35:10 -08:00
Richard Yan
9cdee597b6 Merge branch 'ae' of https://github.com/richardyrh/virgo-kernels into ae 2025-01-30 01:34:29 -08:00
Richard Yan
dde3602046 disable prints for virgo gemm 2025-01-30 01:34:22 -08:00
Hansung Kim
6bdc6af607 Fix branch name and dims for flash script 2025-01-30 01:15:57 -08:00
Hansung Kim
e4f8f3481c Merge branch 'ae' into ae-hopper 2025-01-30 01:05:31 -08:00
Hansung Kim
b73147cd06 Add compile and operand generate script for flash 2025-01-30 01:04:20 -08:00
Hansung Kim
471f89e371 Add arg binary for flash 2025-01-30 01:02:12 -08:00
Hansung Kim
c7f713c71e Merge branch 'ae' into ae-hopper 2025-01-30 00:49:23 -08:00
Hansung Kim
7e1fc54c97 Fix typo in path 2025-01-30 00:41:42 -08:00
Hansung Kim
b06e345706 Merge branch 'ae' into ae-hopper 2025-01-30 00:35:10 -08:00
Hansung Kim
8a635b5fcb Set TENSOR_HOPPER to 1, add missing markers 2025-01-30 00:34:13 -08:00
Hansung Kim
50c8f1c410 Add operand generate script for tcore 2025-01-29 23:33:09 -08:00
Richard Yan
f23b2a3fcc Merge branch 'ae' into ae-hopper 2025-01-29 23:31:21 -08:00
Richard Yan
dc46135f66 fix compile tcore script 2025-01-29 23:31:09 -08:00
Richard Yan
ac34a8f5f5 hopper changes 2025-01-29 22:22:34 -08:00
Richard Yan
91a82c9f0f merge kernel changes from kernels-asplos-ae 2025-01-29 22:11:25 -08:00
Richard Yan
a61bf257ff modify makefile to point to new locations 2025-01-29 21:27:59 -08:00
Richard Yan
0d842a5930 more renaming and cleanup 2025-01-29 21:22:41 -08:00
Richard Yan
f98cd9bc22 remove old ci 2025-01-29 20:39:47 -08:00
Richard Yan
d4b78377a1 fix virgo kernel scripts 2025-01-29 20:19:42 -08:00
Richard Yan
0e6bcf51f1 cleanup 2025-01-29 18:38:49 -08:00
Richard Yan
5ba132e87b regression restructure 2025-01-29 18:30:32 -08:00
Hansung Kim
3de51577ef Check-in gemmini headers instead of submodule 2025-01-29 17:10:37 -08:00
Richard Yan
e86aac3a6f Merge branch 'new-cisc' into kernels-asplos-ae 2025-01-29 17:03:54 -08:00
Richard Yan
24894b1712 Merge branch 'new-cisc' of https://github.com/hansungk/vortex into new-cisc 2025-01-29 17:03:05 -08:00
Richard Yan
d47ef75614 update idle kernel 2025-01-29 17:00:08 -08:00
Richard Yan
ec41200845 updated no dma gemmini kernel 2025-01-29 16:59:44 -08:00
Hansung Kim
c26558bc93 Add fence before rescale 2025-01-28 23:48:02 -08:00
Hansung Kim
198a25cb16 Set NUM_CORES to 8 for Volta/Ampere 2025-01-28 22:49:36 -08:00
Hansung Kim
f2b5a3409d Merge branch 'new-cisc' into kernels-asplos-ae 2025-01-28 21:18:12 -08:00
Richard Yan
8c45b8b4b7 Merge branch 'new-cisc' of https://github.com/hansungk/vortex-private into new-cisc 2025-01-28 17:14:49 -08:00
Hansung Kim
e43f3c02a9 sgemm_impl: FP_SIZE to 16 2025-01-28 17:06:04 -08:00
Richard Yan
b1e6495630 update kernels 2025-01-28 16:39:17 -08:00
Hansung Kim
d98a414765 Change gemmini_mmio.h to fp16 GEMM setting 2025-01-28 16:36:55 -08:00
Hansung Kim
e4c0bbd039 sgemm: Check-in argument binaries 2025-01-28 16:04:56 -08:00
Hansung Kim
45e9407c99 sgemm: Check-in argument binaries 2025-01-28 15:58:27 -08:00
Hansung Kim
9894efe6c9 Update toolchain env paths for dork 2025-01-28 15:04:14 -08:00
Hansung Kim
5ef4c8023e sgemm_impl: Disable wmma fast store
Doesn't seem to have a big impact on tcore util.
2024-11-11 14:06:15 -08:00
Hansung Kim
7d7cb5f60a flash: Disable perf loop multiplier 2024-11-10 22:44:02 -08:00
Hansung Kim
4448f31fdc fence: Fix moving fence to start of loop
For unknown reasons, guarding the fence with a tid == 0 branch causes a
TL source ID re-used assertion.  Just call the fence from all
thread/warps as a workaround.  At least, all threads in a warp will
coalesce into one request.
2024-11-09 22:04:45 -08:00
Hansung Kim
cb916ead39 Fix potential bitwidth bug in compute API 2024-11-09 20:59:58 -08:00
Hansung Kim
68054689c9 flash: Move fence to start of loop; wrap all MMIO in one tid=0 branch 2024-11-09 20:59:26 -08:00