Commit Graph

  • e7229dae27 Checkpoint wu arch cases before scalar spawn wrapper ae abnerhexu 2026-05-24 10:51:58 +08:00
  • 8f7dba5920 Add Blackwell instructions test kernel and update linker script abnerhexu 2026-05-06 14:50:28 +08:00
  • bcc566b621 Add Blackwell SGEMM kernel scaffolding abnerhexu 2026-04-25 10:15:31 +08:00
  • c24585570d Merge branch 'ae' into ae-flash-virgo ae-flash-virgo Virgo-AE Eval 2025-02-07 14:53:04 -08:00
  • 37da0e2aa8 Merge branch 'ae' into ae-flash-ampere ae-flash-ampere Virgo-AE Eval 2025-02-07 14:53:00 -08:00
  • 0884ba6fcb Merge branch 'ae' into ae-hopper ae-hopper Virgo-AE Eval 2025-02-07 14:52:27 -08:00
  • 326141b11f Merge branch 'ae' into ae-volta ae-volta Virgo-AE Eval 2025-02-07 14:52:25 -08:00
  • 71f713b9fc Disable git pull for archive ae-ampere Virgo-AE Eval 2025-02-07 14:48:56 -08:00
  • 179cbbcc69 Merge branch 'ae' into ae-flash-ampere Richard Yan 2025-01-31 03:53:51 -08:00
  • 8071faf7c2 Merge branch 'ae' into ae-flash-virgo Richard Yan 2025-01-31 03:53:39 -08:00
  • d893780594 Merge branch 'ae' into ae-volta Richard Yan 2025-01-31 03:53:26 -08:00
  • fd2fe71ca1 Merge branch 'ae' into ae-hopper Richard Yan 2025-01-31 03:53:00 -08:00
  • 9847072eff fix hexadecile Richard Yan 2025-01-31 02:02:18 -08:00
  • 9a9f8549d8 Merge branch 'ae' into ae-flash-ampere Richard Yan 2025-01-30 23:42:26 -08:00
  • a4bd41392c Merge branch 'ae' into ae-flash-virgo Richard Yan 2025-01-30 23:42:05 -08:00
  • 9b7c22a7e9 Merge branch 'ae' into ae-volta Richard Yan 2025-01-30 23:41:16 -08:00
  • 8d71815809 Merge branch 'ae' into ae-hopper Richard Yan 2025-01-30 23:40:48 -08:00
  • f8c51669c1 fix toolchain env sh Richard Yan 2025-01-30 21:17:12 -08:00
  • b1ebabef26 Merge branch 'ae' into ae-volta Richard Yan 2025-01-30 15:35:22 -08:00
  • 63f476eb83 Merge branch 'ae' into ae-hopper Richard Yan 2025-01-30 15:34:58 -08:00
  • 17a9d31be5 fix dma invocation Richard Yan 2025-01-30 15:33:58 -08:00
  • 692f3dddff Merge branch 'ae' into ae-flash-virgo Hansung Kim 2025-01-30 13:25:00 -08:00
  • 66c09d3db2 Merge branch 'ae' into ae-flash-ampere Hansung Kim 2025-01-30 13:24:58 -08:00
  • 0711f5f7a3 Merge branch 'ae' into ae-hopper Hansung Kim 2025-01-30 13:24:50 -08:00
  • 9f524538a4 Merge branch 'ae' into ae-volta Hansung Kim 2025-01-30 13:24:46 -08:00
  • 238b942133 Add missing library remake Hansung Kim 2025-01-30 13:24:23 -08:00
  • c75ed0d531 Merge branch 'ae' into ae-flash-virgo Hansung Kim 2025-01-30 01:49:05 -08:00
  • dcb8549722 Merge branch 'ae' into ae-flash-ampere Hansung Kim 2025-01-30 01:49:02 -08:00
  • 51ebe18ebb Merge remote-tracking branch 'origin/ae-volta' into ae-volta Hansung Kim 2025-01-30 01:48:25 -08:00
  • 97227577b5 Merge branch 'ae' into ae-hopper Hansung Kim 2025-01-30 01:48:09 -08:00
  • 7b0a95034b Merge branch 'ae' into ae-volta Hansung Kim 2025-01-30 01:48:05 -08:00
  • 2c1ac4e938 Do git pull to make sure up-to-date Hansung Kim 2025-01-30 01:47:35 -08:00
  • c240069147 Merge branch 'ae' into ae-volta Richard Yan 2025-01-30 01:35:35 -08:00
  • 3cd6aacc17 Merge branch 'ae' into ae-hopper Richard Yan 2025-01-30 01:35:10 -08:00
  • 9cdee597b6 Merge branch 'ae' of https://github.com/richardyrh/virgo-kernels into ae Richard Yan 2025-01-30 01:34:29 -08:00
  • dde3602046 disable prints for virgo gemm Richard Yan 2025-01-30 01:34:22 -08:00
  • 96500e0abc Turn off TENSOR_HOPPER for Virgo flash Hansung Kim 2025-01-30 01:22:50 -08:00
  • a368eb2dae Fix build target for flash ampere Hansung Kim 2025-01-30 01:21:37 -08:00
  • 7f4adfaaa2 Increase SMEM size for flash Hansung Kim 2025-01-30 01:17:38 -08:00
  • 4f12227327 Increase SMEM size for flash Hansung Kim 2025-01-30 01:17:38 -08:00
  • efd2d232fe Merge branch 'ae' into ae-flash-virgo Hansung Kim 2025-01-30 01:16:23 -08:00
  • 30001b7677 Merge branch 'ae' into ae-flash-ampere Hansung Kim 2025-01-30 01:16:21 -08:00
  • 6bdc6af607 Fix branch name and dims for flash script Hansung Kim 2025-01-30 01:15:57 -08:00
  • 5ab2cc4334 Switch to fp32 for flash Hansung Kim 2025-01-30 01:12:32 -08:00
  • b97df2ce6a Switch to fp32 for flash Hansung Kim 2025-01-30 01:12:32 -08:00
  • bc64474114 Update config for flash ampere Hansung Kim 2025-01-30 01:06:54 -08:00
  • e4f8f3481c Merge branch 'ae' into ae-hopper Hansung Kim 2025-01-30 01:05:31 -08:00
  • d86c33acf3 Merge branch 'ae' into ae-volta Hansung Kim 2025-01-30 01:05:27 -08:00
  • b73147cd06 Add compile and operand generate script for flash Hansung Kim 2025-01-30 01:04:20 -08:00
  • 471f89e371 Add arg binary for flash Hansung Kim 2025-01-30 01:02:12 -08:00
  • c7f713c71e Merge branch 'ae' into ae-hopper Hansung Kim 2025-01-30 00:49:23 -08:00
  • b49e8a293c Merge branch 'ae' into ae-volta Hansung Kim 2025-01-30 00:49:19 -08:00
  • 7e1fc54c97 Fix typo in path Hansung Kim 2025-01-30 00:41:42 -08:00
  • b06e345706 Merge branch 'ae' into ae-hopper Hansung Kim 2025-01-30 00:35:10 -08:00
  • 19731b8e2f Merge branch 'ae' into ae-volta Hansung Kim 2025-01-30 00:35:00 -08:00
  • 8a635b5fcb Set TENSOR_HOPPER to 1, add missing markers Hansung Kim 2025-01-30 00:34:13 -08:00
  • 50c8f1c410 Add operand generate script for tcore Hansung Kim 2025-01-29 23:33:09 -08:00
  • afc69507a3 Merge branch 'ae' into ae-volta Richard Yan 2025-01-29 23:31:34 -08:00
  • f23b2a3fcc Merge branch 'ae' into ae-hopper Richard Yan 2025-01-29 23:31:21 -08:00
  • dc46135f66 fix compile tcore script Richard Yan 2025-01-29 23:31:09 -08:00
  • ac34a8f5f5 hopper changes Richard Yan 2025-01-29 22:22:34 -08:00
  • 6e279c905f volta change Richard Yan 2025-01-29 22:16:39 -08:00
  • 91a82c9f0f merge kernel changes from kernels-asplos-ae Richard Yan 2025-01-29 22:11:25 -08:00
  • a61bf257ff modify makefile to point to new locations Richard Yan 2025-01-29 21:27:59 -08:00
  • 0d842a5930 more renaming and cleanup Richard Yan 2025-01-29 21:22:41 -08:00
  • f98cd9bc22 remove old ci Richard Yan 2025-01-29 20:39:47 -08:00
  • d4b78377a1 fix virgo kernel scripts Richard Yan 2025-01-29 20:19:42 -08:00
  • 0e6bcf51f1 cleanup Richard Yan 2025-01-29 18:38:49 -08:00
  • 5ba132e87b regression restructure Richard Yan 2025-01-29 18:28:09 -08:00
  • 3de51577ef Check-in gemmini headers instead of submodule Hansung Kim 2025-01-29 17:08:32 -08:00
  • e86aac3a6f Merge branch 'new-cisc' into kernels-asplos-ae Richard Yan 2025-01-29 17:03:54 -08:00
  • 24894b1712 Merge branch 'new-cisc' of https://github.com/hansungk/vortex into new-cisc Richard Yan 2025-01-29 17:03:05 -08:00
  • d47ef75614 update idle kernel Richard Yan 2025-01-29 17:00:08 -08:00
  • ec41200845 updated no dma gemmini kernel Richard Yan 2025-01-29 16:59:44 -08:00
  • c26558bc93 Add fence before rescale Hansung Kim 2025-01-28 23:48:02 -08:00
  • 198a25cb16 Set NUM_CORES to 8 for Volta/Ampere Hansung Kim 2025-01-28 22:49:36 -08:00
  • f2b5a3409d Merge branch 'new-cisc' into kernels-asplos-ae Hansung Kim 2025-01-28 21:18:12 -08:00
  • 8c45b8b4b7 Merge branch 'new-cisc' of https://github.com/hansungk/vortex-private into new-cisc Richard Yan 2025-01-28 17:14:49 -08:00
  • e43f3c02a9 sgemm_impl: FP_SIZE to 16 Hansung Kim 2025-01-28 17:06:04 -08:00
  • b1e6495630 update kernels Richard Yan 2025-01-28 16:37:22 -08:00
  • d98a414765 Change gemmini_mmio.h to fp16 GEMM setting Hansung Kim 2025-01-28 16:36:55 -08:00
  • e4c0bbd039 sgemm: Check-in argument binaries Hansung Kim 2025-01-28 16:03:32 -08:00
  • 45e9407c99 sgemm: Check-in argument binaries Hansung Kim 2025-01-28 15:58:27 -08:00
  • 9894efe6c9 Update toolchain env paths for dork Hansung Kim 2025-01-28 15:04:14 -08:00
  • 5ef4c8023e sgemm_impl: Disable wmma fast store Hansung Kim 2024-11-11 14:06:15 -08:00
  • 7d7cb5f60a flash: Disable perf loop multiplier Hansung Kim 2024-11-10 22:44:02 -08:00
  • 4448f31fdc fence: Fix moving fence to start of loop Hansung Kim 2024-11-09 22:04:45 -08:00
  • cb916ead39 Fix potential bitwidth bug in compute API Hansung Kim 2024-11-09 20:59:58 -08:00
  • 68054689c9 flash: Move fence to start of loop; wrap all MMIO in one tid=0 branch Hansung Kim 2024-11-09 20:59:26 -08:00
  • fcd8b0b892 flash: Disable rescale flag check Hansung Kim 2024-11-09 20:37:58 -08:00
  • 1c9b022156 flash: Rename nowarpspec to default Hansung Kim 2024-11-09 19:58:45 -08:00
  • 8fe6d918f2 flash: Update tcore kernel to use new CISC Hansung Kim 2024-11-09 19:49:20 -08:00
  • 76a6aaf085 flash: doc update Hansung Kim 2024-11-09 19:09:09 -08:00
  • 673e07ed43 flash: Add non-warp-specialized gemmini flash kernel Hansung Kim 2024-11-09 19:08:39 -08:00
  • ac42f2dbba sgemm_gemmini_dma: Update with new compute API Hansung Kim 2024-11-09 16:49:39 -08:00
  • ad75561efe flash: Reduce fence calls to improve util Hansung Kim 2024-11-09 16:44:17 -08:00
  • 6990fcc1e6 Add compute-and-mvout-to-spad API Hansung Kim 2024-11-09 16:43:45 -08:00
  • 952b8debbb flash: Update to use new CISC interface Hansung Kim 2024-11-09 16:21:34 -08:00
  • dc89309ad0 Merge branch 'kernels-flash' into new-cisc Hansung Kim 2024-11-09 14:42:46 -08:00
  • 365b1d8e67 flash: Add begin end markers Hansung Kim 2024-11-09 10:16:40 -08:00