Hansung Kim
2036d37840
sgemm_wg: Prevent run-ahead using ternary flags; reduce mem accesses
2024-03-13 21:35:24 -07:00
Hansung Kim
510a834db5
sgemm_wg: Implement software barrier for inter-core synchronization
2024-03-12 15:34:42 -07:00
Hansung Kim
fbe872c831
sgemm_wg: Add missing makefile dep to common.h
2024-03-12 15:34:17 -07:00
Hansung Kim
6f4dfe5a0e
sgemm_wg: Implement 2D threadtiling
2024-02-29 14:40:54 -08:00
Hansung Kim
a06b2dd20e
sgemm_wg: Cleanup & proper unroll
2024-02-28 21:17:42 -08:00
Hansung Kim
46f242e520
sgemm_wg: Constantify BM/BN/BK/TM, computationally set gridsize and TB/core
2024-02-27 22:23:25 -08:00
Hansung Kim
27646bb507
sgemm_wg: Implement multiple C per thread with sliding A/B blocks
2024-02-27 22:06:01 -08:00
Hansung Kim
2b1b5fe537
convolution: Fix write_operand_file after upstream merge
2024-02-27 15:45:22 -08:00
Hansung Kim
f1e7407d3a
sgemm_wg: Run multiple threadblock per core
2024-02-27 15:44:04 -08:00
Richard Yan
5b1c527186
Merge branch 'kernels' of https://github.com/hansungk/vortex into kernels
2024-02-24 00:27:23 -08:00
Richard Yan
914864206a
MMIO gemmini matmul kernel
2024-02-24 00:27:16 -08:00
Hansung Kim
d2da0d3394
sgemm_wg: Parameterize threadblock dimensions
2024-02-17 18:05:59 -08:00
Hansung Kim
301f1ca260
sgemm_wg: Implement blocking over k-dimension
2024-02-16 16:20:57 -08:00
Hansung Kim
5f79e8a3f1
sgemm_wg: reference matmul in cpu
2024-02-12 22:29:38 -08:00
Hansung Kim
6b420aceb6
sgemm_wg: write simple C=A*A matmul
2024-02-12 22:22:28 -08:00
Hansung Kim
a43d5eb1a7
Merge remote-tracking branch 'upstream/master' into kernels
2024-02-12 20:50:32 -08:00
Hansung Kim
6a1a506b64
sgemm_wg: save args and input bin
2024-02-12 20:49:08 -08:00
Hansung Kim
f586ab28df
vecadd: save operand to file
2024-02-12 20:49:08 -08:00
Hansung Kim
5a216ef9ca
flops: unroll by 16
2024-02-12 20:49:08 -08:00
Richard Yan
c258557999
Merge branch 'kernels' of https://github.com/hansungk/vortex into kernels
2024-02-08 17:05:37 -08:00
Richard Yan
12bdab8043
update gemmini matmul kernel
2024-02-08 17:00:19 -08:00
Hansung Kim
ad8bf9b223
Add sgemm_wg C kernel
2024-02-07 21:31:08 -08:00
Hansung Kim
b5bfa7d4b9
Fix bogus spad address
2024-02-01 14:05:13 -08:00
Hansung Kim
0462a91953
Update mmio kernel to do single gemm
2024-02-01 13:52:29 -08:00
Hansung Kim
7f6f1d605f
Add bare mmio kernel
2024-01-24 16:24:19 -08:00
Hansung Kim
a5c50b60c8
Add 'smemcoherence' kernel
2024-01-02 20:28:40 -08:00
Hansung Kim
edb385f138
Increase smem read loop iter
2024-01-01 14:26:19 -08:00
Hansung Kim
ee1ed315e2
Write out operand files in sharedmem kernel
2024-01-01 14:26:10 -08:00
Hansung Kim
1586ea93ef
Add new 'sharedmem' kernel
2024-01-01 13:56:46 -08:00
Blaise Tine
bd18b03cc3
minor update
2023-12-31 15:29:04 -08:00
Blaise Tine
e7f8b40d93
minor update
2023-12-31 11:46:41 -08:00
Blaise Tine
031d24e695
minor updates
2023-12-30 00:52:44 -08:00
Blaise Tine
36f5dd87fe
minor update
2023-12-28 12:22:22 -08:00
Blaise Tine
e217bc2c23
adding tracking for SFU stalls
2023-12-28 12:12:11 -08:00
Blaise Tine
e04e026a14
profiling update
...
minor updates
2023-12-18 04:43:44 -08:00
Blaise Tine
c6845a4c8d
profiling timing optimization
...
minor update
minor update
minor update
2023-12-18 04:43:10 -08:00
Hansung Kim
dd289ef964
Write operand to file in matmul kernel
2023-11-29 11:30:54 -08:00
Blaise Tine
9dc5793046
minor udpate
2023-11-27 02:21:47 -08:00
Blaise Tine
1271c9c03f
minor update
2023-11-27 02:12:12 -08:00
Blaise Tine
2f1171ca76
minor update
2023-11-27 02:04:22 -08:00
Blaise Tine
43154cf738
minor updates
2023-11-16 23:41:59 -08:00
Blaise Tine
ede5e1c311
minor update
2023-11-15 00:28:26 -08:00
Blaise Tine
61e3442ef8
adding opencl convolution benchmark
2023-11-14 22:31:30 -08:00
Blaise Tine
4e7a536918
adding tensor regression test.
2023-11-14 05:37:46 -08:00
Blaise Tine
62cdd8e993
minor update
2023-11-11 15:49:39 -08:00
Blaise Tine
c1e168fdbe
Vortex 2.0 changes:
...
+ Microarchitecture optimizations
+ 64-bit support
+ Xilinx FPGA support
+ LLVM-16 support
+ Refactoring and quality control fixes
minor update
minor update
minor update
minor update
minor update
minor update
cleanup
cleanup
cache bindings and memory perf refactory
minor update
minor update
hw unit tests fixes
minor update
minor update
minor update
minor update
minor update
minor udpate
minor update
minor update
minor update
minor update
minor update
minor update
minor update
minor updates
minor updates
minor update
minor update
minor update
minor update
minor update
minor update
minor updates
minor updates
minor updates
minor updates
minor update
minor update
2023-11-10 02:47:05 -08:00
Blaise Tine
b9cda8fca7
minor update
2023-05-15 20:19:14 -04:00
Blaise Tine
e1b666cb93
minor update
2022-07-14 08:55:09 -04:00
Blaise Tine
2277e3c878
minor update
2022-02-05 17:59:58 -05:00
Santosh Srivatsan
b7e5a83ba3
Merged branch xlen-parameterization into staging
2022-02-05 13:47:42 -05:00