Hansung Kim
6a3aa549d3
Add skeleton for Hopper Tensor Core
2024-10-14 15:02:25 -07:00
Hansung Kim
447977bd89
addResource hopper tensor core
2024-10-14 15:02:08 -07:00
Richard Yan
0989d90dd2
connect tc nodes and maybe fix distributor node
2024-10-07 02:59:22 -07:00
Richard Yan
4f057c6994
Merge branch 'main' of https://github.com/ucb-bar/radiance into main
2024-10-05 02:48:48 -07:00
Richard Yan
c6df484c00
add tensor core read client
2024-10-05 02:48:47 -07:00
Hansung Kim
b4bd9ecbc9
Dummy comment
2024-10-02 15:18:56 -07:00
Richard Yan
2929a84ecc
get smem params from key
2024-09-26 16:49:06 -07:00
Richard Yan
f11385218f
move virgo components into shared mem module, more cleanup
2024-09-26 14:41:46 -07:00
Richard Yan
998f73b54a
general cleanup
2024-09-24 18:17:00 -07:00
Richard Yan
3b8c9812b4
refactor smem counter
2024-09-24 17:24:52 -07:00
Richard Yan
85336399c2
refactor radiance cluster shared memory into components
2024-09-24 03:14:32 -07:00
Richard Yan
20cf4609b7
camelCase
2024-09-22 01:21:37 -07:00
Richard Yan
daacae9edc
fallback for hint select
2024-09-11 15:09:52 -07:00
Richard Yan
f1a1b77828
actually support large smem subbanks
2024-09-10 23:24:02 -07:00
Richard Yan
13142ab0b9
Merge branch 'main' of https://github.com/ucb-bar/radiance into main
2024-09-10 18:30:54 -07:00
Richard Yan
810db6a1ea
new crossbar w/ individual select and group hint, subbanks > num lanes support
2024-09-10 18:30:48 -07:00
Hansung Kim
b335132c34
Parameterize tensor core FP16
2024-09-10 15:38:12 -07:00
Richard Yan
3fd0fd296b
queued cisc commands
2024-09-09 22:38:16 -07:00
Richard Yan
06edba2a78
fix comb loop & revert xbar temporarily
2024-09-09 02:27:08 -07:00
Richard Yan
afc6ba7eca
fix ext policy xbar, add rectangular tile support
2024-09-08 13:21:31 -07:00
Richard Yan
378b3531d4
balanced shared memory across cores
2024-09-07 20:29:27 -07:00
Richard Yan
84972181a5
large smem size, fix single gemmini, bump vortex
2024-09-05 16:50:03 -07:00
Hansung Kim
24df14d7af
Bump vortex
2024-08-28 16:23:49 -07:00
Hansung Kim
e31f25b432
Switch to FP32 tensor core for use in flash
2024-08-28 16:23:27 -07:00
Hansung Kim
ec0c8750d3
Bump vortex
2024-08-20 14:47:18 -07:00
Hansung Kim
2364cd213e
Bump vortex
2024-08-15 13:40:09 -07:00
Hansung Kim
d8823a0416
Add back generated verilog for FP32 TensorDPU
2024-08-12 19:52:13 -07:00
Hansung Kim
7b06c1778c
Bump vortex
2024-08-07 11:30:09 -07:00
Hansung Kim
c1d95ff205
Revert rename
2024-08-07 11:29:42 -07:00
Hansung Kim
477f3955ed
Update generated SV for tensordpu
2024-08-07 11:09:57 -07:00
Hansung Kim
32c7aed263
Fix fp exception by rounding right after MulRawFN
2024-08-07 11:09:55 -07:00
Richard Yan
b7a342fcf6
bump vortex
2024-08-06 02:33:48 -07:00
Richard Yan
b7239917c0
temporary fix to rename fp units
2024-08-06 02:33:13 -07:00
Richard Yan
af60ae3332
fp16 gemmini support
2024-08-06 02:32:35 -07:00
Richard Yan
0d9c2ca6ad
rename TensorDPU.scala
2024-08-06 02:30:37 -07:00
Hansung Kim
d8674b753e
Support fp16 input, fp32 output in TensorDPU
TODO: could be improved by handling values in raw format as much as
possible.
2024-08-01 22:01:26 -07:00
Hansung Kim
ee79fdaa05
Fix typetag for half
2024-07-31 17:17:03 -07:00
Hansung Kim
4e3ed96c94
Support fp16 operand/accum in TensorDPU
TODO: fp32 accum
2024-07-31 16:37:01 -07:00
Hansung Kim
33aa4e08db
Bump vortex
2024-07-26 16:27:40 -07:00
Hansung Kim
42285c7193
Instantiate StallingPipe as a separate module
To allow having flop/mac breakdown in area/power reports.
2024-07-25 16:29:08 -07:00
Hansung Kim
79604f51be
Fix possible CIRCT bug on SourceGenerator
When migrated to amd3 (possibly with a newer CIRCT version), a new bug
shows up where storing both meta and valid into a single table doesn't
work, since writing meta writes {1'b0, meta} to the whole row of the
table, overwriting the valid bit. Work around this by creating separate
tables for the meta and valid bits.
While at it, remove use of outdated NewSourceGenerator in VortexBank.
2024-07-23 15:09:03 -07:00
Hansung Kim
4566f954ee
Update doc; move code out of copypaste block
2024-07-23 15:05:09 -07:00
Hansung Kim
37fd0c6200
Bump vortex
2024-07-19 16:39:32 -07:00
Hansung Kim
21baeae758
Separate read and write counter for smem
2024-07-02 14:52:58 -07:00
Hansung Kim
7aad800a2d
Fix module imports after rocket-chip bump
2024-06-27 17:17:27 -07:00
Hansung Kim
e1342e431c
Bump radiance
2024-06-26 14:28:31 -07:00
Hansung Kim
018167480d
Add access counter to smem banks
2024-06-26 14:28:31 -07:00
Richard Yan
74cd633246
Merge branch 'main' of https://github.com/ucb-bar/radiance into main
2024-06-20 01:45:18 -07:00
Richard Yan
5eb581c7e5
different sized gemminis
2024-06-20 01:45:14 -07:00
Hansung Kim
5a79d7ff79
Add new opcodes for tensor-core DMA
2024-06-12 09:52:59 -07:00