LSU dedup is useful for sharedmem for which we don't have a coalescer on, esp. when broadcasting a single value that's cached in smem to all threads in the kernel.
2.7 KiB
2.7 KiB