|
|
7064ebd5b4
|
Batch GPU stage downloads
|
2026-04-12 21:06:41 +08:00 |
|
|
|
db2d6978b2
|
Reduce final GPU host downloads
|
2026-04-12 18:46:42 +08:00 |
|
|
|
c8977d8356
|
Optimize GPU RK4 stage sync path
|
2026-04-12 18:36:05 +08:00 |
|
|
|
d9287ea530
|
Fix GPU RK4 boundary and sync correctness
|
2026-04-12 12:13:47 +08:00 |
|
|
|
e1a0bff43c
|
Reduce redundant GPU host buffer preparation
|
2026-04-09 21:20:45 +08:00 |
|
|
|
4463f1d23e
|
Unpack intermediate sync stages directly to GPU
|
2026-04-09 19:01:12 +08:00 |
|
|
|
4484635f0d
|
Pack sync send buffers directly from GPU state
|
2026-04-09 18:49:11 +08:00 |
|
|
|
5bc67ded06
|
Download staged GPU sync regions incrementally
|
2026-04-09 18:23:05 +08:00 |
|
|
|
3b16795e78
|
Refresh synced GPU regions incrementally
|
2026-04-09 17:07:31 +08:00 |
|
|
|
5b00d49070
|
Reduce staged GPU host-device copies
|
2026-04-09 16:44:08 +08:00 |
|
|
|
e1e3b4a448
|
Reduce GPU RK4 transfer overhead
|
2026-04-09 12:11:40 +08:00 |
|
|
|
49409645c0
|
Stabilize GPU output path and MPI sync
|
2026-04-09 10:57:49 +08:00 |
|
|
|
ea470737db
|
Add runnable GPU main-path prototype
|
2026-04-08 19:14:37 +08:00 |
|