Default Branch

8c1f4d8108 · 迁移C算子的循环融合和临时量消除 · Updated 2026-03-03 16:20:15 +08:00

Branches

082f9c3423 · feat: Implement hybrid MPI+OpenMP parallelization · Updated 2026-02-06 13:25:07 +08:00    gh0s7

95
1

79af79d471 · baseline updated · Updated 2026-02-05 19:53:55 +08:00    gh0s7

102
0
Included

6fffaa13f6 · Optimize buffer_width dynamically based on FD order to improve scalability · Updated 2026-01-31 19:04:19 +08:00    gh0s7

95
5

95575d9450 · fix: try to fix segfault at 240 steps by adding WithShell guard for writecheck_sh call · Updated 2026-01-22 14:26:41 +08:00    ianchb

104
7

d11eaa2242 · Optimize bssn_rhs.f90: Fuse loops for metric inversion and Christoffel symbols to improve cache locality · Updated 2026-01-21 11:22:33 +08:00    gh0s7

95
3

ed89bc029b · Fix potential division by zero in reta_val calculation and enable NaN checks · Updated 2026-01-19 20:29:48 +08:00    gh0s7

96
5

3914659ebb · Optimize BSSN RHS and finite difference calculations · Updated 2026-01-19 10:49:14 +08:00    gh0s7

96
3
cjy

75be0968fc · feat: port GPU code to CUDA 13 and enable GPU computation · Updated 2026-01-14 02:15:49 +08:00    gh0s7

104
5