unifolm-world-model-action

Files

olivame 1d23e5d36d Layer 3: 延迟 decode，只解码 CLIP 需要的 1 帧

- world model 调用 decode_video=False，跳过 16 帧全量 decode
- 只 decode 最后 1 帧给 CLIP embedding / observation queue
- 存 raw latent，循环结束后统一 batch decode 生成最终视频
- 每轮省 15 次 VAE decode，8 轮共省 120 次
- 跳过中间迭代的 wm tensorboard/mp4 保存

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-02-11 07:11:55 +00:00

base_model_inference.py

init commit

2025-09-12 21:53:41 +08:00

eval_utils.py

upload eval_utils.py file

2025-09-16 21:48:15 +08:00

profile_iteration.py

添加三层迭代级性能分析工具 profile_iteration.py

2026-02-10 05:42:11 +00:00

profile_pipeline.py

1. einsum('b i d, b j d -> b i j') → torch.bmm(q, k.transpose(-1,-2)) — 直接映射 rocBLAS batched GEMM