This commit is contained in:
tankya2
2024-02-01 15:21:04 +08:00
committed by yangliwei
parent a61ac21dd4
commit d852fd89e9

View File

@@ -73,10 +73,10 @@ print(result.state())
</pre> </pre>
## Multi-Node ## Multi-Node
Multi-node is enabled by setting either the MPI or NCCL enabled flag to True in the computation settings. Below shows the script to launch multi node on 4 GPU in cluster. Multi-node is enabled by setting either the MPI or NCCL enabled flag to True in the computation settings. Below shows the script to launch on 2 nodes with 2 GPUs each. $node_list contains the IP of the nodes assigned.
<pre> <pre>
``` ```
mpirun -n 4 --mca opal_common_ucx_opal_mem_hooks 1 --mca orte_base_help_aggregate 0 -mca btl ^openib -hostfile $node_list python test.py mpirun -n 4 -hostfile $node_list python test.py
``` ```
</pre> </pre>