This commit is contained in:
tankya2
2024-02-01 15:21:04 +08:00
committed by yangliwei
parent a61ac21dd4
commit d852fd89e9

View File

@@ -73,10 +73,10 @@ print(result.state())
</pre>
## Multi-Node
Multi-node is enabled by setting either the MPI or NCCL enabled flag to True in the computation settings. Below shows the script to launch multi node on 4 GPU in cluster.
Multi-node is enabled by setting either the MPI or NCCL enabled flag to True in the computation settings. Below shows the script to launch on 2 nodes with 2 GPUs each. $node_list contains the IP of the nodes assigned.
<pre>
```
mpirun -n 4 --mca opal_common_ucx_opal_mem_hooks 1 --mca orte_base_help_aggregate 0 -mca btl ^openib -hostfile $node_list python test.py
mpirun -n 4 -hostfile $node_list python test.py
```
</pre>