(1) I found the GPU and NIC map error on my env, I am using RCCL to run 16 GPU cards according 4 NICs each NODE. As the TOPO file searched by RCCL is error, we need ...