NVIDIA’s TensorRT-LLM MultiShot Enhances AllReduce Performance with NVSwitch