High-performance Computing (HPC) Networking
Cisco DNA / Arista CloudVision® for H100 InfiniBand Solution
Overview
This solution is built on the NVIDIA® H100 GPU and integrates with the Cisco DNA or Arista CloudVision® management platform to provide an InfiniBand network architecture tailored for high-performance computing (HPC). It comprises three networks: the InfiniBand network, the management network, and the storage network.
InfiniBand
InfiniBand is a high-performance networking technology primarily used in HPC environments, data centers, and enterprise networks.
Purpose: InfiniBand is designed to deliver high-speed data transfer, low latency, and high throughput for communication between servers and storage systems.
In-band Management
In-band management involves using the same network that handles regular data traffic to manage network devices and systems.
Purpose: This method allows administrators to perform tasks such as configuration, monitoring, and troubleshooting over the same network used for normal data communication.
Out-of-band Management
Out-of-band (OOB) management uses a dedicated management network, separate from the regular data network, to manage network devices and systems.
Purpose: OOB management provides a separate pathway for management tasks, ensuring that administrators can access network devices even if the primary data network is down.
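The difference between the two management paths can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not part of either vendor's platform: the `Device` class, the `pick_management_address` function, and the addresses used with them are hypothetical, and the policy shown (use in-band while the data network is up, fall back to OOB otherwise) is only one possible choice.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Device:
    """Hypothetical record for a managed switch or server."""
    name: str
    inband_addr: str           # address reachable over the regular data network
    oob_addr: Optional[str]    # address on the dedicated management network, if any


def pick_management_address(device: Device, data_network_up: bool) -> Tuple[str, str]:
    """Return (path, address) to use for a management session.

    Assumed policy: prefer in-band while the data network is healthy,
    and fall back to the out-of-band path when it is not.
    """
    if data_network_up:
        return ("in-band", device.inband_addr)
    if device.oob_addr is not None:
        return ("out-of-band", device.oob_addr)
    raise ConnectionError(f"{device.name}: data network down and no OOB path")
```

A device without an OOB address becomes unreachable as soon as the data network fails, which is the situation a dedicated management network is meant to avoid.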
InfiniBand Network Components
Powered by NVIDIA® H100 GPUs and InfiniBand switches, the InfiniBand network offers ultra-low latency and high bandwidth. It achieves lossless transmission through credit-based link-level flow control: a sender transmits only when the receiver has advertised free buffer space.
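InfiniBand link-level flow control is credit based, and the idea behind lossless transmission can be illustrated with a toy model. The sketch below is a simplification for illustration only: the `CreditLink` class and its methods are hypothetical names, a single counter stands in for the credit exchange, and real hardware tracks credits per virtual lane.

```python
from collections import deque


class CreditLink:
    """Toy model of credit-based link-level flow control (illustrative only)."""

    def __init__(self, receiver_buffers: int):
        # The sender starts with one credit per free receive buffer.
        self.credits = receiver_buffers
        self.rx_queue = deque()

    def send(self, packet) -> bool:
        """Transmit only if a credit is available; otherwise the sender waits."""
        if self.credits == 0:
            return False           # nothing is sent, so nothing can be dropped
        self.credits -= 1
        self.rx_queue.append(packet)
        return True

    def receiver_drain(self):
        """Receiver consumes one packet and returns a credit to the sender."""
        if not self.rx_queue:
            return None
        packet = self.rx_queue.popleft()
        self.credits += 1
        return packet
```

Because the sender never transmits without a credit, the receive queue in this model cannot overflow; congestion shows up as the sender waiting rather than as packet loss.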
Management Network
Cisco and Arista switches leverage the advanced capabilities of the Cisco DNA or Arista CloudVision management platforms. These platforms enable customers to efficiently provision, monitor, manage, proactively troubleshoot, and maintain their HPC infrastructure, resulting in higher utilization and reduced overall operational expenses.
Storage Network
Cisco and Arista switches support BGP, whose routing-policy controls help keep storage traffic on the optimal, low-latency forwarding path. The switches are flexible and can scale to meet specific capacity and bandwidth requirements.
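How BGP attributes steer traffic onto a preferred path can be sketched with a heavily simplified best-path comparison. The code below is an assumption-laden illustration, not the decision process of any particular switch: the `Route` class and `best_path` function are hypothetical, only three attributes are compared (higher local preference, then shorter AS path, then lower MED), and real implementations apply additional tie-breakers and restrict when MED is compared.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Route:
    """Hypothetical, reduced view of a BGP route to one prefix."""
    next_hop: str
    local_pref: int      # higher is preferred
    as_path: List[int]   # shorter is preferred
    med: int             # lower is preferred


def best_path(routes: List[Route]) -> Route:
    """Pick a route using a simplified subset of BGP best-path rules."""
    if not routes:
        raise ValueError("no candidate routes")
    return min(
        routes,
        key=lambda r: (-r.local_pref, len(r.as_path), r.med),
    )
```

In this model, raising `local_pref` on routes learned through one uplink is enough to pull storage traffic onto that uplink, even when another path has a shorter AS path.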
RoCE Computing
Enhance HPC Network with a 400G RoCE Lossless Solution
This solution provides a 400G interconnect between QSFP-DD switches and OSFP network cards, resolving compatibility issues between the two port form factors. It is designed for HPC network topologies and covers the RoCE compute network, the management network, and the storage network to meet diverse business requirements.