User Tools

Site Tools


network_stuff:infiniband

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
network_stuff:infiniband [2025/07/07 15:16] jotasandokunetwork_stuff:infiniband [2025/10/03 16:04] (current) jotasandoku
Line 12: Line 12:
   * RDMA provides access to the memory from one computer to the memory of another computer without involving either computer’s operating system. This technology enables high-throughput and low-latency networking with low CPU utilization.   * RDMA provides access to the memory from one computer to the memory of another computer without involving either computer’s operating system. This technology enables high-throughput and low-latency networking with low CPU utilization.
     * Mellanox provides RDMA via the OFED package     * Mellanox provides RDMA via the OFED package
-  * lid : local indentifier (All devices in a subnet have a Local Identifier (LID)). Routing between different subnets is done on the basis of a Global Identifier (GID)+  * **LID** : local indentifier (All devices in a subnet have a Local Identifier (LID)). Routing between different subnets is done on the basis of a **Global Identifier (GID)** 
 +  * GID: Is another identifier but is to route BETWEEN SUBNETS. Contains : Subnet Prefix and a GUID (Global Unique Identifier).
   * NSD (Network Shared Disks): In our context, NSD is the server that connects to the storage via the Mellanox switch. The servers share the NSD's to the clients, creating some sort of distributed logical disk (a bit like the hyperflex technology). Particuartly in our setupm the servers dont share their local disks but they expose the DDN's disks.   * NSD (Network Shared Disks): In our context, NSD is the server that connects to the storage via the Mellanox switch. The servers share the NSD's to the clients, creating some sort of distributed logical disk (a bit like the hyperflex technology). Particuartly in our setupm the servers dont share their local disks but they expose the DDN's disks.
-  * SM (Subnet Manager):  It performs the InfiniBand specification's required tasks for initializing InfiniBand hardware. One SM must be running for each InfiniBand subnet. It's run by the OpenSM daemon which can run bith in the switches and  the servers+  * **SM (Subnet Manager):**  It performs the InfiniBand specification's required tasks for initializing InfiniBand hardware. One SM must be running for each InfiniBand subnet. It's run by the OpenSM daemon which can run bith in the switches and  the servers
     * SM master is the node truly acting as SM. The node with the highest priority [0-15] wins.     * SM master is the node truly acting as SM. The node with the highest priority [0-15] wins.
     * In our setup, servers all have priority 14 while switch has priority 15.     * In our setup, servers all have priority 14 while switch has priority 15.
network_stuff/infiniband.1751901408.txt.gz · Last modified: by jotasandoku