Quantcast
Channel: Mellanox Interconnect Community: Message List
Viewing all 6226 articles
Browse latest View live

Re: iSER driver for ESXi 6.5 - Does it support ConnectX-3 or not

$
0
0

Here are some links that will guide you through the proper installation & implimentation of iSER v1.0.0.1 along with EXSi 6.5 ConnectX-3/Pro Inbox Driver v3.16.0.0

  1. Inbox Drive v3.16.0.0 supports only CX-3/pro adapter
  2. the iSER driver supports all CX-3, CX-4 & CX-5 adapters
  3. iSER driver:

- Release-Note:  http://www.mellanox.com/related-docs/prod_software/Mellanox_MLNX-NATIVE-ESX-iSER_Driver_for_VMware_ESXi_6.5_Release_Notes_v1.0.0.1.pdf

- Instructions on how to install are in the Quick start guide:  http://www.mellanox.com/related-docs/prod_software/Mellanox_MLNX-NATIVE-ESX-iSER_Driver_for_VMware_ESXi_6.5_Quick_Start_Guide_v1.0.0.1.pdf

  1. as for Inbox driver: It is part of the esxi6.5 OS and can be revealed by running: # esxcli software vib list

 

Hope this helps


Re: iSER driver for ESXi 6.5 - Does it support ConnectX-3 or not

$
0
0

Your guide show me a old ESXi 6.0 interface figure and old Global Pause switch configuration.

 

Do you have real world guide for RoCE v1 that on iSER Quick Guide on your site?

 

Best Regard,

Jae-Hoon Choi

How do I disable FEC for MCX416A-CCAT on windows

$
0
0

I have a MCX416A-CCAT I already set the seed to 100Gb and disable auto- negotiation I was wondering if is there a way to disable FEC.

Re: ConnectX-3 WinOF 5.35 on Win2016 Multiple Partitions

$
0
0

I am not sure if this applies, but Linux has a max IPoB MTU of 2048(2044) while Windows has max 4096(4092)

 

When connecting the 2 different OS you must use the smaller MTU 2044. This normally must be configured manually on the Windows Servers.  Failure to set a common MTU in the past yielded poor performance or connectivity issues.

 

It has been a while since we used both together so things may have changed.

Re: Issues with setting up Storage Spaces Direct

$
0
0

We are using S2D with IB on ConnectX 3 cards.  No problems.

 

I am not familiar with the Test-RDMA script.

 

How are you testing RDMA?  I use Windows Performance Monitor.  There are RDMA specific counters which make it easy to track RDMA traffic.  Other than setup and drivers there was not much else to do.

 

Have you installed the latest drivers for you cards?

IB Switch IS5035 MTU Setting?

$
0
0

I am wondering if someone can help with this.

 

We have 2x Mellanox IS5035 Switches running SM.

 

I have set the MTU on all of our Mellanox ConnectX3 Adapters on our Windows 2012 and 2016 Servers to 4092.

We get the following error in the Windows Server 2016 Event log

"According to the configuration under the "Jumbo Packets" advanced property, the MTU configured for device Mellanox ConnectX-3 IPoIB Adapter is 4092. The effective MTU is the supplied value + 4 bytes (for the IPoIB header). This configuration exceeds the MTU reported by OpenSM, which is 2048. This inconsistency may result in communication failures. Please change the MTU of IPoIB or OpenSM, and restart the driver."

 

In the IS5035 under Fabric MGMT > Partitions the Default MTU is set to Default.

 

Is this the correct place to change the SM MTU?

Will it take effect right away?

Will there be any interruption in traffic? These are production servers so I need to know the impact

 

Thanks!

 

Todd

Re: RDS-TOOLS PACKAGE ON MLNX_OFED_LINUX-4.1-1.0.2.0-fc24-x86_64.iso ?

$
0
0

Hi Minouche S.A.

 

Thank you for contacting the Mellanox Community and posting your question.

 

Unfortunately, as mentioned in the Mellanox OFED Driver Release Notes ( http://www.mellanox.com/related-docs/prod_software/Mellanox_OFED_Linux_Release_Notes_4_1-1_0_2_0.pdf  ), Section 2.2 "Unsupported Functionalities/Features/HCAs", Mellanox does not support RDS. Therefor no RDS module and / or RDS related tools are supplied with the latest Mellanox OFED driver. Some older driver versions may contain a version of the RDS module but still Mellanox does not support the use of it.

 

For support and use of RDS, the OS vendors INBOX driver and tools should be used.

 

Thanks.

 

Cheers,

~Martijn

New iSER Driver installation on ESXi 6.5-U1

$
0
0

Hello,

 

We have tried to install the new iSER driver (MLNX-NATIVE-ESX-ISER_1.0.0.1-10EM-650.0.0.4598673.zip) on ESXi 6.5 U1.

But only the software iSCSI shows up in the storage adapter list. The iSER adapter is not visible.

 

The configuration is as follows:

Hardware:

  • - Servers: PowerEdge R720, Cisco C-240 M3 and Lenovo RD650 128 to 192GB RAM.
  • - Dual Xeon
  • - Mellanox ConnectX-3 MCX354A-FCBT adapters
  • Inbox 3.16 Drivers.PNG
  • iSER Module.PNG

Software:

  • - ESXi 6.5-U1 Build 5969303
  • - Mellanox Drivers native drivers 3.16.0.0-1vmw.650.0.0.4564106

The installation of the new iSER drivers is successful and we have issued the following command to ensure that the driver is loaded on boot:

# esxcli system module set --enabled=true --module=iser

All the hosts boot normally and

# esxcfg-module -g iser

Gives:

iser enabled = 1 options = ''

However just to check, after booting if we issue the following command the first time:

# vmkload_mod iser

We get:

Module iser loaded successfully

Which leads to believe that the module was not loaded automatically

Launching the same command a second time leads to:

vmkload_mod: Can not load module iser: module is already loaded

On all 5 hosts we are not able to see the iSER adapter even after several reboots.

Storage Adapters List.PNG

The network adapters are connected to a vDS:

Network Adapters List.PNG

 

Any clue or procedure to clean install everything in order to bring up the iSER adapter ?

Thanks !

 

 

 


Re: ConnectX-3 WinOF 5.35 on Win2016 Multiple Partitions

$
0
0

Two nodes are both Windows 2016.  SM has partition configured as mtu=5.  Windows driver side is also configured with 4096.  In fact, on the later versions of the Windows driver, it will throw out warning message in the event log if partition MTU does not match the driver MTU.  Through my testing, I know for sure FW 2.36.5150 works perfect with driver 5.25.12665.  FW 2.40.7000 does not work with driver 5.25 nor 5.35.  I have yet to see if FW 2.36.5150 works with driver 5.35.  The server is an older unit based on the Intel Tylersburg chipset.

Re: How to test RDMA traffic congestion

$
0
0

Here few steps to try to analyse your congestion problem:

 

 

What is IB congestion?

  • IB congestion is a situation where nodes fail to send data or send rate decreases
  • In most cases when an IB network is experiencing congestion, there will be no packets drops. Just slowness
  • Usually IB congestion is caused by a slow node receiver.

It can also cause by the network itself in cases where the network is blocking by design or due to an issue

How to identify congestion situation:

  • Network is slow. All or some of the nodes packet rate decreases dramatically
  • No packet drops in the fabric. If the network drops packets it is probably not real congestion, just a physical problem that should be locally identified and fixed

 
  

Suspect #1: Physical Layer Issues

  • Ø Ibdiagnet diagnostic

Physical layer issues can cause degraded performance of the fabric. In order to eliminate any impact on the fabric by physical layer issues, fabric cleanup is required.

Information on fabric status and ports’ counters can be collected using the ibdiagnet tool (from the UFM server where we have the ibdiagnet2 version installed):

ibdiagnet -r -pc -P all=1 --pm_pause_time 600 -o <output_dir>

  • It is recommended specifying the output directory so files will not get overwritten
  • Output files can be used in other sections of this technical guide

In the ibdiagnet2.log file, need to look for ports reporting on one or more of the following physical layer issues:

  • link_down_counter – ignoring scheduled servers’ reboot

 

-E- lid=0x0143 dev=51000 xxxxxxxx/U1/P36

Performance Monitor counter : Value

link_down_counter : 3 (threshold=0)

 

  • Links degraded speed and width – links with reduced capability will be reported in the “Speed / Width checks” section

 

Speed / Width checks

-I- Link Speed Check (Compare to supported link speed)

-E- Links Speed Check finished with errors

-E- Link: S0002c902004213d3/N0002c902004213d0(Infiniscale-IV Mellanox Technologies)/P24<-->switch-1137be:IS5030/U1/P32 - Unexpected actual link speed 2.5

 

-I- Link Width Check (Expected value given = 4x)

-E- Links Width Check finished with errors

-E- Link: S0002c902004213d3/N0002c902004213d0(Infiniscale-IV Mellanox Technologies)/P24<-->switch-1137be:IS5030/U1/P32 - Unexpected width, actual link width is 1x

 

  • link_error_recovery_counter

 

-E- lid=0x0009 dev=51000 xxx/U1/P32

Performance Monitor counter : Value

link_error_recovery_counter : 255 (overflow)

 

  • max_retransmission_rate– check for increments during test run. Look for anything greater than threshold of 500 (the threshold mentioned in the example below is set by the ibdiagnet test flag “-P all=1”)

 

-E- Ports counters Difference Check (during run) finished with errors

-E- Sf4521403004d20a0/r xxx/P6 - "max_retransmission_rate" increased during the run (difference value=1,difference allowed threshold=1)

 

  • symbol_error_counter– relevant only for non FDR/FDR10 links

 

-E- lid=0x016e dev=23131 S0008f1040040c018/N0008f10500650e4e/P30

Performance Monitor counter : Value    

symbol_error_counter : 65535      (overflow)

 

  • Ø UFM Port Counters CSV diagnostic

Configuring UFM to collect PortCounters CSV files in gv.cfg configuration file:

[CSV]

max_files= 5

write_interval= 30

ext_ports_only= no

Output files will be saved in this location on the UFM server: /opt/ufm/files/csv/.

  1. Extract the latest file and open with Excel
  2. Form a table
  3. Relevant column for physical layer issues:
    1. E: Width – look for any port without 4x width
    2. T: SymErr – SymbolError. Relevant for non FDR/FDR10 links
    3. U: LinkRecovers
    4. V: LinkDowned
    5. AY: Speed – look for any degraded rate
    6. AZ: Status – look for anything not OK

Device name and port can be found in columns P and B respectively.

 


  

 

Suspect #2: Unresponsive node/s issue 

Looking for unresponsive nodes to fabric MADs. Nodes can get to this situation if there is any issue with OS, driver or card firmware. Once identified, it is recommended that the unresponsive nodes will not participate in any job in the fabric.

If there are any unresponsive nodes in the fabric, we can find them by invoking one of the direct path commands such as iblinkinfo, ibnetdiscover, ibswitches, ibhosts, ibnodes, ethc.

  1. Run one of the direct path commands: iblinkinfo/ibnetdiscover/ibswitches/ibhosts/ibnodes
  2. If there are unresponsive nodes in the fabric, you will get 1 “Connection times out” line per unresponsive node at the start of the command output, with specific direct path to the node

 

Example:

root # ibnetdiscover

src/query_smp.c:197; umad (DR path slid 0; dlid 0; 0,1,18 Attr 0xff90:2) bad status 110; Connection timed out

src/query_smp.c:197; umad (DR path slid 0; dlid 0; 0,1,17 Attr 0xff90:2) bad status 110; Connection timed out

#

# Topology file: generated on Mon Mar  2 17:19:19 2016

#

# Initiated from node f4521403008b9a30 port f4521403008b9a31

  1. Identify the unresponsive node/s:
    1. From the same node where the direct path command invoked, run:

smpquery nd -D <direct_path_without_last_number>

 

Example: for direct path "0,1,18" invoke: "smpquery nd -D 0,1"

 

  1. The unresponsive device is connected to the device outputted in last step by port number as the last number in the direct path

 

Example: for direct path "0,1,18", the unresponsive device will be connected to port 18

 
 

 

Suspect #3: Slow Receivers

  • Nodes that pushes back on data because it can’t process data fast enough
  • A slow node will not give the switch credits to send traffic. The backpressure will spread on to other connected switches by allocating buffer space for delayed traffic

 

Congested links:

  • Indication for a congested link is a link that sends or receive high amount of data (high XmitPacket/RcvPacket) and is also having high rates of XmitWait
  • We can get a clear indication for congestion if: WmitWait / XmitPackets >10

(Ratio between XmitWait and the XmitPacket is bigger than 10)

 

Possible causes for slow receiver:

  • Server resources
    • CPU speed – it is recommended to work with CPU in max performance mode
    • Memory - bad memory dimm or memory section can decrease the server performance. This can only be detected with low-level memory testing utilities
  • PCI connection – degraded Gen (speed) and/or width

 

More information can be found in the Performance Tuning Guide document.

 

  • Ø Detecting slow receivers using PortCounters CSV file

For using this method, the reset counters policy should be reset_every_poll (only data counters will be reset).

 

  1. Extract 2x latest CSV files (by name convention)
  2. Open the 2 files in Excel and format as tables
  3. Copy the XmitWait column from the older file to the new file right next to the XmitWait column in the newer file
  4. Insert new column (NEW_ XmitWait) and calculate the delta between the 2 XmitWait values (we want the number of ticks counted between the 2 files)
  5. In column D (NodeType) select only Switch
  6. In Column AR (PeerPlatform) select only Computer
  7. Insert new column, Congestion Ratio, and add formula of: NEW_ XmitWait/XmitPkts
  8. Sort Congestion Ratio column from largest to smallest
  9. Start from the top on any transmitting port reporting on a ratio greater than 10

 

  • Ø Detecting slow receivers using ibdiagnet2

With this method, manual mapping between GUIDs and hostname is required.

This can be done using the Excel vlookup function and any parsed hostname <-> GUIDs list.

 

  1. Copy the “PM_INFO” data from the f ibdiagnet2.db_csv file to Excel sheet and for a table

 

Example – all other columns are hidden:

 

  1. Calculate the Congestion index = XmitWait / XmitPkt

Using 32/64 bits counters. 64 bit Counters requires additional translation from Hex to Dec


 

 

Example:

 

  1. Complete data & Analyze results

Congestion index: Normalized XmitWait [ticks] = ∆XmitWait  / ∆XmitPackets

  • Avg # of ticks packet waits in Head of Queue

Ports with Congestion index >= 10 should be treated as congested

 

                Example:

               

 

 

Suspect #4: Network issues

  • Ø Routing issue

Routing issues can be investigated by Mellanox support using the following information:

  • ibdiagnet output files 
  • Opensm log
  • Opensm configuration files (/opt/ufm/files/conf/opensm/)
  • ibnetdiscover
  • partitions.conf
  • /opt/ufm/files/log/ opensm-sa.dump
  • Root GUIDs file

 

  • Ø Topology change

Using MSTK:

Missing links or devices can cause degradation in performance.

You can use the /opt/ufm/support/MSTK5.5/Linux/Host-Tools/ib-topology-viewer.sh script on the UFM server for backing up reference topology summary and comparing to any new collected topology summary.

 

  [root@xxx Host-Tools]# ./ib-topology-viewer.sh

 

ib-topology-viewer.sh Version 5.5

 

MF0;xxx:SX6036/U1(0x0002c903004693c1)                                                                          1 HCA ports and 2  switch ports.

SwitchIB Mellanox Technologies(0x7cfe9003009ea930)                                      2  HCA ports and 3  switch ports.

SwitchIB Mellanox Technologies(0x7cfe900300bf8530)                                      1  HCA ports and 1  switch ports.

Using ibnetdiscover:

  1. Cache ibnetdiscover data – this will be the reference data:

ibnetdicover --cache <file>

  1. Compare any new ibnetdiscover to the cached data:

ibnetdiscover --diff <cache_file>

 

Output will contain changed between cached data and new ibnetdiscover output.

 

 

 

BR

Marc

Re: IB Switch IS5035 MTU Setting?

$
0
0

All Mellanox embedded SM's MTU is 2k(2048-4=2044).

Therefore you should change default partition's MTU to 4k(4096-4092).

 

Best Regard,

Jae-Hoon Choi

Re: New iSER Driver installation on ESXi 6.5-U1

$
0
0

You must add iSER adapter on ESXi console, not a software iSCSI adapter.

 

If you execute esxcli rdma iser add command then you will see a iSER adapter.

But if you reboot ESXi host iSER adapter will disappear...

Therefore you should be add a esxcli rdma iser add to /etc/rc.local.d/local.sh

 

I'm also test with my SCST iSER Target, but fail to connect to iSER Target

My Friend also test with LIO Target, StarWind iSER Target, but everytime failed to connect to iSER Target.

My friend use iSER with 1.9.10.5 for ESXi 6.0 driver - that support only ESXi 6.0 - on ESXi 6.0 host successfully.

 

I think this iSER driver 1.0.0.1 for ESXi 6.5 is a beta level addon module...:(

 

Best Regard,

Jae-Hoon Choi

Re: Question about ESXi 6.5 iSER driver with PFC port configuraion.

$
0
0

here are some fine articles that Mellanox published on how to configure PFC on Mellanox switches

For switchX

HowTo Enable PFC on Mellanox Switches (SwitchX)

For Spectrum switch

How to Enable PFC on Mellanox Switches (Spectrum)

 

As for the other issues you raised (iser storage adapter disappeared after every ESXi 6.5 host reboot + iSER initiator failing to connect SCST ) - this I believe requires a whole session of troubleshooting logs & dumps so suggesting that you approach support@mellanox.com, describe the problems in details and get advised by relevant Mellanox support engineers

Re: How to test RDMA traffic congestion

$
0
0

Hi Marc,

 

Thank you for the comprehensive answer, however please note that this is a RoCE(v1) fabric, i.e., there is no IB link-layer, so almost all of the troubleshooting tips you provided do not apply (directly). I would dearly love to see the same sort of guide for RoCE.

 

Cheers,

Blair

Question about ESXi 6.5 iSER PFC direct connection

$
0
0

I have connectx-4 cards currently directly connected to each other. My one host is running a scst storage target and the other is running esxi 6.5 u1. I see the iser driver has been released and i noted the pfc config mentioned. Since there is no switch between the cards, they are direct connect, is pfc needed? If so is there anything i could configure on the nics themselves to meet this requirement?


Re: RoCE not working on Win 2016 (ConnectX-3 Pro)

$
0
0

Firmware version 2.42.500 is now out.

 

I can't see any notes regarding this issue in the release notes, either in the fixes or known bugs....

 

Anyone been able to test?

Re: New iSER Driver installation on ESXi 6.5-U1

$
0
0

Thanks Jae-Hoon !

 

I can now see the iSER adapters.

 

Will try to hook the host to some iSER targets later and post the results here.

 

I hope it can work with the switch ports configured with PFC instead of Global Pause.

solution for design small HPC

$
0
0

we have 35 HP G9 and 40 HP G8 server. Help us for solution to create one or 2 HPCs.

Re: Issues with setting up Storage Spaces Direct

$
0
0

Before the tests that involve I/O system, did you verify that

* TCP/IP connectivity works ( ping?)

* nd_write_bw/nd_read_bw tests are working?

* are you able to run nd_XXXX tests on the same machine? Use two shell windows to run sever in one and client in another?

* what is the failure you are getting when RDMA test fails?

Re: New iSER Driver installation on ESXi 6.5-U1

Viewing all 6226 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>