<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: IFGRP port &amp;quot;imbalance&amp;quot; in ONTAP Hardware</title>
    <link>https://community.netapp.com/t5/ONTAP-Hardware/IFGRP-port-quot-imbalance-quot/m-p/136343#M8538</link>
    <description>&lt;P&gt;Thanks - we have a bunch of clients on these particular nodes (database servers, hypervisors, etc) all separated by VLANs but using the same IFGRP.&amp;nbsp;&amp;nbsp;The node1 stats I captured are somewhat of an anomaly - usually e0e and e0g are right in line on that node.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm hoping to engage with our network engineering team this week to see what they can see.&amp;nbsp; We're Arista-based on the private storage network, but given that Cisco sued Arista for how much they copied IOS, the commands you provided should be pretty close.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The good thing is that we're not saturating any of our 10GbE pipes right now and our latency from the clients is right in alignment with what we'd expect.&amp;nbsp; Just want to make sure we're not leaving throughput on the table going forward.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks again,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Chris&lt;/P&gt;</description>
    <pubDate>Wed, 29 Nov 2017 13:52:14 GMT</pubDate>
    <dc:creator>colsen</dc:creator>
    <dc:date>2017-11-29T13:52:14Z</dc:date>
    <item>
      <title>IFGRP port "imbalance"</title>
      <link>https://community.netapp.com/t5/ONTAP-Hardware/IFGRP-port-quot-imbalance-quot/m-p/136324#M8534</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So I've been scratching my head on this one and was wondering if anyone else out there has run into this.&amp;nbsp; Anyway, we have an AFF8080 (2 node/HA) each with a two 10GbE-port IFGRP configured LACP:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;&lt;EM&gt;node ifgrp distr-func mode up-ports&lt;/EM&gt;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;&lt;EM&gt;-------- ----- ---------- -------------- --------&lt;/EM&gt;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;&lt;EM&gt;node1&amp;nbsp;a0a ip multimode_lacp e0e,e0g&lt;/EM&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;On "node1" we're seeing a good balance of I/O between ports e0e and e0g.&amp;nbsp; Looking at the graphs on OnCommand UM the two ports track eachother pretty well depending on what kind of work is going on.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;On "node2" we're seeing a super-busy e0e port and a mostly idle e0g port.&amp;nbsp; I did a few statit gathers and here's what we're seeing:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;&lt;EM&gt;node1&lt;/EM&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;&lt;EM&gt;e0e&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; recv 313,367,472.46&lt;/EM&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;&lt;EM&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; xmit 221,804,526.36&lt;/EM&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;&lt;EM&gt;e0g&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; recv&amp;nbsp; 31,750,676.82&lt;/EM&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;&lt;EM&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; xmit&amp;nbsp; 16,976,874.19&lt;/EM&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;&lt;EM&gt;node2&lt;/EM&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;&lt;EM&gt;e0e&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; recv 217,023,661.04&lt;/EM&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;&lt;EM&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; xmit 377,199,469.46&lt;/EM&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;&lt;EM&gt;e0g&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; recv&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 4,764.75&lt;/EM&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;&lt;EM&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; xmit&amp;nbsp; 20,303,389.42&lt;/EM&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Ignore the "lower" numbers for e0g on "node1" in the list above - it's usually just as busy.&amp;nbsp; However, e0g on "node2" just never seems to get much traffic.&amp;nbsp; Both ports are healthy (i.e. no dropped packets, no retransmits, etc) so it's just like e0g isn't holding up its part of the ifgrp on "node2".&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've got a note into our network team to see if they can see anything wonky with the port/switch/etc.&amp;nbsp; Any ideas the community might have as to where to start would be greatly appreciated.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Chris&lt;/P&gt;</description>
      <pubDate>Wed, 04 Jun 2025 14:18:21 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Hardware/IFGRP-port-quot-imbalance-quot/m-p/136324#M8534</guid>
      <dc:creator>colsen</dc:creator>
      <dc:date>2025-06-04T14:18:21Z</dc:date>
    </item>
    <item>
      <title>Re: IFGRP port "imbalance"</title>
      <link>https://community.netapp.com/t5/ONTAP-Hardware/IFGRP-port-quot-imbalance-quot/m-p/136325#M8535</link>
      <description>&lt;P&gt;I'm not sure I'd call the data you have balanced but maybe it was just when you grabbed the stats.&amp;nbsp; Do you have lots and lots of clients or just a few?&amp;nbsp; You might want to check the load balancing on the switch side as cisco != netapp load balacing.&amp;nbsp; Maybe it's set incorrectly (as the impalance seems to be inbound to the netapp) or maybe you've just got a couple of clients which happen to be hitting the same port from the switches perspective.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://www.cisco.com/c/en/us/support/docs/lan-switching/etherchannel/12023-4.html" target="_blank"&gt;https://www.cisco.com/c/en/us/support/docs/lan-switching/etherchannel/12023-4.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In that doc there's a command to see what channel data will flow over:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;e.g.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;show channel hash 865 10.10.10.1 10.10.10.2&lt;/P&gt;&lt;P&gt;selected channel port: 1/1&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;That might help to narrow things down if you have a few clients or if you have lots and lots then check the load balance method on the switch&lt;/P&gt;</description>
      <pubDate>Tue, 28 Nov 2017 23:33:00 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Hardware/IFGRP-port-quot-imbalance-quot/m-p/136325#M8535</guid>
      <dc:creator>michael_england</dc:creator>
      <dc:date>2017-11-28T23:33:00Z</dc:date>
    </item>
    <item>
      <title>Re: IFGRP port "imbalance"</title>
      <link>https://community.netapp.com/t5/ONTAP-Hardware/IFGRP-port-quot-imbalance-quot/m-p/136343#M8538</link>
      <description>&lt;P&gt;Thanks - we have a bunch of clients on these particular nodes (database servers, hypervisors, etc) all separated by VLANs but using the same IFGRP.&amp;nbsp;&amp;nbsp;The node1 stats I captured are somewhat of an anomaly - usually e0e and e0g are right in line on that node.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm hoping to engage with our network engineering team this week to see what they can see.&amp;nbsp; We're Arista-based on the private storage network, but given that Cisco sued Arista for how much they copied IOS, the commands you provided should be pretty close.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The good thing is that we're not saturating any of our 10GbE pipes right now and our latency from the clients is right in alignment with what we'd expect.&amp;nbsp; Just want to make sure we're not leaving throughput on the table going forward.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks again,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Chris&lt;/P&gt;</description>
      <pubDate>Wed, 29 Nov 2017 13:52:14 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Hardware/IFGRP-port-quot-imbalance-quot/m-p/136343#M8538</guid>
      <dc:creator>colsen</dc:creator>
      <dc:date>2017-11-29T13:52:14Z</dc:date>
    </item>
    <item>
      <title>Re: IFGRP port "imbalance"</title>
      <link>https://community.netapp.com/t5/ONTAP-Hardware/IFGRP-port-quot-imbalance-quot/m-p/138365#M8641</link>
      <description>&lt;P&gt;Hi Chris,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;As the previous poster alluded to, incoming traffic distribution is controlled by the&amp;nbsp;network switch's&amp;nbsp;load balancing policy, whereas outgoing traffic from a NetApp node is controlled by the&amp;nbsp;NetApp's interface group&amp;nbsp;load balancing policy.&amp;nbsp;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Normally these types of imbalances are caused by&amp;nbsp;unfairness in how the load balancing algorithm is calculated based on&amp;nbsp;too many common&amp;nbsp;third or fourth octets of the incoming IP addresses.&amp;nbsp; Since you mentioned there are multiple VLANs being served up from these port channel groups, that is less likely.&amp;nbsp;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Moving on;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I would validate both the health of the Arista port channel group and what load balancing policy is enabled - mac / ip / port.&amp;nbsp; &amp;nbsp;It is ideal but not critical for the NetApp's load balancing policy to match what is set on the switch.&amp;nbsp; What is really critical is for the load balancing mode to match - dynamic/lacp vs static/multi.&amp;nbsp;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;You can give a heads up to your switch team by providing the exact interface the NetApp is connected to on the switch by using the command &lt;STRONG&gt;network device-discovery,&lt;/STRONG&gt; it is really awesome for stuff like this.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;After you've validated&amp;nbsp;the network looks clean, one thing you may want to try is to&amp;nbsp;eject&amp;nbsp;the&amp;nbsp;quiet port&amp;nbsp;from&amp;nbsp;the&amp;nbsp;port channel group&amp;nbsp; from the NetApp side or the switch side and see if re-adding it back to the LACP will cause it to "wake" up and start receiving more traffic.&amp;nbsp;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Good luck!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Hadrian&lt;/P&gt;</description>
      <pubDate>Thu, 22 Feb 2018 20:07:31 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Hardware/IFGRP-port-quot-imbalance-quot/m-p/138365#M8641</guid>
      <dc:creator>hadrian</dc:creator>
      <dc:date>2018-02-22T20:07:31Z</dc:date>
    </item>
  </channel>
</rss>

