<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Latency issue with Windows Failover Cluster role failover and FAS2240-2/Data ONTAP 8.1.4 7-Mode in ONTAP Discussions</title>
    <link>https://community.netapp.com/t5/ONTAP-Discussions/Latency-issue-with-Windows-Failover-Cluster-role-failover-and-FAS2240-2-Data/m-p/131782#M28709</link>
    <description>&lt;P&gt;Greetings,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've found an issue involving our specific filer model and ONTAP version (FAS2240-2/ONTAP 8.1.4 7-Mode) with a new implementation that we're testing, and I'm hoping that someone could provide some thoughts. When using LUNs created on this filer in this implementation, manually failing over a file server role between the nodes of a two-node Server 2016 Windows Failover Cluster (WFC) using in-guest iSCSI consistently takes around 12 minutes to complete. During the failover between these WFC nodes, a LUN reset request is sent by the MS iSCSI initiator to our filer, and the connection to the disk is reestablished within the Windows environment.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have tested the same configuration on an old filer/ONTAP version (FAS2020/ONTAP 7.3.5.1) and we do not experience the 12-minute failover time. The failover of the file server role happens within seconds, as expected. The only part of the configuration that changes to reproduce the long failover time is which filer the Windows source and destination disks are hosted on.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The implementation uses a newer Microsoft block-level replication technology called Storage Replica. Our configuration involves two Windows Server 2016 DCE nodes in a Windows Failover Cluster, with each node using the in-guest MS iSCSI initiator and SnapDrive 7.1.4 x64. Each node is connected to one separate LUN for data (2TB) and one separate LUN for logging (25GB), making four LUNs total, each thin-provisioned with SnapDrive. 
The four disks are then added to the Windows Failover Cluster and a File Server role is created using one of the 2TB disks as the source disk. Replication is then successfully enabled between the identically-sized disks using the Storage Replica wizard, to create a source and destination for replication. The role is supposed to fail over to the other node (destination) within seconds, but this operation takes around 12 minutes on our specific filer and ONTAP version. As stated previously, the long failover does not happen on an older filer with an older ONTAP version.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We have a total of four FAS2240-2 filers; each pair is in an HA configuration, and the two pairs reside at different physical sites. I have tested hosting the storage in this configuration across the physical sites and have also isolated the configuration to each individual site, and I consistently see the same long failover time of the file server role with the FAS2240-2/ONTAP 8.1.4 7-Mode filers. The older filer is a FAS2020 pair in an HA configuration, running ONTAP 7.3.5.1. The long failover time does not happen when hosting the storage in this configuration on the older filer.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Since we are currently on 8.1.4 7-Mode, we are unable to get support because the version falls under EOVS. We intend to move to a newer version when possible so that we can open a support case. However, in the meantime, we've been scratching our heads on this one and are hoping that someone on the NetApp forums has some ideas or thoughts. I would be happy to answer any additional questions.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
    <pubDate>Wed, 04 Jun 2025 14:58:16 GMT</pubDate>
    <dc:creator>Wonkins</dc:creator>
    <dc:date>2025-06-04T14:58:16Z</dc:date>
    <item>
      <title>Latency issue with Windows Failover Cluster role failover and FAS2240-2/Data ONTAP 8.1.4 7-Mode</title>
      <link>https://community.netapp.com/t5/ONTAP-Discussions/Latency-issue-with-Windows-Failover-Cluster-role-failover-and-FAS2240-2-Data/m-p/131782#M28709</link>
      <description>&lt;P&gt;Greetings,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've found an issue involving our specific filer model and ONTAP version (FAS2240-2/ONTAP 8.1.4 7-Mode) with a new implementation that we're testing, and I'm hoping that someone could provide some thoughts. When using LUNs created on this filer in this implementation, manually failing over a file server role between the nodes of a two-node Server 2016 Windows Failover Cluster (WFC) using in-guest iSCSI consistently takes around 12 minutes to complete. During the failover between these WFC nodes, a LUN reset request is sent by the MS iSCSI initiator to our filer, and the connection to the disk is reestablished within the Windows environment.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have tested the same configuration on an old filer/ONTAP version (FAS2020/ONTAP 7.3.5.1) and we do not experience the 12-minute failover time. The failover of the file server role happens within seconds, as expected. The only part of the configuration that changes to reproduce the long failover time is which filer the Windows source and destination disks are hosted on.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The implementation uses a newer Microsoft block-level replication technology called Storage Replica. Our configuration involves two Windows Server 2016 DCE nodes in a Windows Failover Cluster, with each node using the in-guest MS iSCSI initiator and SnapDrive 7.1.4 x64. Each node is connected to one separate LUN for data (2TB) and one separate LUN for logging (25GB), making four LUNs total, each thin-provisioned with SnapDrive. 
The four disks are then added to the Windows Failover Cluster and a File Server role is created using one of the 2TB disks as the source disk. Replication is then successfully enabled between the identically-sized disks using the Storage Replica wizard, to create a source and destination for replication. The role is supposed to fail over to the other node (destination) within seconds, but this operation takes around 12 minutes on our specific filer and ONTAP version. As stated previously, the long failover does not happen on an older filer with an older ONTAP version.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We have a total of four FAS2240-2 filers; each pair is in an HA configuration, and the two pairs reside at different physical sites. I have tested hosting the storage in this configuration across the physical sites and have also isolated the configuration to each individual site, and I consistently see the same long failover time of the file server role with the FAS2240-2/ONTAP 8.1.4 7-Mode filers. The older filer is a FAS2020 pair in an HA configuration, running ONTAP 7.3.5.1. The long failover time does not happen when hosting the storage in this configuration on the older filer.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Since we are currently on 8.1.4 7-Mode, we are unable to get support because the version falls under EOVS. We intend to move to a newer version when possible so that we can open a support case. However, in the meantime, we've been scratching our heads on this one and are hoping that someone on the NetApp forums has some ideas or thoughts. I would be happy to answer any additional questions.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Wed, 04 Jun 2025 14:58:16 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Discussions/Latency-issue-with-Windows-Failover-Cluster-role-failover-and-FAS2240-2-Data/m-p/131782#M28709</guid>
      <dc:creator>Wonkins</dc:creator>
      <dc:date>2025-06-04T14:58:16Z</dc:date>
    </item>
    <item>
      <title>Re: Latency issue with Windows Failover Cluster role failover and FAS2240-2/Data ONTAP 8.1.4 7-Mode</title>
      <link>https://community.netapp.com/t5/ONTAP-Discussions/Latency-issue-with-Windows-Failover-Cluster-role-failover-and-FAS2240-2-Data/m-p/132105#M28789</link>
      <description>&lt;P&gt;Not sure if it's related, but take a look at this BURT:&lt;/P&gt;&lt;P&gt;&lt;A href="http://mysupport.netapp.com/NOW/cgi-bin/bol?Type=Detail&amp;amp;Display=605236" target="_blank"&gt;http://mysupport.netapp.com/NOW/cgi-bin/bol?Type=Detail&amp;amp;Display=605236&lt;/A&gt;&lt;/P&gt;&lt;P&gt;It is fixed in 8.1.4P1. Could you try a patch release of 8.1.4?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hopefully this helps.&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Jeff&lt;/P&gt;</description>
      <pubDate>Wed, 21 Jun 2017 10:44:03 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Discussions/Latency-issue-with-Windows-Failover-Cluster-role-failover-and-FAS2240-2-Data/m-p/132105#M28789</guid>
      <dc:creator>Jeff_Yao</dc:creator>
      <dc:date>2017-06-21T10:44:03Z</dc:date>
    </item>
    <item>
      <title>Re: Latency issue with Windows Failover Cluster role failover and FAS2240-2/Data ONTAP 8.1.4 7-Mode</title>
      <link>https://community.netapp.com/t5/ONTAP-Discussions/Latency-issue-with-Windows-Failover-Cluster-role-failover-and-FAS2240-2-Data/m-p/132125#M28791</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;a href="https://community.netapp.com/t5/user/viewprofilepage/user-id/9673"&gt;@Jeff_Yao&lt;/a&gt;, I think &lt;a href="https://community.netapp.com/t5/user/viewprofilepage/user-id/60430"&gt;@Wonkins&lt;/a&gt; is referring to the Windows failover rather than the filer failover.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;a href="https://community.netapp.com/t5/user/viewprofilepage/user-id/60430"&gt;@Wonkins&lt;/a&gt;, you mentioned that you see LUN resets. Are they reported in EMS? Could you share a packet trace (pktt) from the filer side, taken while you fail over?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Gidi&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 21 Jun 2017 14:18:24 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Discussions/Latency-issue-with-Windows-Failover-Cluster-role-failover-and-FAS2240-2-Data/m-p/132125#M28791</guid>
      <dc:creator>GidonMarcus</dc:creator>
      <dc:date>2017-06-21T14:18:24Z</dc:date>
    </item>
    <item>
      <title>Re: Latency issue with Windows Failover Cluster role failover and FAS2240-2/Data ONTAP 8.1.4 7-Mode</title>
      <link>https://community.netapp.com/t5/ONTAP-Discussions/Latency-issue-with-Windows-Failover-Cluster-role-failover-and-FAS2240-2-Data/m-p/132179#M28805</link>
      <description>&lt;P&gt;&lt;a href="https://community.netapp.com/t5/user/viewprofilepage/user-id/9673"&gt;@Jeff_Yao&lt;/a&gt;,&amp;nbsp;&lt;a href="https://community.netapp.com/t5/user/viewprofilepage/user-id/38137"&gt;@GidonMarcus&lt;/a&gt;&amp;nbsp;is correct. The issue described in my post concerns failover of the Windows Failover Cluster file server role, not filer failover.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;a href="https://community.netapp.com/t5/user/viewprofilepage/user-id/38137"&gt;@GidonMarcus&lt;/a&gt;, I noticed the LUN reset notice in the Syslog on the filer hosting the destination disk of the failover. I did check EMS and found the same message that I saw in the Syslog. The specific message in EMS is:&amp;nbsp;&lt;BR /&gt;&amp;lt;iscsi_notice_1 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;m="Initiator (iqn.1991-05.com.microsoft:server) sent LUN Reset request, aborting all SCSI commands on lun X"/&amp;gt;.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 22 Jun 2017 19:55:27 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Discussions/Latency-issue-with-Windows-Failover-Cluster-role-failover-and-FAS2240-2-Data/m-p/132179#M28805</guid>
      <dc:creator>Wonkins</dc:creator>
      <dc:date>2017-06-22T19:55:27Z</dc:date>
    </item>
  </channel>
</rss>

