<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Partial Writes with VMWare ESX hosts in VMware Solutions Discussions</title>
    <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14353#M1492</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Here's our TR on how to align filesystems&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="http://media.netapp.com/documents/tr-3747.pdf" target="_blank"&gt;http://media.netapp.com/documents/tr-3747.pdf&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Hope this helps.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Amrita&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 29 Apr 2010 09:24:50 GMT</pubDate>
    <dc:creator>amritad</dc:creator>
    <dc:date>2010-04-29T09:24:50Z</dc:date>
    <item>
      <title>Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14320#M1479</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;!--[if gte mso 10]&gt;
&lt;style&gt;
 /* Style Definitions */
 table.MsoNormalTable
 {mso-style-name:"Table Normal";
 mso-tstyle-rowband-size:0;
 mso-tstyle-colband-size:0;
 mso-style-noshow:yes;
 mso-style-priority:99;
 mso-style-qformat:yes;
 mso-style-parent:"";
 mso-padding-alt:0cm 5.4pt 0cm 5.4pt;
 mso-para-margin-top:0cm;
 mso-para-margin-right:0cm;
 mso-para-margin-bottom:10.0pt;
 mso-para-margin-left:0cm;
 line-height:115%;
 mso-pagination:widow-orphan;
 font-size:11.0pt;
 font-family:"Calibri","sans-serif";
 mso-ascii-font-family:Calibri;
 mso-ascii-theme-font:minor-latin;
 mso-fareast-font-family:"Times New Roman";
 mso-fareast-theme-font:minor-fareast;
 mso-hansi-font-family:Calibri;
 mso-hansi-theme-font:minor-latin;}
&lt;/style&gt;
&lt;![endif]--&gt;&lt;/P&gt;&lt;P class="MsoNormal"&gt;We are experiencing performance problems in our environment and it points to partial writes. We are seeing back-to-back CPs that is causing a spike in latency across all volumes on a filer to above 500ms.&lt;/P&gt;&lt;P class="MsoNormal"&gt;We have contacted NetApp support and they have said yes it is partial writes and it is probably caused by ESX. The filer&lt;SPAN&gt;&amp;nbsp; &lt;/SPAN&gt;is almost dedicated to ESX so it has to be ESX, we know all our VMs are unaligned but short of aligning 1000’s of VMs we want to target a few that are causing the most havoc.&lt;/P&gt;&lt;P class="MsoNormal"&gt;How can we narrow it down to a VM level accurately, which ones are causing us the most pain?&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;Cheers.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 05 Jun 2025 07:18:30 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14320#M1479</guid>
      <dc:creator>braidenjudd</dc:creator>
      <dc:date>2025-06-05T07:18:30Z</dc:date>
    </item>
    <item>
      <title>Re: Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14326#M1481</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;You might want to take a look at vscsistats and/or the nfstop tool to get an idea of which VMs are creating the highest workload.&amp;nbsp; Check out &lt;A href="http://www.yellow-bricks.com/2009/12/17/vscsistats-output-in-esxtop-format/" target="_blank"&gt;this post on Yellow-Bricks&lt;/A&gt;.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 18 Feb 2010 14:29:20 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14326#M1481</guid>
      <dc:creator>forgette</dc:creator>
      <dc:date>2010-02-18T14:29:20Z</dc:date>
    </item>
    <item>
      <title>Re: Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14331#M1483</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;with which FAS you are working with?&lt;/P&gt;&lt;P&gt;how many hosts do you have?&lt;/P&gt;&lt;P&gt;how many vms do you have?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I did a short look over your sysstat output and read the following:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;- row 2 needs to be shifted 4 columns to the right&lt;/P&gt;&lt;P&gt;- very high CPU load (more than 95%)&lt;/P&gt;&lt;P&gt;- NFS only&lt;/P&gt;&lt;P&gt;- very high Net in and out (in: 701142 KB/s, out: 484975 KB/s)&lt;/P&gt;&lt;P&gt;- very high Disk read and write (read: 991328 KB/s, write: 886479 KB/s)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;are you sure that these numbers are correct?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Supposed that's correct, i think your netapp is overloaded or undersized&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Erich&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 18 Feb 2010 16:03:33 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14331#M1483</guid>
      <dc:creator>vexperts</dc:creator>
      <dc:date>2010-02-18T16:03:33Z</dc:date>
    </item>
    <item>
      <title>Re: Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14336#M1485</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Are you hosting the VMs on LUNs? Over NFS?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If you're curious about which ones are causing the most pain, collecting a perfstat and working with NetApp Support will be your best bet.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 18 Feb 2010 18:06:01 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14336#M1485</guid>
      <dc:creator>parisi</dc:creator>
      <dc:date>2010-02-18T18:06:01Z</dc:date>
    </item>
    <item>
      <title>Re: Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14341#M1487</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;FAS6080&lt;/P&gt;&lt;P&gt;We have 42 ESX Hosts&lt;/P&gt;&lt;P&gt;We have approx 1500 Hosts&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I think the filer is overloaded yes, but i beleive it is because of the number of partial writes that are occuring. If all VMs (especially the highest IO ones) were aligned I would think the filer could handle the given workload quite comfortably.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;We are utilising NFS datastores.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have engaged NetApp support but i am asking here to try and get some information from people that may have experianced VM alignment problems before.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Another Note:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have written a script to poll the filer every 15mins and get pw.over_limit stat from wafl_susp -w. I have found at times this number grows by 3000 counts /s. See attached graph over_limt. These large spikes correspond to when we see massive latency jumps on our filers (4am everyday). We are still trying to work out what happens at this time to cause this massive IO spike (and subsequent latency spike), but i still beleive the root cause is unaligned VMs. Any comments appreciated.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 22 Feb 2010 02:33:42 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14341#M1487</guid>
      <dc:creator>braidenjudd</dc:creator>
      <dc:date>2010-02-22T02:33:42Z</dc:date>
    </item>
    <item>
      <title>Re: Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14347#M1490</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;We have the exact same problem on our IBM Nseries (rebranded Netapp)&lt;/P&gt;&lt;P&gt;All our ESX hosts are using FC and allmost all of our 1000+ virtual servers are unaligned.. We have approx. 60 mill pw.over_limit every 24 hours &lt;SPAN __jive_emoticon_name="sad" __jive_macro_name="emoticon" class="jive_macro jive_emote" src="https://community.netapp.com/4.0.6/images/emoticons/sad.gif"&gt;&lt;/SPAN&gt; and our latency is going from a few ms to more than a second if someone is doing excessive writes.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Recently we started aligning the virtual servers using software from VisionCore, -but it's a very time consuming process and we expect to use the next 6-12 months aligning.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;We qualified the top writers (LUNS) using Operations manager and our ESX guru found the virtual servers using the most busy luns.&lt;/P&gt;&lt;P&gt;We are about 10% done, but haven't seen any major improvements yet - but we're still optimistic..!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN __jive_emoticon_name="grin" __jive_macro_name="emoticon" class="jive_macro jive_emote" src="https://community.netapp.com/4.0.6/images/emoticons/grin.gif"&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;/Henrik&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 27 Apr 2010 13:29:09 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14347#M1490</guid>
      <dc:creator>heg</dc:creator>
      <dc:date>2010-04-27T13:29:09Z</dc:date>
    </item>
    <item>
      <title>Re: Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14353#M1492</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Here's our TR on how to align filesystems&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="http://media.netapp.com/documents/tr-3747.pdf" target="_blank"&gt;http://media.netapp.com/documents/tr-3747.pdf&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Hope this helps.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;Amrita&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 29 Apr 2010 09:24:50 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14353#M1492</guid>
      <dc:creator>amritad</dc:creator>
      <dc:date>2010-04-29T09:24:50Z</dc:date>
    </item>
    <item>
      <title>Re: Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14357#M1494</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi, this thread shows up on the top of "Netapp Latency Spikes" searches.&lt;/P&gt;&lt;P&gt;We have a 3040 cluster hosting 11 vSphere hosts with 200 VMs on NFS datastores.&lt;/P&gt;&lt;P&gt;We see latency spikes 3-4 times a month as reported by Operations Manager.&lt;/P&gt;&lt;P&gt;We hoped our upgrade from 7.3.1.1 last week to 7.3.3 would help, but we had another spike up to 1 second take out a NFS mount and all several of the VMs on Saturday.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;We previously&amp;nbsp; determined the High &amp;amp; medium IO VMs and either aligned them or migrated them to local disk - has NOT helped - still getting the spikes.&lt;/P&gt;&lt;P&gt;I have another case opened with Netapp.&lt;/P&gt;&lt;P&gt;Following the notes in this thread, I ran the wafl_susp -w to check the pw.over_limit&lt;/P&gt;&lt;P&gt;Turns out ours is ZERO (is it relevant to NFS?)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I suspect an internal Netapp process is responsible for these (dedup?) - we had it disabled on 7.3.1.1 - 7.3.3 was supposed to fix this (we re-enabled de-dup after the upgrade)&lt;/P&gt;&lt;P&gt;And the latency spike outages are back &lt;SPAN __jive_emoticon_name="sad" __jive_macro_name="emoticon" class="jive_macro jive_emote" src="https://community.netapp.com/4.0.6/images/emoticons/sad.gif"&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Will share any info from the case&lt;/P&gt;&lt;P&gt;thanks for any tips,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Fletcher.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 03 May 2010 23:13:06 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14357#M1494</guid>
      <dc:creator>fletch2007</dc:creator>
      <dc:date>2010-05-03T23:13:06Z</dc:date>
    </item>
    <item>
      <title>Re: Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14362#M1496</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;We have seen excessive responsetimes, when the system did aggregate snapshots.. Try comparing the aggr snap schedule to your response time problems..&lt;/P&gt;&lt;P&gt;Our aggr snap problem might be related to the misalignment.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;"snap sched -A"&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;brgds&lt;/P&gt;&lt;P&gt;/henrik&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 10 May 2010 10:22:32 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14362#M1496</guid>
      <dc:creator>heg</dc:creator>
      <dc:date>2010-05-10T10:22:32Z</dc:date>
    </item>
    <item>
      <title>Re: Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14368#M1498</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Henrik, yes we disabled AGGR snapshots over a year ago&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Still searching for an explanation for the spikes&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Sat, 15 May 2010 05:22:38 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14368#M1498</guid>
      <dc:creator>fletch2007</dc:creator>
      <dc:date>2010-05-15T05:22:38Z</dc:date>
    </item>
    <item>
      <title>Re: Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14372#M1500</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Basic question: Do you have at least 20% free space in all aggregates?&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 10 Jun 2010 08:09:53 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14372#M1500</guid>
      <dc:creator>heg</dc:creator>
      <dc:date>2010-06-10T08:09:53Z</dc:date>
    </item>
    <item>
      <title>Re: Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14377#M1503</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;No, we have 90% allocation in most (with aggregate snapshots disabled)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;thanks&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 10 Jun 2010 13:08:49 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14377#M1503</guid>
      <dc:creator>fletch2007</dc:creator>
      <dc:date>2010-06-10T13:08:49Z</dc:date>
    </item>
    <item>
      <title>Re: Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14381#M1505</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;No doubt it's a problem for performance&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 10 Jun 2010 13:20:33 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14381#M1505</guid>
      <dc:creator>heg</dc:creator>
      <dc:date>2010-06-10T13:20:33Z</dc:date>
    </item>
    <item>
      <title>Re: Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14386#M1507</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi, we are experiencing HUGE 1,000,000+ microseconds (1 second+) latency spikes on ONTAP 7.3.3 NFS volumes as reported by NetApp Management Console which is disabling VMware virtual machines (Windows SQL server needs to be rebooted, Linux VMs go into read only mode and need reboots etc)&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;We have a case open with Netapp (2001447643) and the latest analysis of perstat and archive stats from the spikes is being presented to us:&lt;/P&gt;&lt;P&gt;"Version:1.0 StartHTML:0000000149 EndHTML:0000003705 StartFragment:0000000199 EndFragment:0000003671 StartSelection:0000000199 EndSelection:0000003671&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/P&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;&lt;SPAN style="color: #1f497d; font-family: Calibri,Verdana,Helvetica,Arial; "&gt;This data is definitely good.&amp;nbsp; We are seeing the latency.&lt;BR /&gt; Here is what I am seeing on the filer side:&lt;BR /&gt; &lt;/SPAN&gt;&lt;SPAN style="font-size: 10pt; font-family: Courier New; "&gt;Server rpc:&lt;BR /&gt; TCP:&lt;BR /&gt; calls&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; badcalls&amp;nbsp;&amp;nbsp;&amp;nbsp; nullrecv&amp;nbsp;&amp;nbsp; badlen&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; xdrcall&lt;BR /&gt; 232298&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 4294959566&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 4294959566&lt;BR /&gt; &lt;/SPAN&gt;&lt;SPAN style="color: #1f497d; font-family: Calibri,Verdana,Helvetica,Arial; "&gt; &lt;BR /&gt; The NetApp filer is getting a huge number of bad XDR calls, indicating that the filer is unable to read the NFS headers.&lt;BR /&gt;&amp;nbsp; &lt;BR /&gt; We cannot determine at this time what the source of these bad calls is.&lt;BR /&gt; Some of the worst offending volumes during this period, regarding latency appear to be:&lt;BR /&gt;&amp;nbsp; &lt;BR /&gt; Vm64net&lt;BR /&gt; Vm65net&lt;BR /&gt; Vw65net2&lt;BR /&gt; Vm65net3&lt;BR /&gt; Ora64net02&lt;BR /&gt;&amp;nbsp; &lt;BR /&gt; &lt;/SPAN&gt;&lt;SPAN style="font-family: Calibri,Verdana,Helvetica,Arial;"&gt; &lt;BR /&gt;&amp;nbsp;&amp;nbsp; Time&amp;nbsp;&amp;nbsp; Time Delta&amp;nbsp;&amp;nbsp; Volume&amp;nbsp;&amp;nbsp; Parent Aggr&amp;nbsp;&amp;nbsp; Total Op/s&amp;nbsp;&amp;nbsp; Avg Lat (µs)&amp;nbsp;&amp;nbsp; Read Op/s&amp;nbsp;&amp;nbsp; Read Data (B/s)&amp;nbsp;&amp;nbsp; Read Lat (µs)&amp;nbsp;&amp;nbsp; Write Op/s&amp;nbsp;&amp;nbsp; Write Data (B/s)&amp;nbsp;&amp;nbsp; Write Lat (µs)&amp;nbsp;&amp;nbsp; Other Op/s&amp;nbsp;&amp;nbsp; Other Lat (µs)&amp;nbsp;&amp;nbsp; &lt;BR /&gt;&amp;nbsp;&amp;nbsp; Tue Jun 15 17:45:46 UTC 2010&amp;nbsp;&amp;nbsp; 0.00&amp;nbsp;&amp;nbsp; vm64net&amp;nbsp;&amp;nbsp; aggr1&amp;nbsp;&amp;nbsp; 311.00&amp;nbsp;&amp;nbsp; 6,981,691.95&amp;nbsp;&amp;nbsp; 35.00&amp;nbsp;&amp;nbsp; 336,402.00&amp;nbsp;&amp;nbsp; 1,336,540.48&amp;nbsp;&amp;nbsp; 267.00&amp;nbsp;&amp;nbsp; 1,561,500.00&amp;nbsp;&amp;nbsp; 7,953,614.68&amp;nbsp;&amp;nbsp; 8.00&amp;nbsp;&amp;nbsp; 4.29&amp;nbsp;&amp;nbsp; &lt;BR /&gt;&amp;nbsp;&amp;nbsp; Tue Jun 15 17:45:46 UTC 2010&amp;nbsp;&amp;nbsp; 0.00&amp;nbsp;&amp;nbsp; vm65net&amp;nbsp;&amp;nbsp; aggr1&amp;nbsp;&amp;nbsp; 115.00&amp;nbsp;&amp;nbsp; 6,283,673.41&amp;nbsp;&amp;nbsp; 0.00&amp;nbsp;&amp;nbsp; 2,453.00&amp;nbsp;&amp;nbsp; 38,863.33&amp;nbsp;&amp;nbsp; 107.00&amp;nbsp;&amp;nbsp; 1,441,475.00&amp;nbsp;&amp;nbsp; 6,714,803.11&amp;nbsp;&amp;nbsp; 6.00&amp;nbsp;&amp;nbsp; 12.41&amp;nbsp;&amp;nbsp; &lt;BR /&gt;&amp;nbsp;&amp;nbsp; Tue Jun 15 17:45:46 UTC 2010&amp;nbsp;&amp;nbsp; 0.00&amp;nbsp;&amp;nbsp; vm65net3&amp;nbsp;&amp;nbsp; aggr1&amp;nbsp;&amp;nbsp; 292.00&amp;nbsp;&amp;nbsp; 3,481,462.35&amp;nbsp;&amp;nbsp; 14.00&amp;nbsp;&amp;nbsp; 110,824.00&amp;nbsp;&amp;nbsp; 1,390,729.32&amp;nbsp;&amp;nbsp; 263.00&amp;nbsp;&amp;nbsp; 1,582,710.00&amp;nbsp;&amp;nbsp; 3,780,725.12&amp;nbsp;&amp;nbsp; 14.00&amp;nbsp;&amp;nbsp; 6.82&amp;nbsp;&amp;nbsp; &lt;BR /&gt;&amp;nbsp;&amp;nbsp; Tue Jun 15 17:45:46 UTC 2010&amp;nbsp;&amp;nbsp; 0.00&amp;nbsp;&amp;nbsp; ora64net02&amp;nbsp;&amp;nbsp; aggr1&amp;nbsp;&amp;nbsp; 17.00&amp;nbsp;&amp;nbsp; 3,280,731.47&amp;nbsp;&amp;nbsp; 5.00&amp;nbsp;&amp;nbsp; 92,421.00&amp;nbsp;&amp;nbsp; 4,776.50&amp;nbsp;&amp;nbsp; 7.00&amp;nbsp;&amp;nbsp; 24,536.00&amp;nbsp;&amp;nbsp; 7,710,536.08&amp;nbsp;&amp;nbsp; 4.00&amp;nbsp;&amp;nbsp; 2.77&amp;nbsp;&amp;nbsp; &lt;BR /&gt;&amp;nbsp;&amp;nbsp; Tue Jun 15 17:45:46 UTC 2010&amp;nbsp;&amp;nbsp; 0.00&amp;nbsp;&amp;nbsp; vm65net2&amp;nbsp;&amp;nbsp; aggr1&amp;nbsp;&amp;nbsp; 315.00&amp;nbsp;&amp;nbsp; 2,838,381.82&amp;nbsp;&amp;nbsp; 11.00&amp;nbsp;&amp;nbsp; 56,383.00&amp;nbsp;&amp;nbsp; 22,902.19&amp;nbsp;&amp;nbsp; 287.00&amp;nbsp;&amp;nbsp; 1,805,548.00&amp;nbsp;&amp;nbsp; 3,105,157.06&amp;nbsp;&amp;nbsp; 15.00&amp;nbsp;&amp;nbsp; 21.81&amp;nbsp; &lt;BR /&gt; &lt;SPAN style="color: #1f497d;"&gt; &lt;BR /&gt; Out best bet to track down the source of the bad calls would be to capture a packet trace from the filer when this issue is occurring."&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/BLOCKQUOTE&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;My &lt;SPAN style="font-family: Calibri,Verdana,Helvetica,Arial;"&gt;points I’d like to clarify:&lt;BR /&gt; &lt;/SPAN&gt;&lt;OL&gt;&lt;LI&gt;&lt;SPAN style="font-family: Calibri,Verdana,Helvetica,Arial;"&gt;What is a bad XDR call and why are they relevant to the latency spike? &lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN style="font-family: Calibri,Verdana,Helvetica,Arial;"&gt;“&lt;SPAN style="color: #1e487c;"&gt;indicating that the filer is unable to read the NFS headers” - need you to clarify and expand &lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN style="color: #1e487c; font-family: Calibri,Verdana,Helvetica,Arial; "&gt;We saw another smaller spike around 2:30am today: &lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN style="color: #1e487c; font-family: Calibri,Verdana,Helvetica,Arial; "&gt;&lt;IMG src="cid:3359611684_123408225" /&gt;&lt;/SPAN&gt;&lt;SPAN style="color: #1e487c; font-family: Calibri,Verdana,Helvetica,Arial; "&gt;These are all volumes on aggregate aggr1 (10K RPM disks)&amp;nbsp; is this an overloaded (IOPS-wise) AGGR issue?&lt;/SPAN&gt;&lt;/LI&gt;&lt;/OL&gt; &lt;/BLOCKQUOTE&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;&lt;SPAN style="color: #1f497d; font-family: Calibri,Verdana,Helvetica,Arial; "&gt; We can't currently predict when these spikes in latency occur - they are random - so getting a packet capture of a random event does not seem feasible...&lt;/SPAN&gt;&lt;/BLOCKQUOTE&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;&lt;SPAN style="color: #1f497d; font-family: Calibri,Verdana,Helvetica,Arial; "&gt;Any insight is welcome - we are in major pain with this for months now&lt;/SPAN&gt;&lt;/BLOCKQUOTE&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;&lt;SPAN style="color: #1f497d; font-family: Calibri,Verdana,Helvetica,Arial; "&gt;thanks&lt;BR /&gt;&lt;/SPAN&gt;&lt;/BLOCKQUOTE&gt;&lt;BLOCKQUOTE class="jive-quote"&gt;&lt;SPAN style="color: #1f497d; font-family: Calibri,Verdana,Helvetica,Arial; "&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/BLOCKQUOTE&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 17 Jun 2010 16:31:01 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14386#M1507</guid>
      <dc:creator>fletch2007</dc:creator>
      <dc:date>2010-06-17T16:31:01Z</dc:date>
    </item>
    <item>
      <title>Re: Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14392#M1509</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;After I took apart the 6800+ IOPS on the problem aggregate the issue turned out to be we were hitting physical limitations of the 10K RPM disks.&lt;/P&gt;&lt;P&gt;Further analysis (surprisingly) revealed about 50% of these IOPS were snapmirror related.&lt;/P&gt;&lt;P&gt;We rescheduled the snapmirrors to reduce this and have said goodbye to the latency spikes.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If interested in the details, please see:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;A class="jive-link-external-small" href="http://www.vmadmin.info/2010/07/vmware-and-netapp-deconstructing.html" target="_blank"&gt;http://www.vmadmin.info/2010/07/vmware-and-netapp-deconstructing.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I want to thank Netapp support - especially Errol Fouquet for his expertise pulling apart this problem and help isolating the cause.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Fletcher.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 15 Jul 2010 17:29:02 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14392#M1509</guid>
      <dc:creator>fletch2007</dc:creator>
      <dc:date>2010-07-15T17:29:02Z</dc:date>
    </item>
    <item>
      <title>Re: Partial Writes with VMWare ESX hosts</title>
      <link>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14398#M1511</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;A quick followup outlining how we currently quantify the misalignment issue:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;A class="jive-link-external-small" href="http://www.vmadmin.info/2010/07/quantifying-vmdk-misalignment.html" target="_blank"&gt;http://www.vmadmin.info/2010/07/quantifying-vmdk-misalignment.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Cheers&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 16 Jul 2010 21:26:22 GMT</pubDate>
      <guid>https://community.netapp.com/t5/VMware-Solutions-Discussions/Partial-Writes-with-VMWare-ESX-hosts/m-p/14398#M1511</guid>
      <dc:creator>fletch2007</dc:creator>
      <dc:date>2010-07-16T21:26:22Z</dc:date>
    </item>
  </channel>
</rss>

