<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Node resources over-utilized in Active IQ Unified Manager Discussions</title>
    <link>https://community.netapp.com/t5/Active-IQ-Unified-Manager-Discussions/Node-resources-over-utilized/m-p/130749#M23663</link>
    <pubDate>Wed, 03 May 2017 14:35:31 GMT</pubDate>
    <dc:creator>colsen</dc:creator>
    <dc:date>2017-05-03T14:35:31Z</dc:date>
    <item>
      <title>Node resources over-utilized</title>
      <link>https://community.netapp.com/t5/Active-IQ-Unified-Manager-Discussions/Node-resources-over-utilized/m-p/130747#M23662</link>
      <description>&lt;P&gt;I have been getting the following message off and on for about the last two weeks.&amp;nbsp; When I looked at “OnCommand System Manager”, everything looked green.&amp;nbsp; Today I looked into the charts more closely and could see where the utilization exceeded 100%.&amp;nbsp; I asked my local VAR, and he said I was probably overcommitting my deduped volumes and that if I needed to rehydrate a “VMDK” I would not have enough room.&amp;nbsp; My cluster is connected to and used as the datastore for a cluster of five (5) VMware hosts, and all the guests use thin provisioning.&amp;nbsp; My question is: how do I determine which resource is being over-utilized?&amp;nbsp; I have plenty of unused space, so I can increase one of the volumes if it needs more space.&amp;nbsp; If it is a different resource, how do I identify which one?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Warning System-defined Threshold Event&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Summary&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Trigger Time: 12:29 am, 3 May CDT&lt;/P&gt;&lt;P&gt;Description: 1 new warning system-defined threshold(s) breached on Cluster s00nacluster1&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://192.168.16.154/27c3f2f8-959b-44d7-81b9-511233684c13/#/events/784" target="_blank"&gt;p-sdt-s00nacluster1-nod-784&lt;/A&gt;&amp;nbsp; Policy Name: Node resources over-utilized&lt;/P&gt;&lt;P&gt;Perf. Capacity Used value of 159% on s00nacluster1-01 has triggered a WARNING event based on threshold setting of 100%.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="Node Utilization Chart" style="width: 999px;"&gt;&lt;img src="https://community.netapp.com/t5/image/serverpage/image-id/7174i5A27228DA672D6A8/image-size/large?v=v2&amp;amp;px=999" role="button" title="Node Utilization Chart" alt="Node Utilization Chart" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Jun 2025 15:07:39 GMT</pubDate>
      <guid>https://community.netapp.com/t5/Active-IQ-Unified-Manager-Discussions/Node-resources-over-utilized/m-p/130747#M23662</guid>
      <dc:creator>bghanson</dc:creator>
      <dc:date>2025-06-04T15:07:39Z</dc:date>
    </item>
    <item>
      <title>Re: Node resources over-utilized</title>
      <link>https://community.netapp.com/t5/Active-IQ-Unified-Manager-Discussions/Node-resources-over-utilized/m-p/130749#M23663</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The first place to look is at your "scheduled tasks" (e.g. snapmirrors, scheduled snaps, dedupes). &amp;nbsp;See if any of those events (or multiple events) are occurring when you're seeing your spikes. &amp;nbsp;We had a few scenarios where our dedupes were kicking off on top of our snap-then-mirror schedules, which was throwing the spindles and cores into the "over-utilized" zone.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If nothing like that jumps out, you'll need to see if there's an obvious pattern to the over-utilized spikes. &amp;nbsp;If convenient (i.e. not 2AM), jump on the console and run a &lt;EM&gt;statit&lt;/EM&gt;&amp;nbsp;at the node level during the period the spike would normally show up. &amp;nbsp;If you have the time, run a&amp;nbsp;&lt;EM&gt;sysstat -M&lt;/EM&gt; to see what your cores are doing during this time.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;If none of that turns up the smoking gun and/or it's just a pain to do that kind of gather due to time/inconsistency, open up a case and look at running a perfstat. &amp;nbsp;Here's a very useful link I got from Daniel Savino for doing a long-running perfstat:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A href="https://kb.netapp.com/support/s/article/ka31A00000012gpQAA/how-to-collect-performance-statistics-for-intermittent-issues?language=en_US" target="_blank"&gt;https://kb.netapp.com/support/s/article/ka31A00000012gpQAA/how-to-collect-performance-statistics-for-intermittent-issues?language=en_US&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The GUI tool is pretty nice - much easier for a Windoze guy like me to figure out. &amp;nbsp;The TSE you work with might also be able to get some good information from the ASUP performance gather (if you're generating full/HTTPS ASUPs).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Finally (and maybe even first), look at your main workloads and see if your latency is straying out of what you'd consider to be acceptable ranges. &amp;nbsp;We have a couple of nodes that regularly report a few periods of over-utilization, but those periods tend to be after-hours (our snapmirror times), and our user I/O stays in the &amp;lt;5ms latency range even at those times, so we've honestly stopped worrying about it.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hope that helps,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Chris&lt;/P&gt;</description>
      <pubDate>Wed, 03 May 2017 14:35:31 GMT</pubDate>
      <guid>https://community.netapp.com/t5/Active-IQ-Unified-Manager-Discussions/Node-resources-over-utilized/m-p/130749#M23663</guid>
      <dc:creator>colsen</dc:creator>
      <dc:date>2017-05-03T14:35:31Z</dc:date>
    </item>
    <item>
      <title>Re: Node resources over-utilized</title>
      <link>https://community.netapp.com/t5/Active-IQ-Unified-Manager-Discussions/Node-resources-over-utilized/m-p/130750#M23664</link>
      <description>&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Wed, 03 May 2017 14:43:16 GMT</pubDate>
      <guid>https://community.netapp.com/t5/Active-IQ-Unified-Manager-Discussions/Node-resources-over-utilized/m-p/130750#M23664</guid>
      <dc:creator>bghanson</dc:creator>
      <dc:date>2017-05-03T14:43:16Z</dc:date>
    </item>
  </channel>
</rss>

