<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Issue with Grafana/Graphite and QOS counters? in Active IQ Unified Manager Discussions</title>
    <link>https://community.netapp.com/t5/Active-IQ-Unified-Manager-Discussions/Issue-with-Grafana-Graphite-and-QOS-counters/m-p/114597#M20338</link>
    <description>&lt;P&gt;I think I understand.&amp;nbsp; The customer uses a script that searches for clones older than 45 days and applies the policy to them.&amp;nbsp; They also have another script that reclaims certain clones base on age too.&amp;nbsp; I will verify exactly when these scripts are executed.&amp;nbsp; Perhaps that's the answer.&amp;nbsp;cDOT release 8.2.3&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 13 Jan 2016 05:30:33 GMT</pubDate>
    <dc:creator>James_Castro</dc:creator>
    <dc:date>2016-01-13T05:30:33Z</dc:date>
    <item>
      <title>Issue with Grafana/Graphite and QOS counters?</title>
      <link>https://community.netapp.com/t5/Active-IQ-Unified-Manager-Discussions/Issue-with-Grafana-Graphite-and-QOS-counters/m-p/114542#M20321</link>
      <description>&lt;P&gt;Hoping someone has seen this before or could give me some tips in troubleshooting it.&amp;nbsp; Please view the screenshot below.&amp;nbsp; I am trying to figure out why ome of my QOS policies show a huge spike in the number of OPs.&amp;nbsp; Below shows it spiking to 1.5 million ops.&amp;nbsp; This can't be right.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="weirdness.jpg" style="width: 999px;"&gt;&lt;img src="https://community.netapp.com/t5/image/serverpage/image-id/4583i6D56AE6C25FB5051/image-size/large?v=v2&amp;amp;px=999" role="button" title="weirdness.jpg" alt="weirdness.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Jun 2025 22:21:24 GMT</pubDate>
      <guid>https://community.netapp.com/t5/Active-IQ-Unified-Manager-Discussions/Issue-with-Grafana-Graphite-and-QOS-counters/m-p/114542#M20321</guid>
      <dc:creator>James_Castro</dc:creator>
      <dc:date>2025-06-04T22:21:24Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Grafana/Graphite and QOS counters?</title>
      <link>https://community.netapp.com/t5/Active-IQ-Unified-Manager-Discussions/Issue-with-Grafana-Graphite-and-QOS-counters/m-p/114570#M20326</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.netapp.com/t5/user/viewprofilepage/user-id/42362"&gt;@James_Castro﻿&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Data ONTAP counters should be monotonically increasing, meaning they only go up. &amp;nbsp;Harvest basically takes the value at T1, waits a bit, and again at T2, and then calcs T2-T1 to get the rate of change (and more fancy stuff depending on the counter type). &amp;nbsp;If T2-T1 is negative then it was not monotonically increasing and Harvest assumes a counter reset occurred (reboot of node, max int size reached, etc) and skips the data point and uses that new value as the base for the next iteration. &amp;nbsp;The logic Harvest uses is the same as any perf API calling app would use.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have seen some scenarios&amp;nbsp;where the counter is not monotonically increasing but where a reset didn't occur. &amp;nbsp;One of them is with counter aggregations introduced in cDOT. &amp;nbsp;With an aggegrated counter you can have something like volume:node object which is a summary of all volumes on a node. &amp;nbsp;If you poll that object and get the values, then one volume goes offline and you poll again you might get a decreasing value because that vol's counters are not in the aggregation. &amp;nbsp;So you skip posting the data and now have a new, lower counter value as your base for the next comparison. &amp;nbsp;Then the volume comes online again, and you poll again, and&amp;nbsp;now that volume is included in the calc resulting in an apparent spike in the counter. &amp;nbsp;The spike would be equivalent to every 'tick' of that counter on that volume since it was created. &amp;nbsp;I saw this with snapmirror destination volumes because as part of the 'jumpahead' after each update the volume is taken offline momentarily. &amp;nbsp;If during that moment a poll occurs the volume is not included in the aggregation and on the next poll when it is you get the spike. &amp;nbsp;I opened a bug&amp;nbsp;899768 on this if you want to request a fix.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;From the screenshot though it looks like you have a policy group and from the naming flexclones are involved. &amp;nbsp;So not an aggregation issue like the bug I already opened. &amp;nbsp;Maybe flexclones inherit counters values on creation that causes us to see a spike? &amp;nbsp;Or maybe somehow a destroy and creation again causes it? I would check what activity happened on a volume in that policy group at that point in time. &amp;nbsp;If you provide what happened, and the DOT release, I can try to reproduce and determine root cause.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cheers,&lt;BR /&gt;Chris Madden&lt;/P&gt;&lt;P&gt;Storage Architect, NetApp EMEA (and author of Harvest)&lt;/P&gt;&lt;P&gt;Blog:&amp;nbsp;&lt;A href="http://blog.pkiwi.com/" target="_blank"&gt;It all begins with data&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;P.S.&amp;nbsp; Please select “Options” and then “Accept as Solution” if this response answered your question so that others will find it easily!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 12 Jan 2016 15:48:09 GMT</pubDate>
      <guid>https://community.netapp.com/t5/Active-IQ-Unified-Manager-Discussions/Issue-with-Grafana-Graphite-and-QOS-counters/m-p/114570#M20326</guid>
      <dc:creator>madden</dc:creator>
      <dc:date>2016-01-12T15:48:09Z</dc:date>
    </item>
    <item>
      <title>Re: Issue with Grafana/Graphite and QOS counters?</title>
      <link>https://community.netapp.com/t5/Active-IQ-Unified-Manager-Discussions/Issue-with-Grafana-Graphite-and-QOS-counters/m-p/114597#M20338</link>
      <description>&lt;P&gt;I think I understand.&amp;nbsp; The customer uses a script that searches for clones older than 45 days and applies the policy to them.&amp;nbsp; They also have another script that reclaims certain clones base on age too.&amp;nbsp; I will verify exactly when these scripts are executed.&amp;nbsp; Perhaps that's the answer.&amp;nbsp;cDOT release 8.2.3&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 13 Jan 2016 05:30:33 GMT</pubDate>
      <guid>https://community.netapp.com/t5/Active-IQ-Unified-Manager-Discussions/Issue-with-Grafana-Graphite-and-QOS-counters/m-p/114597#M20338</guid>
      <dc:creator>James_Castro</dc:creator>
      <dc:date>2016-01-13T05:30:33Z</dc:date>
    </item>
  </channel>
</rss>

