Hello,
After upgrading our clusters from ONTAP 9.3P4 to 9.4P3 last week, the node latency data from netapp-harvest in Grafana seems to be unrealistically high (NetApp Dashboard: Cluster -> Highlights -> Latency).
We are using netapp-harvest 1.4.1 without the hotfix, as the link seems to be expired:
https://community.netapp.com/t5/OnCommand-Storage-Management-Software-Discussions/NetApp-Harvest-1-4-1-Hotfix-to-fix-2-bugs/m-p/144160#M26247
We copied cdot-9.3.0.conf to cdot-9.4.0.conf and restarted netapp-harvest's pollers.
On the ONTAP CLI the Latencies of all nodes are constantly within the 100-1500 us range (statistics node show -interval 5 -iterations 50 -max 4), which is much lower than the latencies reported by netapp-harvest.
In the attached picture the Latency increase since the upgrade to 9.4 is clearly visible.
How are these latencies calculated?
There are no entries in the /opt/netapp-harvest/log/*.log file of the cluster, other than NORMAL Poller status messages.
This has been an off-topic discussion in a few other threads, which are marked as solved, which is why I am opening a new one.
Edit: Latencies reported by OCUMs graphs are in line with the values seen on the CLI.
Kind Regards
Joel