
OnCommand Performance Manager - "Node HA pair over-utilized": what does it mean?

lines_tim

I get these messages from OPM pretty frequently, but I don't understand the metric that triggers the alarm. There's plenty of CPU headroom, NVRAM is flushing nicely, and the disks aren't busy, yet something in the node pair is being over-utilized.

Does anyone know what's being measured?


coreywanless

Hello Tim,

You may already know this, but that particular alert measures the combined utilization of both nodes in the HA pair. It is telling you that if one of the nodes were to fail, you could run into a performance problem until the failure is resolved. Below is an excerpt from the OPM user guide.

Identifies situations where nodes in an HA pair are operating above the bounds of the HA pair operational efficiency. It does this by looking at the CPU and RAM usage for the two nodes in the HA pair. If the combined node utilization of the two nodes exceeds 100%, then a controller failover will impact workload latencies.

Reference: https://library.netapp.com/ecm/ecm_download_file/ECMP12406790
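
As a rough illustration of the rule quoted above (not OPM's actual implementation, which isn't documented in this thread), the check can be sketched in Python. It assumes each node reports a single utilization percentage; the function name and the `threshold_pct` parameter are hypothetical and exist only for this sketch.

```python
def ha_pair_over_utilized(node_a_pct, node_b_pct, threshold_pct=100.0):
    """Return (over_utilized, combined_pct) for an HA pair.

    Assumption: each argument is one node's utilization in percent.
    How OPM derives that figure from CPU and RAM usage is not stated
    in the guide excerpt, so it is treated as a given input here.
    """
    combined_pct = node_a_pct + node_b_pct
    # After a failover the surviving node must carry both workloads,
    # so a combined figure above the threshold means it would be overloaded.
    return combined_pct > threshold_pct, combined_pct


# Neither node looks busy on its own, but together they exceed 100%.
print(ha_pair_over_utilized(55.0, 60.0))
# A pair with enough headroom to absorb a failover.
print(ha_pair_over_utilized(40.0, 45.0))
```

This would explain why the alert can fire even when each node individually looks healthy: two nodes at 55% and 60% have no single hot spot, yet one node could not absorb the other's load.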

 

-Corey

netappmagic

Hello All,

I am trying to modify the threshold value for "Node HA pair over-utilized". Can somebody please tell me how to locate the policy? I want to change it to a different value.

Please help me out.

ruijuan

This is a system-defined threshold and cannot be modified by the user.

niels

RFE: Allow users to override system-defined thresholds.

Overprovisioning of node utilization in HA configurations is especially common, e.g. 120% combined utilization, because reduced performance during HA events is often accepted.

regards, Niels

ruijuan

The RFE was logged internally by QA earlier, and we have received the same feedback from others; it is all captured in one burt. We need to work with PM on this.

netappmagic

Does that mean we cannot modify it for now?
