ONTAP Discussions

Netapp Web GUI Admin and SNMP Monitoring for Displaying Critical Warning Status

edlam2000
3,018 Views

We tested the NetApp Storage by shutting down one of its Controller Power Supply and/or its Disk Shelf Power Supply, and we find that sometimes the Web GUI and its SNMP can missed out reflecting the critical warning status properly, and only its Console Command is working properly to display the Critical Warnining status.

 

Our Storage is NetAapp 8.3.1P1.

 

Could there be any latency of the Web GUI and its SNMP to reflect the latest Critical Warning Status, OR the Console Command is a better way to do the Monitoring????

 

Please advise, thanks.

 

Also, we find the following Recent CRITICAL Error in the Log, but seems it can't be showed up by Web GUI / SNMP / Console Command:-

 

YCKPLFR02::> event log show -severity CRITICAL
Time Node Severity Event
------------------- ---------------- ------------- ---------------------------
11/15/2016 14:00:00 YCKPLFR02-C01 CRITICAL monitor.shelf.fault: Fault reported on disk storage shelf attached to channel 0a. Check fans, power supplies, disks, and temperature sensors.
11/15/2016 14:00:00 YCKPLFR02-C02 CRITICAL monitor.shelf.fault: Fault reported on disk storage shelf attached to channel 0a. Check fans, power supplies, disks, and temperature sensors.
11/15/2016 13:00:00 YCKPLFR02-C01 CRITICAL monitor.shelf.fault: Fault reported on disk storage shelf attached to channel 0a. Check fans, power supplies, disks, and temperature sensors.
11/15/2016 13:00:00 YCKPLFR02-C02 CRITICAL monitor.shelf.fault: Fault reported on disk storage shelf attached to channel 0a. Check fans, power supplies, disks, and temperature sensors.
11/15/2016 12:00:00 YCKPLFR02-C01 CRITICAL monitor.shelf.fault: Fault reported on disk storage shelf attached to channel 0a. Check fans, power supplies, disks, and temperature sensors.
11/15/2016 12:00:00 YCKPLFR02-C02 CRITICAL monitor.shelf.fault: Fault reported on disk storage shelf attached to channel 0a. Check fans, power supplies, disks, and temperature sensors.
11/15/2016 11:40:00 YCKPLFR02-C01 CRITICAL monitor.globalStatus.critical: There are not enough spare disks. Power Supply Status Critical: PSU1. Disk shelf fault.
11/15/2016 11:39:00 YCKPLFR02-C01 CRITICAL monitor.globalStatus.critical: There are not enough spare disks. Disk shelf fault.
11/15/2016 11:34:01 YCKPLFR02-C01 CRITICAL hm.alert.raised: detailed_info="Alert Id = CriticalPSUFruFaultAlert , Alerting Resource = PSD041154303986", monitor="chassis", alert_id="CriticalPSUFruFaultAlert", alerting_resource="PSD041154303986"
11/15/2016 11:23:37 YCKPLFR02-C01 CRITICAL callhome.hm.alert.major: Call home for Health Monitor process nchm: DualPathToDiskShelf_Alert[50:0a:09:80:03:83:62:17].
11/15/2016 11:23:00 YCKPLFR02-C01 CRITICAL monitor.globalStatus.critical: There are not enough spare disks. Power Supply Status Critical: PSU1. Disk shelf fault.
11/15/2016 11:23:00 YCKPLFR02-C02 CRITICAL monitor.globalStatus.critical: Power Supply Status Critical: PSU1. Disk shelf fault.
11/15/2016 11:22:28 YCKPLFR02-C01 CRITICAL hm.alert.raised: detailed_info="Alert Id = DualPathToDiskShelf_Alert , Alerting Resource = 50:0a:09:80:03:83:62:17", monitor="node-connect", alert_id="DualPathToDiskShelf_Alert", alerting_resource="50:0a:09:80:03:83:62:17"
11/15/2016 11:22:00 YCKPLFR02-C01 CRITICAL monitor.globalStatus.critical: There are not enough spare disks. Disk shelf fault.
11/15/2016 11:22:00 YCKPLFR02-C02 CRITICAL monitor.globalStatus.critical: Disk shelf fault.
11/15/2016 11:21:49 YCKPLFR02-C01 CRITICAL monitor.shelf.fault: Fault reported on disk storage shelf attached to channel 0a. Check fans, power supplies, disks, and temperature sensors.
11/15/2016 11:21:49 YCKPLFR02-C02 CRITICAL monitor.shelf.fault: Fault reported on disk storage shelf attached to channel 0a. Check fans, power supplies, disks, and temperature sensors.
11/15/2016 11:21:40 YCKPLFR02-C01 CRITICAL ses.status.psError: DS2246 (S/N SHFFG1551000132) shelf 0 on channel 0a power error for Power supply 1: critical status; DC undervoltage. This module is on the rear of the shelf at the bottom left.
11/15/2016 11:21:40 YCKPLFR02-C02 CRITICAL ses.status.psError: DS2246 (S/N SHFFG1551000132) shelf 0 on channel 0b power error for Power supply 1: critical status; DC undervoltage. This module is on the rear of the shelf at the bottom left.
11/14/2016 17:00:00 YCKPLFR02-C01 CRITICAL monitor.shelf.fault: Fault reported on disk storage shelf attached to channel 0a. Check fans, power supplies, disks, and temperature sensors.
11/14/2016 17:00:00 YCKPLFR02-C02 CRITICAL monitor.shelf.fault: Fault reported on disk storage shelf attached to channel 0a. Check fans, power supplies, disks, and temperature sensors.
11/14/2016 16:00:00 YCKPLFR02-C01 CRITICAL monitor.shelf.fault: Fault reported on disk storage shelf attached to channel 0a. Check fans, power supplies, disks, and temperature sensors.
11/14/2016 16:00:00 YCKPLFR02-C02 CRITICAL monitor.shelf.fault: Fault reported on disk storage shelf attached to channel 0a. Check fans, power supplies, disks, and temperature sensors.
11/14/2016 15:40:00 YCKPLFR02-C01 CRITICAL monitor.globalStatus.critical: Power Supply Status Critical: PSU1. Disk shelf fault.
11/14/2016 15:39:00 YCKPLFR02-C01 CRITICAL monitor.globalStatus.critical: Disk shelf fault.
11/14/2016 15:34:11 YCKPLFR02-C01 CRITICAL hm.alert.raised: detailed_info="Alert Id = CriticalPSUFruFaultAlert , Alerting Resource = PSD041154303986", monitor="chassis", alert_id="CriticalPSUFruFaultAlert", alerting_resource="PSD041154303986"
Press <space> to page down, <return> for next line, or 'q' to quit...
26 entries were displayed.

YCKPLFR02::>

1 REPLY 1

Jeff_Yao
2,960 Views

are you referring to system manager? what's the version of your system manager? if possible, upgrade system manager to the latest version and try again?

 

hopefully helps

 

Jeff

Public