We have the OnCommand Plug-in 4.1 for Microsoft SCOM installed in SCOM 2012 R2 UR7 and we are seeing alerting occurring but there are many duplicate alerts for issues and every alert seems to auto-resolve itself after a couple of minutes even for a broken array/bad disk when no action has been taken. After further troubleshooting in Health explorer every monitor is showing a couple of hundred change state events. Every couple of minutes or hours, it goes from a "not monitored" to a "first event raised" which it should not be doing. There should only be one first event raised. It would appear that in the case where there is a valid issue, the monitor resets then redetects the issue and generates a duplicate alert.
Example of alert from below:
"Auto resolved by System"
Date and Time: | 9/30/2015 1:07:37 PM |
State: | Success |
Context: | The monitor has been initialized for the first time or it has exited maintenance mode |
Disk 0c.00.5 state: Online.
RAID state: broken.
Controller xxxxxx
Address: xxx.xxx.xxx.xxx.
Disk UID: xxxxxx:xxxxxxxxxx:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000.