I have a FAS2520 double controller system with external DS4246 disk shelves that I need to perform a complete reset on, and I'm having a series of problems. So far, I've managed to reset the admin password on one controller, and I've attemtped to use boot menu option 4 to issue a 4a command (to zero the disks and get the system into the setup state), but I'm not having any luck. I've also tried the procedure described in KB 1030427.
The serial session on controller 1, after 4a is issued, waits for a while and then shows a message that 'Your boot menu selection, "4a", has been cancelled.'
If I connect a serial session to controller 2, the preboot messages appear to load normally but:
I cannot login using the admin password I reset on controller 1 (password has not synced)
If I try to use the boot menu option 3 to reset the password, it does not succeed and the error message noted below is shown
An error message is displayed on boot, whether I select a boot option or not
The error message I'm seeing on controller 2 is, "[hwhnetapp-02:mgmtgwd.rootvol.recovery.changed:EMERGENCY]: The contents of the root volume might have changed and the local management databases might be out of sync with the replicated databases. This node is not fully operational. Contact technical support to obtain the root volume recovery procedures.", followed by "[hwhnetapp-02:callhome.root.vol.recovery.reqd:EMERGENCY]: Call home for ROOT VOLUME NOT WORKING PROPERLY: RECOVERY REQUIRED."
Of note, Option 4a is not recommended to use anymore. Please use Option 4 from the boot menu when doing a new cluster setup or when repurposing NetApp controllers. Or if considering using Advanced Drive Partitioning (ADP), use Option 9.
Here's more information on the ONTAP Boot Menu options:
Examples: ::>system configuration backup show ::*> system configuration recovery node restore -backup cluster1.8hour.2011-02-22.18_15_00.7z
If the above procedure does not work:1) Reboot the Node, go to option 3) Maitenance mode. >aggr status one by one take each aggregate offline and destory it, all the data aggr first and finally root. (Aggr offline/destroy) >halt <enter>
2) When system reboots, again go to menu option and try '4' erase thing again.
Thanks @Ontapforrum - the aggr offline/destroy option appears to be getting me somwhere - the option 4 command is actually showing progress zeroing the disks in the shelves now. I'll check in again if this still seems weird, and flag your post as a solution if I'm able to bring this system back to the setup state.