I've noticed a disappointing feature of the ONTAP 8.1X45 7-mode simulator. It doesn't handle crashes well (or at all for that matter). If the 7-mode vsim is powered-off abruptly for whatever reason (power outage, VM powered-off, etc...) it cannot recover. When it reboots it goes into endless panic dumps (see screenshot). I lose my vsim configuration along with all my volumes, SV relationships, demos, etc... This is sad because the ONTAP 7.3 simulator handled this situation beautifully. I would run it in a Linux O/S with a journaled file system and both Linux and the 7.3 simulator would recover nicely.
Can anyone explain why it doesn't recovery successfully from crashes? Is there anything I can do to ensure it DOES recover from crashes as they're pretty common when running vsims on a laptop.
The system is a V3140 on ONTAP 8.1 with back-end disks on an IBM DS5300.
Some of our filers have their aggr0 restricted. The wafliron hidden option on the special boot menu seems to do nothing and at reboot, the message "root aggregate or volume was taken offline in SK process rc on release" appears.
For the non-root restricted aggregates, the "aggr wafliron start <aggr>" fails with the message:
toaster*> aggr wafliron start a_toto_00
aggr wafliron start a_toto_00: Neither fsinfo block of aggregate 'a_toto_00' is valid.
toaster*> Mon Aug 27 14:18:04 EDT [toaster:wafl.volinfo.fsinfo.error:ALERT]: Bad Volinfo/Fsinfo magic for aggregate 'a_toto_00'
Mon Aug 27 14:18:04 EDT [toto:wafl.iron.mount.inconsistent.fail:info]: Wafliron could not mark volume a_toto_00 inconsistent as this operation doesn't apply to aggregates/traditional volumes.
A case is open with N... IBM actually but it should be at the NetApp engineers level now.
I have run into the same thing with the 8.1.1 simulator so the issue hasn't been resolved yet. It's not small amount of work that goes into configuring a multi-node cluster, especially if you expand disk capacity, etc.
I would like to hear if there is an advanced scanning feature we can use to correct this, wafl iron, etc.
I create a VMware SnapShot then restore when that happens... not an ideal workaround but works well and lets met have a baseline to setup my VSIMs the way I want and revert back after giving a demo or test.
Please, please, release SIM for ESXi with SCSI controller and easy System ID change procedure. And if you through a bit of performance gain to us, simple mortals, we promise to sacrifice an EMC array every quarter in your name!
Any word from the NetApp dev team on fixing the kernel crashes with v8.1? Also see https://communities.netapp.com/thread/19658 It would be nice to have a stable playground for SRM testing and the new vCenter v4 plugin.
Here is another user who appears to be experiening the exact same issue. He has even tried to bring the aggregate on-line after the crash, but it reports that WAFL is inconsistent and needs to be checked.