We have two FAS2050 filers (st10 and st11) configured as an HA pair running ONTAP 7.3.6. Recently one of the filers, st10, broke, and we replaced its hard disks. When we tried to boot the filer, it would not boot and reported that aggr0 (the root partition) has failed. We tried the recovery, diagnostics and scan options, but nothing worked. We enabled takeover on st11; it was able to take over, and we can see st10's partitions from st11. However, we still cannot boot st10, and we have data stored on it. Can you please help us with this issue?
1. When we ran disk show, we found that some of st10's disks are owned by st11. Is there any way we can get the ownership back?
2. Is there any way we can enable takeover and access the st10 volumes through st11 to copy the data?
We need to retrieve the data because the SnapMirror has also failed. Any help would be greatly appreciated, since we have already spent days on this issue and nothing has worked.
From what you've said, it sounds like your data is on a different aggregate of st10, and what has failed is the root aggregate of st10. Additionally, you believe there are a number of disks with incorrect ownership. To resolve this problem you will need to resolve ownership of the disks (it may be unclear because the system is in failover?), recreate aggr0 of st10, reinitialize ONTAP on that controller only, and then reconfigure failover.
This is quite possible; however, it is an advanced activity, and my strong recommendation would be to contact a NetApp reseller in your area and have it done by a professional services consultant to avoid potential problems.
If you wish to attempt it yourself, please post the output from "disk show -a" and "aggr status -r", and please tell us which disks have been replaced, and what you have done to controller st10.
It looks like aggr0 on st10 has suffered a failure of up to 10 disks, which RAID-DP cannot protect against and which is likely to have caused data loss. With 16 disks originally, I am concerned this aggregate did not contain just root data. On st11, can you please run "partner aggr status -r" and post the results? It is possible the aggregate only looks inconsistent on st10 because it does not have all disks visible, but I believe that is unlikely.
Unfortunately, with this many disks (94 by my count?) and a 10-year-old system, I don't feel confident that a forum post is appropriate for resolution.
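To make the RAID-DP concern concrete, here is a rough Python sketch of the arithmetic. RAID-DP tolerates at most two failed disks per RAID group, so if 10 disks have failed, at least one group must hold a large share of them. The number of RAID groups in aggr0 is an assumption here, since the layout has not been posted; the function names are purely illustrative.

```python
# Sketch only: the RAID group count for aggr0 is assumed, not taken from the thread.
RAID_DP_TOLERATED_FAILURES = 2  # RAID-DP survives up to two failed disks per RAID group


def min_worst_group_failures(failed_disks: int, raid_groups: int) -> int:
    """Fewest failures the worst-hit RAID group can have (pigeonhole bound)."""
    return -(-failed_disks // raid_groups)  # integer ceiling division


def could_survive(failed_disks: int, raid_groups: int) -> bool:
    """True if an even spread of failures would stay within RAID-DP limits."""
    return min_worst_group_failures(failed_disks, raid_groups) <= RAID_DP_TOLERATED_FAILURES


if __name__ == "__main__":
    failed = 10  # "up to 10 disks" from the output discussed above
    for groups in (1, 2):  # plausible layouts for a 16-disk aggregate (assumed)
        worst = min_worst_group_failures(failed, groups)
        print(f"{groups} RAID group(s): worst group has at least {worst} failures, "
              f"survivable={could_survive(failed, groups)}")
```

Even in the best case of two RAID groups with the failures spread evenly, one group would have at least five failed disks, well beyond what RAID-DP can rebuild. That is why the "partner aggr status -r" output matters: it shows whether the disks have really failed or are simply not visible to st10.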
Thanks for your reply; we really appreciate your support.
We tried the "Sensor Show" command, and it does display the sensor values. However, the State for every sensor is shown as Unknown. I also queried the BIOS Status sensor and have posted the result below.
Any idea what could be the issue? Thanks again for your quick support.
bmc shell -> sensors search id=#f0
Name          State    Id   Reading  Crit-Low  Warn-Low  Warn-High  Crit-High
-------------------------------------------------------------------------------------
BIOS Status   Unknown  #f0  --       --        --        --         --