I have a netapp FAS 3020 box with dual controller , and they are in cluster mode.
The issue i am facing with it is that when i give halt cmd from one controller it gets stuck in cifs termination, cifs termination dosenot happen and because of that the other controller does not takeover its partner.
but when i give manually cmd of cf takeover the takeover happens.
With hardware issues or network disconnected I have seen cifs not terminate... once with an HBA that was flakey and another with no network cifs would hang in a process on termination...any errors on the console or messages with cifs locked in step with a number? There was also a BURT on an older version of ONTAP where you had to halt -d dumpname to get cifs to terminate and dump core...don't have the burt # now but am pretty sure it was a release a year or more ago.
- when i do cifs terminate the number of cifs sessions does not decrease.
-network is ok i can ping DCs and by both IP and name.
-I have not check "cifs testdc" i will check it , but i don't think it is going to help .
-No any type of error message is not generated.
I want to give little bit more information which i sawed when i was manually performing the "cf takeover", when takeover happend , and during "cf giveback" when i used the "cifs terminate" cmd it terminated the cifs sessions. and this happend on both the head during takeover the cifs terminate cmd works , but in normal mode cifs terminate does not works.
Here are the simlar burts where "halt -d" is used to dump core and force cifs to terminate... I saw another case where a bad FC HBA caused issues and cifs gave an odd error on the current process it was in (step 120 of 200 or something) on the termination... so could be a hardware issue too.