Something just isn't quite right with my cluster. I have a FAS2040. On the top controller e0a and e0b are configured as VIFs, and on the bottom controller e0a and e0b are configured as VIFs as well. Both are set to the same network address.
They plug into a Dell PowerConnect 5524, which supports LACP and LAGs. The port mapping looks like this:
NodeT to 5524
e0a to port 1
e0b to port 2
NodeB to 5524
e0a to port 3
e0b to port 4
Then on the 5524 I set up ports 1, 2, 3, and 4 as Lag1 with LACP, so on the switch they're all grouped together in one LAG. My goal here is to get 2 Gb/s to NodeT or NodeB, whichever is the active node.
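To make the wiring concrete, here is the same layout written out as a small Python sketch. The names (`WIRING`, `LAGS`, `nodes_in_lag`) are just ones I made up for illustration and have nothing to do with the switch or Data ONTAP configuration syntax. It only shows that the single Lag1 spans ports belonging to both controllers.

```python
# Illustrative model of the cabling described above.
# All names here are made up for this sketch, not real switch or filer syntax.
WIRING = {
    ("NodeT", "e0a"): 1,
    ("NodeT", "e0b"): 2,
    ("NodeB", "e0a"): 3,
    ("NodeB", "e0b"): 4,
}

# The one LAG I created on the 5524, containing all four switch ports.
LAGS = {"Lag1": [1, 2, 3, 4]}


def nodes_in_lag(lag_name):
    """Return the sorted list of controllers that have a port in the given LAG."""
    ports = set(LAGS[lag_name])
    return sorted({node for (node, _nic), port in WIRING.items() if port in ports})


print(nodes_in_lag("Lag1"))
```

So as far as the switch is concerned, both controllers sit behind the same aggregated link.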
Now I did some testing.
First I will note that before I had all four ports in Lag1, I had two different LAGs, one for each node, and I was getting TCP/IP conflicts because the two nodes had the same addresses. Putting all four ports into one LAG fixed that, though I'm not sure that was the appropriate solution to the problem.
So I start a ping to the NetApp and get a response. I notice the lights for the two ports connected to Node B are actively blinking, so B must be the active node. For my first test, I initiate a CF Takeover on Node T. Node T becomes the active node with no interruption to the ping. However, I notice an oddity: the switch lights are still very active for Node B, as if the traffic is still on those ports. I expected the Node T lights to become the active ones. Then I do a CF Giveback to hand control back to Node B. Again there is no interruption in the ping, and the switch lights remain active for the ports wired to Node B.
Next I test by restarting Node B. I execute a reboot command on the Node B console without first doing a CF Takeover to hand control over to Node T. My ping dies. But here's the weird thing: the switch lights for Node T now act like they're carrying the ping traffic, yet I get no ping responses. Once Node B comes back up, the ping remains dead and the lights for Node T remain active. It's not until Node B is up and I do a CF Takeover from Node T that my ping starts to respond again. Then I do a CF Giveback on Node T to hand control back to Node B. The ping continues uninterrupted, but the switch link lights for Node T remain active.
Can someone help me understand what I'm seeing here, or what's not working? Do the nodes not automatically take over for each other? Do they rely on me to do a CF Takeover to move control around?