2018-02-28 03:16 PM
Assuming I have two nodes HA cluster, each node has an aggregate, and also owns a spare disk, if one spare is being already used, and the second disk failed, can the other spare owned by the other node be automatically picked up for 2nd failure?
I think the answer is yes, but just to make sure with you.
2018-02-28 05:09 PM
disks are owned by individual nodes (nodes are part of HA).
If the failed disk is owned by NodeA then it will use the spare disk assigned to that node (conditions apply).
Can you please check the ownership of the second spare disk that you have ? Is it owned by node A or node B ? Use the command aggr status -s or disk show -v <disk name>
2018-02-28 05:34 PM
OK. So, the answer is NO, the node can not use the spare one which is owned by the other node.
2nd question: Can a spare one be picked up by any one of aggregtes on the same node should one disk in an aggr failed? assuming disk characterstics are all the same.
2018-02-28 05:46 PM
So, you can use the spare disk from the other node. To do that, you will need to assign ownership of the spare disk from NodeB to Node A. This also means that there are no spares disks available for NodeB.
2nd question : Yes, you are right.
2018-02-28 06:27 PM
Here it comes a touger question, I didn't intend to ask you at beignning.
I have two AFF nodes in the cluster, and each node already has 28 ssd disks(3.8TB) filling up the first raid-group in the aggr. Now, if I am adding one more ssd shelf with 24 disks. I can do:
1. add 12 disks to form 2nd raid-group, in the existing aggr on each node, 4 disks will be used for parity..
2. form a new aggr/raid-group with 24 disks on one of two nodes, 2 disks will be used for parity.
No extra spares needed, because each node already has its own spare. Option 2 will have about 6 TB more than opetion 1.
Which opiton would you go with? I probably would go option 1, only becuase it will balance well, althrough it would loss about 6TB usable space. The only thing is if there are any benifits to leave the entire shelf belong to an aggr, not split it to two different aggrs?
2018-03-01 04:20 AM - edited 2018-03-01 05:07 AM
what information else about the aggr you would like to know?
Using the new whole shelf of ssd to create a new aggr ,or add them into existing aggr, that is the question.
2018-03-01 10:25 PM
Either option is valid.
With SSDs, raidgroup sizing doesn't have to be as uniform as spinning disk, due to latency differences between raidgroups due to different utilization not being as much of an issue. We still want to avoid "hot spindles", but it's a much higher threshhold
2018-03-02 06:08 AM - edited 2018-03-02 06:14 AM
Thanks for such details.
Just to make sure, is it true that we only need one spare on each node, and no need to add any more even after add this new shelf? Including this new shelf, we will have total of 3 and half shelve for this AFF HA pair.
What would be the ratio of how many ssd shares should be configured for how many total of SSD's or shelves?
2018-03-02 09:08 PM
It's up to every admin to decide what they're comfortable with - my usual recommendation is 1+1% per type per controller - so if you have 24 disks, or 72 - 2 spares per controllers are fine, at 144 drives, you probably want 3. But if you're physically close enough to the system to replace drives as soon as they come in, maybe you're happy with 1 per controller, or if it takes 4 weeks to get replacement drives to it because it's on a ship in the middle of the ocean, maybe you want 6.