ONTAP Discussions

How to unfail a disk?

DCHEBOTAROV
20,307 Views

Hi

I'm looking for help on how to unfail a disk.

I'm running some test on non-production system (yet) and failed one of the disks.

VCC:::> storage disk fail -disk VCC-01:0a.10.4

(waited for copy to finish and removed failed disk)

Now, trying to put it back online as spare:

VCC::*> storage disk unfail -disk VCC-01:0a.10.4 -s true

Error: command failed: Failed to unfail the disk. Reason: Disk 0a.10.4 not found.

VCC::*> storage disk unfail -disk 0a.10.4 -s true

Error: command failed: Failed to unfail the disk. Reason: Disk 0a.10.4 not found.

but system 'sees' the disk:

VCC::*> storage disk show

                     Usable           Container

Disk                   Size Shelf Bay Type        Position   Aggregate Owner

---------------- ---------- ----- --- ----------- ---------- --------- --------

...

VCC-01:0a.10.4            -    10   4 broken      present    -         -

...

What am I missing?

Thank you.

5 REPLIES 5

MCARDINA221975
20,231 Views

Dmitri,

I am guessing that you are running in cluster mode?  If this is the case, I believe you need to specify the node to which the disk belongs.

Try this : storage disk unfail -disk VCC-01:0a.10.4 -s true

Hope this helps.

-Mike-

MCARDINA221975
20,231 Views

You might also try this :

> storage disk modify -disk VCC-01:0a.10.4 -state unfail

DCHEBOTAROV
20,231 Views

Michael,

Yes, it's 8.2P2 in C-mode, sorry forgot to mention.

VCC::*> storage disk unfail -disk VCC-01:0a.10.4 -s true

Error: command failed: Failed to unfail the disk. Reason: Disk 0a.10.4 not found.

VCC::*> storage disk modify -disk VCC-01:0a.10.4 -state unfail

Error: command failed: Failed to unfail the disk. Reason: Disk not found.

VCC::*> storage disk modify -disk VCC-01:0a.10.4 -state spare

Error: command failed: Failed to unfail the disk. Reason: Disk 0a.10.4 not found.

My goal is to clear 'broken' state and make is spare.

Thank you.

MCARDINA221975
20,231 Views

Very interesting.  Your syntax is correct.  Those commands should have worked.

You could then try to "run" the unfail command against the owning node.

>run -node "nodename" disk unfail -s 0a.10.4

If it still cannot find the disk...maybe you could remove it from the shelf and re-seat it?

-Mike-

MCARDINA221975
20,231 Views

I should have prefaced that last statement. You will need to be in "diag" mode before you can run that command.

1) >set diag

2)) >run -node "nodename" disk unfail -s 0a.10.4

--Mike-

Public