ONTAP Discussions

Should I Be Concerned About a Suspended Scrub Status?

TMADOCTHOMAS
4,567 Views

I recently ran the storage aggregate show-scrub-status command on all of our clusters. On all but one cluster the last scrub time was within a week of today, with one exception, and that one was within two weeks. However, on one cluster (a 4-node all AFF cluster), two aggregates had a "-" for Last Scrub Time and was in a suspended state. One was 37% complete and the other was 71% complete. Should I be concerned and run the scrub manually?

1 ACCEPTED SOLUTION

Ontapforrum
4,517 Views

Hi,

 

For AFF system, the default schedule is weekly at 1 a.m. on Sunday for the duration of 6 hours. So, if it extends beyond 6 hrs, ONTAP will suspend it and for non-AFF it is 4 hrs on weekday so any scrubbing that extends beyond that time will be suspended. This is understood.

 

However, the next schedule should have pick it up from suspension or start another run? Don't know!

 

Could be that if raid-scrub is in 'suspended' state, it remains in that state and never picked up again. That's very risky.

 

Could you do one thing:

1) Resume the suspended scrub:
::> storage aggregate scrub -aggregate <aggr> -node <node> -action resume

 

Hopefully, that should kick-off scrubbing from where it left. Bydefault : scrub perf impact is low, which means it is not going to have any negative impact on the storage performance. Also, as it is Friday, so no harm in resuming it.


2) Have a look later on and see if it completes.

 

May be once it is completed that it will pick-up new one (weekly for AFF).

 

https://docs.netapp.com/ontap-9/index.jsp?topic=%2Fcom.netapp.doc.dot-cm-cmpr-960%2Fstorage__aggregate__scrub.html

https://docs.netapp.com/ontap-9/index.jsp?topic=%2Fcom.netapp.doc.dot-cm-cmpr-960%2FTOC__storage__raid-options.html [raid.scrub.schedule]

 

Thanks!

View solution in original post

5 REPLIES 5

Ontapforrum
4,518 Views

Hi,

 

For AFF system, the default schedule is weekly at 1 a.m. on Sunday for the duration of 6 hours. So, if it extends beyond 6 hrs, ONTAP will suspend it and for non-AFF it is 4 hrs on weekday so any scrubbing that extends beyond that time will be suspended. This is understood.

 

However, the next schedule should have pick it up from suspension or start another run? Don't know!

 

Could be that if raid-scrub is in 'suspended' state, it remains in that state and never picked up again. That's very risky.

 

Could you do one thing:

1) Resume the suspended scrub:
::> storage aggregate scrub -aggregate <aggr> -node <node> -action resume

 

Hopefully, that should kick-off scrubbing from where it left. Bydefault : scrub perf impact is low, which means it is not going to have any negative impact on the storage performance. Also, as it is Friday, so no harm in resuming it.


2) Have a look later on and see if it completes.

 

May be once it is completed that it will pick-up new one (weekly for AFF).

 

https://docs.netapp.com/ontap-9/index.jsp?topic=%2Fcom.netapp.doc.dot-cm-cmpr-960%2Fstorage__aggregate__scrub.html

https://docs.netapp.com/ontap-9/index.jsp?topic=%2Fcom.netapp.doc.dot-cm-cmpr-960%2FTOC__storage__raid-options.html [raid.scrub.schedule]

 

Thanks!

TMADOCTHOMAS
4,514 Views

THanks @Ontapforrum . I have kicked off the jobs and will review results on Monday. Hopefully they catch up!

Ontapforrum
4,504 Views

Good stuff...do let us know how it goes eventually ,  yes that should catch up.  Thanks!

TMADOCTHOMAS
4,444 Views

@Ontapforrum the resume command got one of the affected aggregates caught up. The other one caught up partly and then went back to suspended. I'll keep manually resuming on that aggregate until it is fully caught up. All others have a fairly recent date now. Thanks for the help!

Ontapforrum
4,439 Views

Thanks for the update. Great stuff. Take care!

Public