Did you find a solution to this problem? It seems we have the same problem with one of our customers: slow NDMP to tape, high WAFL_Ex, and suspended aggr scrubs.
WAFL_Ex represents parallel WAFL processing, while Kahuna represents serial WAFL processing. These two logical domains are mutually exclusive: either Kahuna can be active on one CPU, or WAFL_Ex can be active on one or more CPUs, but Kahuna and WAFL_Ex cannot be active at the same time. Depending on the workload, Kahuna can therefore limit the amount of work that WAFL_Ex is able to perform. It is important to note that this type of bottleneck is a simple variation on the previous condition.
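
To make the mutual exclusion concrete, here is a rough back-of-the-envelope sketch in Python. It is a toy model, not anything ONTAP actually computes: the function name `max_wafl_ex_cpu` and both of its parameters are made up for illustration. The only assumption is the one stated above, that WAFL_Ex can run only in the wall-clock time Kahuna is not active, so if Kahuna is busy for a fraction k of the time, WAFL_Ex has at most (1 - k) of the time across however many CPUs it can use.

```python
def max_wafl_ex_cpu(kahuna_busy_fraction: float, cpus_available: int) -> float:
    """Upper bound on WAFL_Ex CPU use, in CPU-equivalents, under the toy model.

    kahuna_busy_fraction: share of wall-clock time Kahuna is active (0.0 to 1.0).
    cpus_available: hypothetical number of CPUs WAFL_Ex could spread across.
    """
    if not 0.0 <= kahuna_busy_fraction <= 1.0:
        raise ValueError("kahuna_busy_fraction must be between 0 and 1")
    if cpus_available < 1:
        raise ValueError("cpus_available must be at least 1")
    # WAFL_Ex can only run in the time that Kahuna leaves free.
    return (1.0 - kahuna_busy_fraction) * cpus_available


if __name__ == "__main__":
    for k in (0.0, 0.25, 0.5, 0.9):
        ceiling = max_wafl_ex_cpu(k, 4)
        print(f"Kahuna busy {k:.0%}: WAFL_Ex ceiling = {ceiling:.2f} of 4 CPUs")
```

The point of the sketch is only that the ceiling for parallel work shrinks in proportion to the time spent in the serial domain, which is why a workload that is heavy in Kahuna can hold back WAFL_Ex even when other CPUs look idle.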