That CIFS issue went out in a NetApp email bulletin several weeks ago. I didn't have the timeout disabled, but that wasn't what was crashing the box. I disabled it anyway since I didn't need yet another thing to crash the box.

The crash I am experiencing is an unrecoverable error in communication on the PCI bus. At first you might think it is hardware related, but it happens on multiple controllers and only since ONTAP 8.1. The only thing I can add is what support said after doing core analysis on the issues:

Filer is reporting PCI error NMI on Br(20,0,0) that is responsible for communication to [23/24,0,0] PMC-Sierra SAS Adapter in slot 3. Usually, when the ultimate recipient of a transaction receives a completion packet into the transaction layer, the packet format is checked for violations of the TLP formatting rules. The specification defines that the following items can cause a malformed TLP (MfTLP):
1. The data payload exceeds the Max payload size.
2. The actual data length does not match the data length specified in the header.
3. The packet uses an undefined Type field value.

The filers we have run a mix of CIFS/NFS/Fibre Channel, with some running only Fibre Channel.
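For anyone trying to picture what those three checks mean, here is a rough sketch in Python. It is only an illustration of the conditions support quoted, not how the filer or the PCIe hardware actually implements them. The `Tlp` class, the `malformed_reasons` function, and the short list of type names are all made up for this example, and the model is simplified: the type is a symbolic name rather than the real bit encoding, and the header length is assumed to be given in 32-bit doublewords.

```python
from dataclasses import dataclass

# Illustrative subset of TLP type names (assumption for this sketch);
# these are symbolic labels, not the bit encodings from the specification.
KNOWN_TLP_TYPES = {"MRd", "MWr", "Cpl", "CplD"}


@dataclass
class Tlp:
    """Hypothetical, simplified model of a transaction layer packet."""
    tlp_type: str    # symbolic type name
    length_dw: int   # payload length declared in the header, in 32-bit doublewords
    payload: bytes   # the data payload actually carried


def malformed_reasons(tlp: Tlp, max_payload_bytes: int) -> list[str]:
    """Return the reasons (if any) this packet would count as malformed."""
    reasons = []
    # 1. Data payload exceeds the Max payload size.
    if len(tlp.payload) > max_payload_bytes:
        reasons.append("payload exceeds max payload size")
    # 2. Actual data length does not match the length in the header.
    if len(tlp.payload) != tlp.length_dw * 4:
        reasons.append("payload length does not match header length")
    # 3. Undefined Type field value.
    if tlp.tlp_type not in KNOWN_TLP_TYPES:
        reasons.append("undefined type field value")
    return reasons


good = Tlp("CplD", length_dw=32, payload=bytes(128))
bad = Tlp("CplD", length_dw=32, payload=bytes(256))
print(malformed_reasons(good, max_payload_bytes=128))
print(malformed_reasons(bad, max_payload_bytes=128))
```

The first packet passes all three checks; the second carries 256 bytes against a 128-byte limit and a header that declares 32 doublewords, so it trips the first two conditions at once.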
I sure hope that in RC3 or GA they fix the panic and core dump that has happened twice in the last month, on two of my controllers running RC2. Both times I contacted support, they said they had no idea when it would be fixed or what exactly was causing it. I would think that would be serious enough to be fixed before GA. I would tell others to think twice before an upgrade.