OpenStack Discussions

OpenStack NFS Not Responding During ONTAP Upgrade

JeremyWeber

Interesting issue we have seen during ONTAP upgrades in OSP environment. During the upgrade from 9.3 to 9.5 we have seen across multiple clusters in multiple locations random OSP compute nodes that end up with "NFS not responding" and VM's hang. Only way OSP team has found to recover is to reboot the compute nodes which impacts all VM including those still running fine. We have only seen this behavior during upgrades and only on OSP compute nodes. Other, what I will call "normal",  NFS or CIFS shares for files are not impacted at all. There is even evidence that VMware datastores from NFS had no impact. Only OSP compute nodes. Other interesting fact is that we can't reproduce this in any way other than during the upgrade. We have had to do failovers for dimm replacements and did multiple tests doing manual LIF migrations and never saw a single issue. Only during the upgrade. Anyone else seen this? What did you find to be the cause?

 

Note: I can't share much detail on our env due to security, so I have to keep it generalized here. 

1 REPLY 1

Ontapforrum

We will need some basic information to understand this issue.

 

Which NFS version is in place? (I am assuming it is 4.0/4_1)
OnepStack platform you are using?
Client side logs? (/var/log/dmseg,/var/log/messages,/var/log/kern)

Announcements
NetApp on Discord Image

We're on Discord, are you?

Live Chat, Watch Parties, and More!

Explore Banner

Meet Explore, NetApp’s digital sales platform

Engage digitally throughout the sales process, from product discovery to configuration, and handle all your post-purchase needs.

NetApp Insights to Action
I2A Banner
Public