
ONTAP Hardware

NFSv4 interface not contactable

2Chris

Good Morning,

Thank you in advance for your help with my first post.

FAULT - Intermittent connection dropouts on an interface, affecting NFSv4 clients and ping requests.

 

Overview - We have approx. 200 HPC servers connecting to a FAS8200 HA pair running clustered ONTAP 9.5P8. The clients are load balanced across both nodes from the client side, by IP address. Each node has one HPC-only interface made up of two physical ports in multimode. The node 1 interface loses connectivity to all of the clients connected to it. The dropouts are very intermittent and show no pattern, such as a correlation with load.

We see no errors on the interfaces, ports, or nodes, and none on the network (the clients and nodes are on the same physical switch and the same VLAN). The clients report no errors other than the loss of connectivity, which lasts from a few seconds to a couple of minutes. We have even done packet analysis on the interface, and it shows nothing other than a complete stop in communication. I have also moved the interface to node 2, and the fault migrates with it (indicating no filer hardware issue).

 

Has anyone experienced anything like this before? The HPC and network engineers are adamant that there is no fault on their side either.
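One way to pin down exactly when each dropout starts and ends, so it can be lined up against switch and ONTAP logs, is to probe the interface from a client on a fixed interval. The following is a minimal sketch and not something taken from this thread: the function names, the one-second interval, and the use of a TCP connect to port 2049 (the standard NFS port) in place of ICMP ping are all assumptions made for illustration.

```python
import socket
import time
from typing import Iterable, List, Tuple


def probe(host: str, port: int = 2049, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, unreachable hosts, and timeouts.
        return False


def outage_windows(
    samples: Iterable[Tuple[float, bool]]
) -> List[Tuple[float, float]]:
    """Collapse (timestamp, reachable) samples into (first_failure, recovery) windows.

    An outage still open at the end of the samples is closed at the
    timestamp of the last sample.
    """
    windows = []
    start = None
    last_ts = None
    for ts, ok in samples:
        if not ok and start is None:
            start = ts                    # first failed probe of a new outage
        elif ok and start is not None:
            windows.append((start, ts))   # first successful probe after an outage
            start = None
        last_ts = ts
    if start is not None:
        windows.append((start, last_ts))
    return windows


def monitor(host: str, interval: float = 1.0, count: int = 60):
    """Probe host `count` times, `interval` seconds apart, and return outage windows."""
    samples = []
    for _ in range(count):
        samples.append((time.time(), probe(host)))
        time.sleep(interval)
    return outage_windows(samples)
```

Calling `monitor("192.0.2.10")` (a placeholder address) from several clients at once would show whether they all lose the interface at the same instant, which is the kind of timestamp that can be compared against switch logs.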


parisi

Generally speaking, if you're seeing pings fail, it's likely a network problem.

 

ONTAP has some protective measures in place for flooding network connections, but it doesn't generally result in dropped pings.

 

If you can PM me the serial numbers/case number, I can take a look to see if anything sticks out.

2Chris

Hi Paul,

Thank you for the reply.

 

Yes, I have captured packets with Wireshark, but unfortunately, apart from a few resends across the whole trace, nothing unusual happens before communication simply stops. I have raised a case with NetApp and sent them the packet trace. They confirm that it shows nothing that would cause a loss of communication, and they see nothing wrong with the filer or its configuration.

 

I'm just trying to pick more brains here, as this keeps happening for no apparent reason and is causing headaches for users. Again, the network team and the HPC clients (all connected to the same switch) see no errors apart from the loss of connection.

paul_stejskal

What OS are the clients?

2Chris

Hi Paul,

 

The clients are servers running Red Hat Enterprise Linux for HPC 7.6.
