
SnapMirror resync stuck on status: Transferring

Stefan_V

Hi all,

 

I have two NetApp clusters running ONTAP 9.1P2.

SnapMirror is configured between SVM and SVM_DR.

After a failover, I try to reverse the SnapMirror relationship with this command:

 

snapmirror resync SVM:

 

The resync gets stuck with Relationship Status: Transferring.

The total progress is 41.50MB, so some data seems to have been transferred.

However, it should transfer 4GB of data.

 

Later, these messages appear in the event log:

 

8/9/2018 05:42:17 node-01 ERROR csm.sessionFailed: Cluster interconnect session (req=node-01:dblade, rsp=dr_node-01:dblade, uniquifier=010572fe472e3179) failed with record state ACTIVE and error CSM_FAIL.

8/9/2018 05:42:17 node-02 ERROR csm.sessionFailed: Cluster interconnect session (req=node-02:dblade, rsp=dr_node-02:dblade, uniquifier=010572fe472e3179) failed with record state ACTIVE and error CSM_FAIL.

 

However, when I check the health of the cluster peers, everything is OK.

 

cluster peer health show
Node       Cluster-Name                Node-Name
             Ping-Status               RDB-Health Cluster-Health  Availability
---------- --------------------------- --------- --------------- ------------
node-01
           dr_cluster                  dr_node-01
             Data: interface_reachable
             ICMP: interface_reachable true      true            true
                                       dr_node-02
             Data: interface_reachable
             ICMP: interface_reachable true      true            true
node-02
           dr_cluster                  dr_node-01
             Data: interface_reachable
             ICMP: interface_reachable true      true            true
                                       dr_node-02
             Data: interface_reachable
             ICMP: interface_reachable true      true            true
4 entries were displayed.

 

Does anybody know what CSM_FAIL means and what I should check?

The creation of the Snapmirror and the failover was performed without any issue.

 

 

Also, I'm not sure whether it is related to this issue, but my event log is full of messages like this one, on different ports:

8/9/2018 05:38:46 node-02 ERROR netif.rateLimitThreshold: High rate limit on network interface e0f for broadcast protocol ARP being detected: 5001 pkts/sec.

 

 

 


Trubida

Do you have a volume offline at the new source?  I've seen SVM_DR fail when a volume is offline on the source.
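
One way to check this is sketched below. It assumes the reversed relationship uses SVM_DR as the new source and SVM: as the destination path, as in the resync command in the question; substitute your own names if they differ. The first command lists any offline volumes on the new source and is run on the cluster that owns SVM_DR. The second shows the relationship's own health and last-error fields and is run on the cluster that owns SVM.

```
volume show -vserver SVM_DR -state offline
snapmirror show -destination-path SVM: -fields status,healthy,unhealthy-reason,last-transfer-error
```

If the first command returns any volumes, bring them online (or deal with them as appropriate) before retrying the resync. The unhealthy-reason and last-transfer-error fields may give a more specific cause than the csm.sessionFailed event alone.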
