Data Backup and Recovery
Data Backup and Recovery
Dear all,
I'm currently facing an issue while adding Shared Disks to one of our Microsoft Failover Clusters in our testing environment.
A few hours earlier, I was able to add LUNs to another Microsoft Failover Cluster that was setup using the exact same procedure. All of a sudden, I now recieve the following error:
Error: No storage connection is set for StorageSystem : svm03-mgmt.example.com
The only change that was made in between both the succesfull and failed tries is the SnapCenter server's Fully Qualified Domain Name.
Troubleshooting Steps I've taken so far:
> svm03-mgmt.example.com Server: dc01.example.com Address: 10.0.0.1 Name: svm03-mgmt.example.com Address: 10.0.0.130 > 10.0.0.130 Server: dc01.example.com Address: 10.0.0.1 Name: svm03-mgmt.example.com Address: 10.0.0.130
Exchange02.example.com SCW_132.log ERROR 2018-05-13T21:53:50.3432389+02:00 Error SDW PID=[1608] TID=[2060] Cannot retrieve storage connection setting from SMS server. Exchange02.example.com SCW_132.log ERROR 2018-05-13T21:53:50.3432389+02:00 Error SDW PID=[1608] TID=[2060] Failed to provision disk Exchange02.example.com SCW_132.log ERROR 2018-05-13T21:53:50.3432389+02:00 Error SDW PID=[1608] TID=[2060] Error: No storage connection is set for StorageSystem : svm03-mgmt.example.com at SnapDrive.Nsf.Core.DiskProvisioningHelper.PreProcessandValidateCreateRequest(SDProvisionCreateStorageRequest request, DiskOperationContext context, HostDetails& host) at SnapDrive.Nsf.Core.DiskProvisioningManager.SDProvisionCreateStorage(SDProvisionCreateStorageRequest request, DiskOperationContext context) Exchange02.example.com SCW_132.log ERROR 2018-05-13T21:53:50.3432389+02:00 Error SDW PID=[1608] TID=[2060] Error: No storage connection is set for StorageSystem : svm03-mgmt.example.com Exchange02.example.com SCW_132.log ERROR 2018-05-13T21:53:50.3119889+02:00 Error SDW PID=[1608] TID=[2060] ManagementException : Not supported
> snapcenter.example.com Server: dc01.example.com Address: 10.0.0.1 Name: snapcenter.example.com Address: 10.0.0.221 > 10.0.0.221 Server: dc01.example.com Address: 10.0.0.1 Name: snapcenter.example.com Address: 10.0.0.221
Please note I'm currently using SnapCenter Build 4.0.0.809 installed on Windows Server 2012 R2 with all available HotFixes applied.
Has anyone had a simmilar issue? Anyone an idea on how further investigate the root cause for this issue?
Thank you in advance!
Kind regards,
Pieter-Jan Rijmenants
Solved! See The Solution
Many thanks to everyone who post here!
(You saved me a day or two.)
I had a similar problem and solved it after reading this thread!
My config was:
1. SnapCenter 4.4 installed on domain-joined server.
2. AD suffix of the domain is "domain.local" (for example).
3. Test VM named "TEST-10G" is a workgroup computer without a specific domain suffix (this is important).
4. I manually added a record of type A to the DNS zone of the domain for the TEST-10G host.
5. When I added this VM to SnapCenter, it automatically added the domain suffix to the name of this VM, so it would appear everywhere as "TEST-10G.domain.local" (this is important).
6. Job always failed on "create a snapshot" step.
Here is log entries from SCW Plugin:
2021-03-31T23:52:33.6321583+03:00 Error SAL PID=[5184] TID=[3044] Cannot retrieve storage connection setting from SMS server.
2021-03-31T23:52:33.6321583+03:00 Error SAL PID=[5184] TID=[3044] Response error: Access denied for server: TEST-10G or timeout expired.
2021-03-31T23:52:33.6321583+03:00 Error SAL PID=[5184] TID=[3044] Invalid StorageSystemId type supplied.
2021-03-31T23:52:33.6321583+03:00 Error SAL PID=[5184] TID=[3044] Could not find valid Storage System for the resource в SnapDrive.Nsf.ServiceProviders.SALPluginFactory.GetSALPluginProvider(SDStorageSystemId storageSystemId, SmRequestBase request)
в SnapDrive.Nsf.ServiceProviders.SALPluginFactory.GetSALPluginProvider(String storageSystemId, SmRequestBase request)
в SnapDrive.Nsf.ServiceProviders.SALPluginFactory.CreateSnapshot(CreateSnapshotRequest request)
2021-03-31T23:52:33.6321583+03:00 Error SAL PID=[5184] TID=[3044] Could not find valid Storage System for the resource
Finally, I added the default dns suffix in the system settings of this VM and reboot.
Right after that, everything worked as expected.
Hi
the log snippet you poested are coming from the exchange server.
did you check if that server has the proper connectivity to the storage?
you can try to "bypass" the DNS and add the hostname (not the FQDN) to the hosts file and check if something changes
let us know..
bye
Dear @matte,
We are talking about a test environment over here. Since I stumbled upon another issue on our Domain Controller I have been focussing on that in the past days. Furthermore, I quit testing the specific SnapCenter case in which I ran into the issue I initially posted.
Maybe I'll try to reproduce the issue in the (near) future. Although, I'm not sure of it at the moment.
Nevertheless, thank you for your response!
Kind regards,
Pieter-Jan Rijmenants
Ended up running into the exact same issue on our environment (also a test environment) running SnapCenter 4.3 (on Windows Server 2016) and trying to create LUN's on a Windows Server 2012 R2 cluster environment. We were receiving the "No storage system connection set" error message and chasing it for weeks now both with and without the assistance of NetApp.
We found that the SnapCenter server was on the domain, while the SVM was in a "workgroup"...and we were logging into and managing the SnapCenter server (including trying to create LUN's) as our domain users. When we logged into SnapCenter as a domain user, and tried to connect a LUN on an SVM which was in "workgroup", it wasn't authenticating. So even though it was giving errors related to not being able to find the storage system, it was more that it wasn't able to get the storage system details from the storage system with that user.
Add the SVM to the domain, or log into and create disks as the local "Administrator" and no issues.
Dear MCooper6,
You situation seems slightly different from the one we initially encountered. In our case, the Host and Storage Virtual Machine were both part of the same Active Directory domain. Nevertheless, many thanks for sharing your update/findings!
Kind regards,
Pieter-Jan Rijmenants
Hi,
Same issue here. But no luck with that workaround of adding SVM to domain or using local users to log in to Snapcenter.
The error is the same in all cases.
Any other ideas?
Regards,
Pedro Rocha
Hi,
Just an update on my issue. Plugin host did not had access to 443 on cluster/svm. When I gave that access it went further on the connect disk process and stucked on putting the LUN online:
In the logs the first error is: 2020-06-25T18:12:05.7548987-03:00 Error SDW PID=[22752] TID=[17052] Error: Failed to get partition information from disk on node V-MAILBOX01
This occurs right after the disk is onlined by Snapcenter: 2020-06-25T18:12:05.6767784-03:00 Information SDW PID=[22752] TID=[17052] Microsoft DiskPart version 6.3.9600 Copyright (C) 1999-2013 Microsoft Corporation. On computer: V-MAILBOX01 DISKPART> Disk 13 is now the selected disk. DISKPART> Disk attributes cleared successfully. DISKPART> DiskPart successfully onlined the selected disk. DISKPART> Leaving DiskPart
The last message is repeated several times until the job ends not being able to connect the LUN. Filled a case with NetApp.
Regards,
Pedro
Hi,
What I understand from your post is that you changed the SnapCenter server hostname in between, right?
When a host is registered with SnapCenter, there are entries in SMcore files pointing to SC server URL. It needs to updated in case you changed the SC server host name in between.
In your plug-in host, go to path C:\Program Files\NetApp\SnapCenter\SMCore
Open 'SMCoreServiceHost.exe' in text and check the highlighted line. It should reflect the URL with correct FQDN
<appSettings>
<add key="DontOverWrite" value="" />
<add key="UseWebService" value="true" />
<add key="SMSServerURL" value="https://hostname.domain.com:8146" />
<add key="SMSServer" value="localhost" />
<appSettings>
Many thanks to everyone who post here!
(You saved me a day or two.)
I had a similar problem and solved it after reading this thread!
My config was:
1. SnapCenter 4.4 installed on domain-joined server.
2. AD suffix of the domain is "domain.local" (for example).
3. Test VM named "TEST-10G" is a workgroup computer without a specific domain suffix (this is important).
4. I manually added a record of type A to the DNS zone of the domain for the TEST-10G host.
5. When I added this VM to SnapCenter, it automatically added the domain suffix to the name of this VM, so it would appear everywhere as "TEST-10G.domain.local" (this is important).
6. Job always failed on "create a snapshot" step.
Here is log entries from SCW Plugin:
2021-03-31T23:52:33.6321583+03:00 Error SAL PID=[5184] TID=[3044] Cannot retrieve storage connection setting from SMS server.
2021-03-31T23:52:33.6321583+03:00 Error SAL PID=[5184] TID=[3044] Response error: Access denied for server: TEST-10G or timeout expired.
2021-03-31T23:52:33.6321583+03:00 Error SAL PID=[5184] TID=[3044] Invalid StorageSystemId type supplied.
2021-03-31T23:52:33.6321583+03:00 Error SAL PID=[5184] TID=[3044] Could not find valid Storage System for the resource в SnapDrive.Nsf.ServiceProviders.SALPluginFactory.GetSALPluginProvider(SDStorageSystemId storageSystemId, SmRequestBase request)
в SnapDrive.Nsf.ServiceProviders.SALPluginFactory.GetSALPluginProvider(String storageSystemId, SmRequestBase request)
в SnapDrive.Nsf.ServiceProviders.SALPluginFactory.CreateSnapshot(CreateSnapshotRequest request)
2021-03-31T23:52:33.6321583+03:00 Error SAL PID=[5184] TID=[3044] Could not find valid Storage System for the resource
Finally, I added the default dns suffix in the system settings of this VM and reboot.
Right after that, everything worked as expected.