Data Backup and Recovery
Data Backup and Recovery
We recently purchased and setup a FAS3240 running 8.0.1P5 and I'm having trouble with Snapdrive on our AIX host.
After installing Snapdrive 4.2 I can run some commands, but can't create a snapshot.
The LUNs mounted on the host via FC were created on the filer not w/ Snapdrive but the snapdrive command doesn't associate them with the hosts.
We can lists the LUNs (see below). Notice we see the LUNs with the "sanlun" command.
We made just a couple changes to the snapdrive.conf file:
autosupport-enabled=on # Enable autosupport flag
default-transport="FCP" # Transport type to use for storage provisioning, when a decision is needed
multipathing-type="NativeMPIO" # Multipathing software to use when more than one multipathing solution is available. Possible values are 'DMP', 'NativeMPIO' or 'none'
We use AIX LVM and JFS2 which are defaults.
Details:
FAS3240A> options trusted.hosts
trusted.hosts * (same value required in local+partner)
AIX 5.3 TL 12
- Snapdrive 4.2 for AIX installed
bash-2.05b# lslpp -l | grep -i snap
NetApp.snapdrive 4.2.0.0 COMMITTED Network Appliance SnapDrive
- Snapdrive Utilties installed
bash-2.05b# lslpp -l | grep -i utilit
NetApp.MPIO_Host_Utilities_Kit.config
5.1.0.0 COMMITTED NetApp MPIO PCM Host Utilities
NetApp.MPIO_Host_Utilities_Kit.fcp
5.1.0.0 COMMITTED NetApp MPIO PCM Host Utilities
NetApp.MPIO_Host_Utilities_Kit.pcmodm
5.1.0.0 COMMITTED NetApp MPIO PCM Host Utilities
- the sanlun command sees the LUNs and they are mounted and active
bash-2.05b# sanlun lun show
controller: lun-pathname device filename adapter protocol lun size lun state
FAS3240A: /vol/freyroot/freyrootlun01 hdisk35 fcs0 FCP 100g (107374182400) GOOD
FAS3240A: /vol/frey/freylun01 hdisk36 fcs0 FCP 200g (214748364800) GOOD
FAS3240A: /vol/frey02/freylun02 hdisk37 fcs0 FCP 300.0g (322163441664) GOOD
FAS3240A: /vol/frey03/freylun03 hdisk38 fcs0 FCP 150g (161061273600) GOOD
bash-2.05b# snapdrive version
snapdrive Version 4.2P1
Snapdrive Daemon Version 4.2P1
# snapdrive config list
username appliance name appliance type
--------------------------------------------
snapdrvaix 10.65.2.135 StorageSystem
snapdrvaix 10.65.2.137 StorageSystem
# snapdrive snap list -filer 10.65.2.135
snap name host date snapped
--------------------------------------------------------------------------------
10.65.2.135:/vol/root:hourly.0 non-snapdrive snapshot
10.65.2.135:/vol/root:hourly.1 non-snapdrive snapshot
10.65.2.135:/vol/root:nightly.0 non-snapdrive snapshot
...
10.65.2.135:/vol/papayasg3:FAS3210(1573774893)_papayasg3_mirror.1 non-snapdrive snapshot
...
# snapdrive storage list -all
0001-185 Command error: storage show failed: no NETAPP devices to show or add the host to the trusted hosts (options trusted.hosts) and enable SSL on the storage system or retry after changing snapdrive.conf to use http for storage system communication and restarting snapdrive daemon.
# snapdrive snap -fs /u02 -snapname my_snap
0001-023 Admin error: Unable to discover all LUNs in file system /u02.
Devices not responding: /dev/hdisk36, /dev/hdisk37, /dev/hdisk38
Please check the LUN status on the storage system and bring the LUN online if necessary or add the host to the trusted hosts (options trusted.hosts) and enable SSL on the storage system or retry after changing snapdrive.conf to use http for storage system communication and restarting snapdrive daemon.
-- Running # snapdrive storage list -filer 10.65.2.137
LUNs not connected to this host:
lun path size state
----------------------------- ------ ------
10.65.2.135:/vol/bragi/bragilun01 895.1g online
10.65.2.135:/vol/papayasg1/papayasg1db 200.0g online
10.65.2.135:/vol/frey/freylun01 200g online
10.65.2.135:/vol/fulla/fullalun01 200g online
10.65.2.135:/vol/gefjon/gefjonlun01 100g online
10.65.2.135:/vol/heimdall/heimdalllun01 175.0g online
10.65.2.135:/vol/hild/hildlun02 100g online
10.65.2.135:/vol/hild/hildlun01 195.0g online
10.65.2.135:/vol/idun/idunlun01 145.0g online
10.65.2.135:/vol/papayasg2/papayasg2db 200.0g online
10.65.2.135:/vol/bragiroot/bragirootlun01 100g online
10.65.2.135:/vol/freyroot/freyrootlun01 100g online
10.65.2.135:/vol/fullaroot/fullarootlun01 100g online
10.65.2.135:/vol/gefjonroot/gefjonrootlun01 100g online
10.65.2.135:/vol/heimdallroot/heimdallrootlun01 100g online
10.65.2.135:/vol/hildroot/hildrootlun01 100g online
10.65.2.135:/vol/idunroot/idunrootlun01 100g online
10.65.2.135:/vol/lokiroot/lokirootlun01 100g online
10.65.2.135:/vol/loki/lokilun02 100g online
10.65.2.135:/vol/loki/lokilun01 195.0g online
10.65.2.135:/vol/bragiu02/bragilun02 495.0g online
10.65.2.135:/vol/bragiu03/bragilun03 295.0g online
10.65.2.135:/vol/frey02/freylun02 300.0g online
10.65.2.135:/vol/frey03/freylun03 150g online
...
Are WWPW of local adapters correctly found? What is result of
sanlun fcp show adapter -v
sanlun fcp show adapter -c
And on filer
igroup show -v <the name of your AIX igroup
Note: The LUNs were created on the filer, not through snapdrive.
bash-2.05b# sanlun fcp show adapter -v
adapter name: fcs0
WWPN: 10000000c946c36a
WWNN: 20000000c946c36a
driver name: /usr/lib/drivers/pci/efcdd
model: df1000fa
model description: FC Adapter
serial number: 1B524039C9
hardware version: Not Available
driver version: 5.3.12.4
firmware version: 191105
Number of ports: 1
port type: Fabric
port state: Operational
supported speed: 2 GBit/sec
negotiated speed: 2 GBit/sec
OS device name: fcs0
adapter name: fcs1
WWPN: 10000000c94060bb
WWNN: 20000000c94060bb
driver name: /usr/lib/drivers/pci/efcdd
model: df1000fa
model description: FC Adapter
serial number: 1F4330AAAF
hardware version: Not Available
driver version: 5.3.12.4
firmware version: 191105
Number of ports: 1
port type: Fabric
port state: Operational
supported speed: 2 GBit/sec
negotiated speed: 2 GBit/sec
OS device name: fcs1
bash-2.05b# sanlun fcp show adapter -c
Enter this controller command to create an initiator group for this system:
igroup create -f -t aix "frey.pepperdine.edu" 10000000c946c36a 10000000c94060bb
FAS3240A> igroup show -v frey
frey (FCP):
OS Type: aix
Member: 10:00:00:00:c9:46:c3:6a (logged in on: vtic, 0c)
Member: 10:00:00:00:c9:40:60:bb (logged in on: vtic, 0d)
Pset: freyboot
ALUA: Yes
Our VAR and then we checked that our hardware/software is supported in the Support Matrix. But when I run the sdconfcheck the last line doesn't look right (i'm going to post this as separate question just so more people may see it).
bash-2.05b# ls
sdconfcheck snapdrive snapdrived
bash-2.05b# ./sdconfcheck check
Detected PowerPC Architecture
Detected IBM AIX OS
Detected FCP on AIX
Detected AIX JFS File System
Detected AIX JFS2 File System
Detected AIX Native LVM
Detected AIX Native MPIO
Did not find any supported cluster solutions.
Detected IBM AIX FCP Support Kit 4.1
Detected IBM AIX iSCSI Initiator Support Kit 1.1
Did not find any supported configurations by SDU Version 4.2
Additional info:
snapdrive.conf:
multipathing-type="NativeMPIO" # Multipathing software to use when more than one multipathing solution is available. Possible values are 'DMP', 'NativeMPIO' or 'none'
- on the AIX host
# snapdrive storage create -lun my_test_lun:/vol/aaa -lunsize 1g
0001-852 Command error: Bad lun name: my_test_lun:/vol/aaa - format not recognized, missing lun short name. Correct format is my_test_lun:/vol/aaa/mylun
# snapdrive storage create -lun my_test_lun:/vol/aaa/my_test_lun -lunsize 1g
0001-136 Admin error: Unable to log on to storage system: my_test_lun
Please set user name and/or password for my_test_lun, i.e.
snapdrive config set root my_test_lun or
Please set the management interface for my_test_lun to be used as data interface, i.e.
snapdrive config set -mgmtpath <mgmtpath> my_test_lun
# snapdrive storage create -fs /mnt/test -fstype jfs2 -lun 10.65.2.135:/vol/aaa/my_test_lun -lunsize 10m -nolvm
LUN 10.65.2.135:/vol/aaa/my_test_lun ... created
mapping new lun(s) ... done
discovering new lun(s) ... *failed*
Cleaning up ...
- LUN 10.65.2.135:/vol/aaa/my_test_lun ... deleted
0001-476 Admin error: Unable to discover the device associated with 10.65.2.135:/vol/aaa/my_test_lun. If multipathing in use, there may be a possible multipathing configuration error.
Please verify the configuration and then retry.
- at the same time on the filer
Thu Oct 6 10:48:43 PDT [FAS3240A: app.log.info:info]: frey.pepperdine.edu: snapdrive 4.2P1: (1) daemon started: protos=, Connect Luns=0, dgs=0, hvs=0, fs=0, host_name=frey.pepperdine.edu, host_os=AIX, host_os_release=5, host_os_version=3, No of controller=2, PM/RBAC=native
Thu Oct 6 10:52:51 PDT [FAS3240A: lun.map:info]: LUN /vol/aaa/my_test_lun was mapped to initiator group frey.pepperdine.edu_fcp_SdIg=4
Thu Oct 6 10:53:16 PDT [FAS3240A: lun.map.unmap:info]: LUN /vol/aaa/my_test_lun unmapped from initiator group frey.pepperdine.edu_fcp_SdIg
Thu Oct 6 10:53:16 PDT [FAS3240A: lun.destroy:info]: LUN /vol/aaa/my_test_lun destroyed
Thu Oct 6 10:53:36 PDT [FAS3240A: app.log.alert:ALERT]: frey.pepperdine.edu: snapdrive 4.2P1: (4) storage create failed: protos=, Connect Luns=0, dgs=0, hvs=0, fs=0, host_name=frey.pepperdine.edu, host_os=AIX, host_os_release=5, host_os_version=3, No of controller=2, PM/RBAC=native,1476: Unable to discover the device associated with 10.65.2.135:/vol/aaa/my_test_lun. If multipathing in use, there may be a possible multipathing configuration error. Please verify the configuration and then retry.
For some reasons it finds old version of host utilities. It could be the cause of your problem.