Re: Setting up NetApp

ITAWOLV · ‎2018-06-25

Hello,

our NetApp is no longer reachable. It used to be in a different network. The storage and the shares with their data still exist. After the NetApp server was moved to a different network, the connection could no longer be established.

I would appreciate your help.

schmitz_peter · ‎2018-06-26

Hi,

I'll switch to English so that more people can contribute...

If you place your filer into a different network, you'll have to configure it accordingly.

As we don't know anything about your environment, please give us some more information, like

* what kind of hardware is this?

* is it running cDOT or 7-mode?

* how is the current network on the machine configured?

* do you have access to the service processor?

* can you put it back to the old network until this is resolved?

Cheers

Peter

ITAWOLV · ‎2018-06-26

Hello,

first off, I am an absolute NetApp beginner. When I connect via the COM (serial) port, I get a LOADER-B prompt.

- It is a FAS2220 with a DS4243 expansion shelf

- What is cDOT or 7-Mode?

- The service processor is configured with an IP address and responds to ping

- No, unfortunately I cannot move it back to the old network

schmitz_peter · ‎2018-06-26

Hi,

that sounds like you were thrown in at the deep end...

"LOADER-B" means that the machine is sitting at the boot loader (PROM) prompt and the OS has not been booted.

Please try the command "printenv" (without the quotation marks) at the LOADER prompt; then we can see which operating system ran last.

That determines how the network has to be configured.

cDOT, or clustered Data ONTAP, is the current OS that many of the current NetApp systems run. Before that there was 7-Mode.

If you don't have a service provider who can take care of the machine, this will mean somewhat more work for you. Do you have a support contract with NetApp for the device?

Otherwise, post the output of "printenv" and at least read https://en.wikipedia.org/wiki/ONTAP

Best regards

Peter

ITAWOLV · ‎2018-06-26

CPU Type: Intel(R) Xeon(R) CPU C3528 @ 1.73GHz
LOADER-B> printenv

Variable Name Value
-------------------- --------------------------------------------------
CPU_NUM_CORES 4
BOOT_CONSOLE sp0a
BIOS_VERSION 8.3.0
BIOS_DATE 04/08/2014
SYS_MODEL FAS2220
SYS_REV D6
SYS_SERIAL_NUM 651417000047
MOBO_MODEL FAS
MOBO_REV D6
MOBO_SERIAL_NUM 9444190887
CPU_SPEED 1730
CPU_TYPE Intel(R) Xeon(R) CPU C3528 @ 1.73GHz
savenv saveenv
ENV_VERSION 1
USE_SECONDARY false
LOADER_VERSION 4.3
ARCH X86_64
BOARDNAME SB_XIX
PRIMARY_KERNEL_URL fat://boot0/X86_64/kernel/primary.krn
BACKUP_KERNEL_URL fat://boot0/backup/X86_64/kernel/primary.krn
GX_DIAG_URL fat://boot0/X86_64/freebsd/image1/kernel
FIRMWARE_URL fat://boot0/X86_64/firmware/SB_XIX/firmware.img
REBOOT_REASON REBOOT_HALT_CMD
BIOS_INTERFACE 9FC3
BOOT_FLASH flash0a
GX_PRIMARY_KERNEL_URL fat://boot0/X86_64/freebsd/image1/kernel
GX_BACKUP_KERNEL_URL fat://boot0/X86_64/freebsd/image2/kernel
ntap.init.kernelname X86_64/freebsd/image1/kernel
AUTOBOOT true
AUTOBOOT_FROM PRIMARY
AUTO_FW_UPDATE true
BOOTED_FROM OTHER
boot_ontap autoboot boot0
boot_primary setenv BOOTED_FROM PRIMARY; boot -elf64 $GX_PRIMARY_KERNEL_URL $PRIMARY_KERNEL_URL
boot_backup setenv BOOTED_FROM BACKUP; boot -elf64 $GX_BACKUP_KERNEL_URL $BACKUP_KERNEL_URL
netboot setenv BOOTED_FROM NETWORK; boot -elf64
boot_diags boot -sld -elf64 $GX_DIAG_URL $PRIMARY_KERNEL_URL
ldkern load -elf64 $GX_PRIMARY_KERNEL_URL $PRIMARY_KERNEL_URL
update_flash flash -backup $FIRMWARE_URL flash0a
version printenv BIOS_VERSION LOADER_VERSION
BT_BIOS_VERSION 8.3.0
BT_LOADER_VERSION 4.3

schmitz_peter · ‎2018-06-26

Hi,

too bad, unfortunately the output does not show which OS is running. However, the machine apparently has not been updated for quite a while ("BIOS_DATE 04/08/2014").
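
For anyone who wants to pull single values such as BIOS_DATE or SYS_MODEL out of a saved console capture, here is a minimal Python sketch. It assumes the layout of the printenv output posted above (a dashed separator row, then one variable per line with the name as the first whitespace-free token). parse_printenv is a made-up helper name for illustration, not a NetApp tool.

```python
def parse_printenv(text):
    """Parse LOADER 'printenv' console output into a dict of name -> value."""
    env = {}
    in_table = False
    for line in text.splitlines():
        stripped = line.strip()
        if not in_table:
            # Assumption: the variable table starts after the dashed separator row.
            if stripped.startswith("----"):
                in_table = True
            continue
        if not stripped:
            continue
        # The name is the first token; the rest of the line is the value,
        # which may itself contain spaces (e.g. the boot_* macros).
        parts = stripped.split(None, 1)
        env[parts[0]] = parts[1] if len(parts) > 1 else ""
    return env


# Shortened sample taken from the capture in this thread.
sample = """\
LOADER-B> printenv

Variable Name Value
-------------------- --------------------------------------------------
BIOS_VERSION 8.3.0
BIOS_DATE 04/08/2014
SYS_MODEL FAS2220
AUTOBOOT true
boot_ontap autoboot boot0
"""

env = parse_printenv(sample)
print(env["SYS_MODEL"], env["BIOS_DATE"])
```

As noted, none of these variables names the ONTAP release, so this only helps with hardware and firmware details.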

Well, if the 4243 shelf (important) and the network cables are connected correctly, the only option is to boot and see what happens.

So it is best to log everything that comes over the terminal, then enter "boot_ontap" at the LOADER-B prompt; the machine should then try to boot.

Is this a standalone machine or an HA pair? And once again: if you have a support contract, you had better open a ticket.

Best regards

Peter

ITAWOLV · ‎2018-06-26

Hello,

unfortunately we no longer have a support contract.

And yes, technically it is an HA pair.

This is the output of boot_ontap:

LOADER-B> boot_ontap
Loading X86_64/freebsd/image1/kernel:0x100000/9570440 0xa20888/4044560 Entry at 0x8016e330
Loading X86_64/freebsd/image1/platform.ko:0xdfc000/781096 0xf97750/720384 0xebab40/44936 0x1047550/49560 0xec5ac8/108679 0xee034f/80250 0xef3ce0/170336 0x10536e8/193800 0xf1d640/16 0xf1d650/2384 0x1082bf0/7152 0xf1e000/0 0xf1e000/344 0x10847e0/1032 0xf1e158/1944 0x1084be8/5832 0xf1e8f0/1648 0x10862b0/4944 0xf1ef60/240 0x1087600/720 0xf1f060/448 0xf5a8a0/14896 0xf97652/253 0xf5e2d0/135720 0xf7f4f8/98650
Starting program at 0x8016e330
NetApp Data ONTAP 8.2.2 7-Mode
Root mount waiting for: usbus0
Root mount waiting for: usbus0
Root mount waiting for: usbus0
Root mount waiting for: usbus0
Copyright (C) 1992-2014 NetApp.
All rights reserved.
md1.uzip: 39168 x 16384 blocks
md2.uzip: 15360 x 16384 blocks
*******************************
* *
* Press Ctrl-C for Boot Menu. *
* *
*******************************
Jun 26 14:18:44 [localhost:cf.nm.nicTransitionUp:info]: HA interconnect: Link up on NIC 0.
Jun 26 14:18:46 [localhost:cf.rv.flush.handleExchange:info]: HA interconnect: Flushing is active.
Jun 26 14:18:54 [localhost:snmp.link.up:info]: Interface 2 is up
Jun 26 14:18:54 [localhost:netif.linkUp:info]: Ethernet e0b: Link up.
Jun 26 14:18:55 [localhost:diskown.isEnabled:info]: software ownership has been enabled for this system
add host 127.0.10.1Jun 26 14:18:55 [localhost:config.noPartnerDisks:CRITICAL]: No disks were detected for the partner; this node will be unable to takeover correctly
WAFL CPLEDGER is enabled. Checklist = 0x7ff841ff
: gateway 127.0.20.1
Jun 26 14:18:55 [localhost:callhome.dsk.config:warning]: Call home for DISK CONFIGURATION ERROR
Jun 26 14:18:55 [localhost:wafl.memory.status:info]: 2684MB of memory is currently available for the WAFL file system.
Jun 26 14:18:55 [localhost:snmp.link.up:info]: Interface 6 is up
Jun 26 14:18:55 [localhost:netif.linkUp:info]: Ethernet e0P: Link up.
Jun 26 14:18:55 [localhost:snmp.link.up:info]: Interface 5 is up
Jun 26 14:18:55 [localhost:netif.linkUp:info]: Ethernet e0M: Link up.
Jun 26 14:18:55 [localhost:dcs.framework.enabled:info]: The DCS framework is enabled on this node.
Jun 26 14:18:55 [localhost:cf.nm.nicReset:warning]: HA interconnect: Initiating soft reset on card 0 due to rendezvous reset.
Jun 26 14:18:55 [localhost:cf.rv.notConnected:error]: HA interconnect: Connection for 'cfo_rv' failed.
Jun 26 14:18:55 [localhost:fmmb.current.lock.disk:info]: Disk 0b.01.0 is a local HA mailbox disk.
Jun 26 14:18:55 [localhost:fmmb.current.lock.disk:info]: Disk 0b.01.1 is a local HA mailbox disk.
Jun 26 14:18:55 [localhost:fmmb.instStat.change:info]: normal mailbox instance on local side.
Jun 26 14:18:55 [localhost:fmmb.instStat.change:info]: no mailbox instance on partner side.
Jun 26 14:18:56 [localhost:snmp.link.up:info]: Interface 1 is up
Jun 26 14:18:56 [localhost:netif.linkUp:info]: Ethernet e0a: Link up.
Jun 26 14:18:56 [localhost:snmp.link.up:info]: Interface 3 is up
Jun 26 14:18:56 [localhost:netif.linkUp:info]: Ethernet e0c: Link up.
Jun 26 14:18:56 [localhost:snmp.link.up:info]: Interface 4 is up
Jun 26 14:18:56 [localhost:netif.linkUp:info]: Ethernet e0d: Link up.
Waiting for giveback...(Press Ctrl-C to abort wait)Jun 26 14:24:19 [localhost:cf.disk.inventory.mismatch:CRITICAL]: Status of the disk 0a.00.3 (500605BA:00EA3DCC:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000) has recently changed or the node () is missing the disk.
Jun 26 14:24:19 [localhost:cf.disk.inventory.mismatch:CRITICAL]: Status of the disk 0a.00.2 (500605BA:00EA5E0C:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000) has recently changed or the node () is missing the disk.
Jun 26 14:24:19 [localhost:cf.disk.inventory.mismatch:CRITICAL]: Status of the disk 0a.00.1 (500605BA:00EA5ADC:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000) has recently changed or the node () is missing the disk.
Jun 26 14:24:19 [localhost:cf.disk.invent.mismatchalt:CRITICAL]: Status of some of the disks has changed or the node () is missing 12 disks (detailed logs have been throttled).
Jun 26 14:24:19 [localhost:callhome.sfo.miscount:CRITICAL]: Call home for HA GROUP ERROR: DISK/SHELF COUNT MISMATCH

This node was previously declared dead.
Pausing to check HA partner status ...
partner is operational and in takeover mode.

You must initiate a giveback or shutdown on the HA
partner in order to bring this node online.

The HA partner is currently operational and in takeover mode.This node cannot continue unless you initiate a giveback on the partner.
Once this is done this node will reboot automatically.

waiting for giveback...

schmitz_peter · ‎2018-06-26

Hi,

that looks very much like one or more cabling errors:

Jun 26 14:18:55 [localhost:config.noPartnerDisks:CRITICAL]: No disks were detected for the partner; this node will be unable to takeover correctly
Jun 26 14:18:55 [localhost:callhome.dsk.config:warning]: Call home for DISK CONFIGURATION ERROR
Jun 26 14:24:19 [localhost:cf.disk.invent.mismatchalt:CRITICAL]: Status of some of the disks has changed or the node () is missing 12 disks (detailed logs have been throttled).
Jun 26 14:24:19 [localhost:callhome.sfo.miscount:CRITICAL]: Call home for HA GROUP ERROR: DISK/SHELF COUNT MISMATCH
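
The lines above were picked out by hand. As a rough Python sketch, the same filtering could be done on a saved console capture. The line layout (timestamp, then [node:event:severity]: message) is inferred from the boot log in this thread and is an assumption, as are the names EMS_LINE and find_problems.

```python
import re

# Matches console lines such as
#   Jun 26 14:18:55 [localhost:cf.rv.notConnected:error]: HA interconnect: ...
# Layout inferred from the capture in this thread (an assumption).
EMS_LINE = re.compile(
    r"(?P<time>[A-Z][a-z]{2} +\d+ \d\d:\d\d:\d\d) "
    r"\[(?P<node>[^:\]]*):(?P<event>[^:\]]+):(?P<severity>[^\]]+)\]: "
    r"(?P<message>.*)"
)


def find_problems(console_text, severities=("critical", "error", "warning")):
    """Return (time, event, severity, message) tuples for the given severities."""
    hits = []
    for line in console_text.splitlines():
        # search() instead of match(): other console output is sometimes
        # printed on the same line in front of the timestamp.
        m = EMS_LINE.search(line)
        if m and m.group("severity").lower() in severities:
            hits.append((m.group("time"), m.group("event"),
                         m.group("severity"), m.group("message")))
    return hits


# Three lines copied from the boot log in this thread.
sample = """\
Jun 26 14:18:54 [localhost:netif.linkUp:info]: Ethernet e0b: Link up.
add host 127.0.10.1Jun 26 14:18:55 [localhost:config.noPartnerDisks:CRITICAL]: No disks were detected for the partner; this node will be unable to takeover correctly
Jun 26 14:18:55 [localhost:cf.rv.notConnected:error]: HA interconnect: Connection for 'cfo_rv' failed.
"""

problems = find_problems(sample)
for time, event, severity, message in problems:
    print(f"{severity.upper():8} {event}: {message}")
```
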

At least we now know the operating system: NetApp Data ONTAP 8.2.2 7-Mode

I don't know what the cluster looked like before, but it should be set up again in exactly the same way before you continue working on it.

Help with the cabling is available at https://library.netapp.com/ecm/ecm_download_file/ecmp1149275

What worries me is the "technically" in "technically it is an HA pair". Did you split the pair? That cannot be done just like that...

Peter

schmitz_peter · ‎2018-06-26

Oh, and in case the cluster was switchless:

https://library.netapp.com/ecm/ecm_download_file/ECMP1193604

ITAWOLV · ‎2018-06-26

Well, the cabling is correct.

As for the HA pair: I know that there are two stores (Store1, Store2) with this DS4243 expansion shelf.

schmitz_peter · ‎2018-06-26

If the filer says that 12 disks are missing for the partner, then it looks as if the SAS cabling is not properly connected to the partner...

Jun 26 14:18:55 [localhost:cf.rv.notConnected:error]: HA interconnect: Connection for 'cfo_rv' failed.

indicates that the pair is no longer a pair, because there is no interconnect.

Without seeing the machines, unfortunately all we can do is guess...

schmitz_peter · ‎2018-06-26

Machine A has taken over machine B, and B is waiting to be given back to the cluster.

If you plug the serial cable into the other machine, it will probably show a login prompt.

If you log in there, you could enter "cf status", "cf monitor" and "cf partner"...

ITAWOLV · ‎2018-06-26

Yes. The commands all work. The output says that machine B is the partner of A.

I urgently need help.

schmitz_peter · ‎2018-07-02

Hi,

please post the output of "disk show -v", "disk show -n", "aggr status -f" and "sysconfig -r".

schmitz_peter · ‎2018-06-26

Oh, and please install:

https://mysupport.netapp.com/tools/info/ECMS1357843I.html?productID=61923/

With it you can at least detect configuration errors.