The operating system has halted. Please press any key to reboot.
System halting... cpu_reset called on cpu#0
Phoenix SecureCore(tm) Server Copyright 1985-2008 Phoenix Technologies Ltd. All Rights Reserved BIOS version: 8.3.0 Portions Copyright (c) 2008-2014 NetApp, Inc. All Rights Reserved
CPU = 1 Processors Detected, Cores per Processor = 2 Intel(R) Xeon(R) CPU C3528 @ 1.73GHz Testing RAM 512MB RAM tested 6144MB RAM installed 256 KB L2 Cache per Processor Core 4096K L3 Cache Detected System BIOS shadowed USB 2.0: MICRON eUSB DISK BIOS is scanning PCI Option ROMs, this may take a few seconds... WARNING 02A1: SP Not Found ERROR No Response to Controller FRU ID Read Request via IPMI ERROR No Response to Midplane FRU ID Read Request via IPMI
Boot Loader version 4.3 Copyright (C) 2000-2003 Broadcom Corporation. Portions Copyright (C) 2002-2014 NetApp, Inc. All Rights Reserved.
CPU Type: Intel(R) Xeon(R) CPU C3528 @ 1.73GHz BIOS POST Failure(s) detected. Abort AUTOBOOT LOADER-A>
02A1: SP Not Found is worrying and initially I thought that it's faulty but it's working when I connect to via ssh and using ctrl+g on console.
Also from loader-a I can check it's status:
LOADER-A> sp status Firmware Version: 2.2.3 Ethernet Link: up, 100 Mb, full duplex, auto-neg complete Mgmt MAC Address: 00:A0:98:3F:D5:2E IPv4 Settings Using DHCP: NO IP Address: 192.168.100.10 Netmask: 255.255.255.0 Gateway: 192.168.100.1 IPv6: Disabled LOADER-A>
I've powered down whole chassis, unplugged module and plugged again but that didn't help.
After complete reboot SP and array lost current time and I was able to set time via LOADER-A and SP got it instantly so there's communication.
Show devices command:
LOADER-A*> show devices Device Name Description ----------- --------------------------------------------------------- sp0a Service Processor: Console at 0x3F8, PSI at 0x2F8 clock0a ISA RTC at 0x70 (index) and 0x71 (target) kcs0a KCS at 0xca3 (command) and 0xca2 (data) u0a.0 MICRON eUSB DISK-(USB 2.0) boot0 u0a.0 alias boot device boot_i u0a.0 alias boot device e0M IBA GE Slot 0500 v1353 (00-A0-98-3F-D5-2D) e0P IBA GE Slot 0700 v1353 (00-A0-98-3F-D5-2C) e0a IBA GE Slot 0402 v1353 (00-A0-98-3F-D5-28) e0b IBA GE Slot 0403 v1353 (00-A0-98-3F-D5-29) e0c IBA GE Slot 0400 v1353 (00-A0-98-3F-D5-2A) e0d IBA GE Slot 0401 v1353 (00-A0-98-3F-D5-2B)
I've tried sp reset from loader but that didn't worked but resetting from SP via sp reboot command worked.
From SP commandline:
SP netapp1> system fru list
FRU ID Name ============================================== IPMI session creation failed - err(0x0001)
SP netapp1> system sensors Sensor Name | Current | Unit | Status | LCR | LNC | UNC | UCR -----------------+------------+------------+------------+-----------+-----------+-----------+----------- Error: Unable to establish LAN session Get Device ID command failed Unable to open SDR for reading
Dunno what else to check but I have access to LOADER and SP so if someone can guide me then I can check other things but I must admit it turns out that my knowledge about netapp is pretty limited so I'm seeking help form more experienced users here. Any idea would be appreciated.
Yeah I've tried that before I posted any question here.
It's old archival system and we don't have support active since 2019 that's why I'm asking here counting that someone kind
Like I mentioned I've tried to boot all available options and non of them worked.
When machine is starting form scratch (powered down completely) it's not giving me this "PANIC : The 0 is not a supported platform" only SP not found error.
BIOS is scanning PCI Option ROMs, this may take a few seconds... WARNING 02A1: SP Not Found ERROR No Response to Controller FRU ID Read Request via IPMI ERROR No Response to Midplane FRU ID Read Request via IPMI
Also since second controller is working is there a way to somehow force it to serve volumes which were on broken one?
Right now it's saying netapp2(takeover) but I can't see volumes from broken one on it and I can login via netapp web based tool cos it's complaining that netapp1 isn't responding.
In that case Ethernet network is not a big problem
You got to check your SAN.
Also your hosts must have multipath in place to find the LUNs through the partner controller (online one).
Is there actually any issue with your LUNs access?
Regarding the Ethernet network you won’t see the adapters as it was before. If you are missing any IP from the impaired controller, you must the it manually via ifconfig command and add it to the /etc/rc
I am a bit confuse now. Your first issue was the controller not booting. If you followed the KB, now you would need NetApp assistance.
Besides that what else is not working as you need?
If you swap which controller is in which slot, does the same slot fail to boot the other controller and it works ok in the other one?
If that is the case, replace the chassis - while not supported to do this in production, you can use any DS2246 chassis.
If the error moves with the controller, first open it up and remove the coin cell battery and then reinstall it, but if the error persists, replace the controller - they're about $300-400 on ebay, and with 8.1, the licenses aren't serial number locked - you will need to reassign disks from maintenance mode and destroy mailboxes and reestablish HA, but that should be it.
Yup it's not booting and that's major issue but like I said it's archival infrastructure so if I'll be able to start it with one controller only that will be fine for now.
Of course it would be nice to fix that even from knowledge gaining perspective but in most similar post the solution is to either contact support and/or replace the hardware which in my case is impossible cos no one will pay for that.
Regarding LUN access there's was an issue. My Dell machines didn't boot but I've managed to fix WWPN and lun ID in emulex cards and bring up whole environment.