<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: HA Broken, want to confirm steps in ONTAP Hardware</title>
    <link>https://community.netapp.com/t5/ONTAP-Hardware/HA-Broken-want-to-confirm-steps/m-p/141637#M8945</link>
    <description>&lt;P&gt;Yes - there is a degree of BIOS assisted memory partitioning performed in HA mode so it needs to be set before ONTAP boots. I've reviewed available internal documentation and there does not appear to be a workaround.&lt;/P&gt;</description>
    <pubDate>Mon, 23 Jul 2018 06:15:01 GMT</pubDate>
    <dc:creator>AlexDawson</dc:creator>
    <dc:date>2018-07-23T06:15:01Z</dc:date>
    <item>
      <title>HA Broken, want to confirm steps</title>
      <link>https://community.netapp.com/t5/ONTAP-Hardware/HA-Broken-want-to-confirm-steps/m-p/141624#M8942</link>
      <description>&lt;P&gt;I'll try to be brief, but that's almost impossible for me, so I'll apologize now. TL;DR: I'm sorry.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We have an HA-pair of 2240-4s in our HQ. We are running Ontap 8.2.5P1 &lt;FONT size="3"&gt;&lt;STRONG&gt;7-Mode&lt;/STRONG&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;. I know. And that's why I put it in bold because, being ancient, much of the advise I got from the NetApp Technical Support staff was &lt;EM&gt;slightly&lt;/EM&gt; incorrect, because the information they researched, and KB links were for Cluster Mode. So when I, or we, would walk through the steps of what I'm about to describe, things didn't work as expected because of syntax changes between the two Ontap versions.&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="3"&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;To break it down, we suffered a major problem a couple weeks ago when we lost the air conditioning to the server room, resulting in filer A losing six disks (not all at once, but over the time it took to pull the trigger on remotely powering off our server room. In the meantime, everything was overheating and we suffered a WAFL inconsistency.&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="3"&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;NetApp techs were great and dedicated, I want to make that clear. But the issue was (and rightly so) everyone just assumes you're running cluster mode.&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="3"&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;Now, my question. In all the rukus, HA got broken, and NetApp's determination is that options for cf.mode is set to HA on both filers. But the chassis NVRAM reports A as being in Non-HA mode. At the end of this post, I'll paste the options cf output so you can see.&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="3"&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;Therefore, I've been provided a KB which I've tentatively scheduled for next Tuesday night. It involves opening up the chassis and removing the batteries from CMOS and NVRAM and clearing it out. It will cause an outage.My only concern at this point is I'm not familiar with the internals of a 2240, never opened one up, and this is the first time I've dealt with this issue...and all the articles I seem to find are for other scenarios, filers and, of course, clustered mode. I just want to confirm this seems like the appropriate solution, or the possibility of an online solution or PROM setting that could avoid any downtime. I love the overtime, but I also love sleep. And I want to stress, the engineer who has ownership has been great, available, informative and dedicated. I'm not questioning their (and their team's) remedy. But I have nothing to lose by asking the community.&lt;BR /&gt;&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="3"&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;Below is the article link and following that are the cf information from A and B. A is the one that had the problems.&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="3"&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;TIA,&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="3"&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;Steve&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;FONT size="3"&gt;&lt;FONT size="1 2 3 4 5 6 7"&gt;This is the article:&lt;/FONT&gt;&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Error message: Chassis FRU PROM write operation failed Replace the system chassis of controller&lt;/P&gt;
&lt;P&gt;&lt;A href="https://kb.netapp.com/app/answers/answer_view/a_id/1029050/loc/en_US#__highlight" target="_blank"&gt;https://kb.netapp.com/app/answers/answer_view/a_id/1029050/loc/en_US#__highlight&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Here are our options cf for A-filer, followed by B-filer (A is the one that suffered all the problems)&lt;/P&gt;
&lt;P&gt;--------------------------------------------------------------------------------------------------&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;A-filer&amp;gt; cf enable&lt;BR /&gt;Controller is in Non-HA mode.&lt;BR /&gt;A-filer&amp;gt; options.cf.enable true&lt;BR /&gt;options.cf.enable not found.&amp;nbsp; Type '?' for a list of commands&lt;BR /&gt;A-filer&amp;gt; options cf ?&lt;BR /&gt;Setting invalid option cf failed.&lt;BR /&gt;cf.giveback.auto.after.panic.takeover on&lt;BR /&gt;cf.giveback.auto.cancel.on_network_failure on&lt;BR /&gt;cf.giveback.auto.cifs.terminate.minutes 5&lt;BR /&gt;cf.giveback.auto.delay.seconds 600&lt;BR /&gt;cf.giveback.auto.enable&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; on&lt;BR /&gt;cf.giveback.auto.override.vetoes off&lt;BR /&gt;cf.giveback.auto.terminate.bigjobs off&lt;BR /&gt;cf.giveback.check.partner&amp;nbsp;&amp;nbsp;&amp;nbsp; on&lt;BR /&gt;cf.hw_assist.enable&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; off&lt;BR /&gt;cf.hw_assist.partner.address&lt;BR /&gt;cf.hw_assist.partner.port&lt;BR /&gt;cf.mode&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ha&lt;BR /&gt;cf.remote_syncmirror.enable&amp;nbsp; off&lt;BR /&gt;cf.sfoaggr_maxtime&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 120&lt;BR /&gt;cf.takeover.bypass_optimization off&lt;BR /&gt;cf.takeover.change_fsid&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; on&lt;BR /&gt;cf.takeover.detection.seconds 15&lt;BR /&gt;cf.takeover.on_disk_shelf_miscompare off&lt;BR /&gt;cf.takeover.on_failure&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; off&lt;BR /&gt;cf.takeover.on_network_interface_failure off&lt;BR /&gt;cf.takeover.on_network_interface_failure.policy all_nics&lt;BR /&gt;cf.takeover.on_panic&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; off&lt;BR /&gt;cf.takeover.on_reboot&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; off&lt;BR /&gt;cf.takeover.on_short_uptime&amp;nbsp; off&lt;BR /&gt;cf.takeover.use_mcrc_file&amp;nbsp;&amp;nbsp;&amp;nbsp; off&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;----------------------------------------------------------------------------------&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;B-filer&amp;gt; cf status&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;A-filer may be down, takeover disabled because of reason (takeover disabled by partner)&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;B-filer has disabled takeover by A-filer (unsynchronized log)&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;VIA Interconnect is up (link up).&lt;/STRONG&gt;&lt;BR /&gt;B-filer&amp;gt; options cf&lt;BR /&gt;cf.giveback.auto.after.panic.takeover on&lt;BR /&gt;cf.giveback.auto.cancel.on_network_failure on&lt;BR /&gt;cf.giveback.auto.cifs.terminate.minutes 5&lt;BR /&gt;cf.giveback.auto.delay.seconds 600&lt;BR /&gt;cf.giveback.auto.enable&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; on&lt;BR /&gt;cf.giveback.auto.override.vetoes off&lt;BR /&gt;cf.giveback.auto.terminate.bigjobs off&lt;BR /&gt;cf.giveback.check.partner&amp;nbsp;&amp;nbsp;&amp;nbsp; on&lt;BR /&gt;cf.hw_assist.enable&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; on&lt;BR /&gt;cf.hw_assist.partner.address 192.168.blah.blah&lt;BR /&gt;cf.hw_assist.partner.port&amp;nbsp;&amp;nbsp;&amp;nbsp; 4444&lt;BR /&gt;cf.mode&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ha&lt;BR /&gt;cf.remote_syncmirror.enable&amp;nbsp; off&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; (same value required in local+partner)&lt;BR /&gt;cf.sfoaggr_maxtime&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 120&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; (value might be overwritten in takeover)&lt;BR /&gt;cf.takeover.bypass_optimization off&lt;BR /&gt;cf.takeover.change_fsid&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; on&lt;BR /&gt;cf.takeover.detection.seconds 15&lt;BR /&gt;cf.takeover.on_disk_shelf_miscompare off&lt;BR /&gt;cf.takeover.on_failure&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; on&lt;BR /&gt;cf.takeover.on_network_interface_failure off&lt;BR /&gt;cf.takeover.on_network_interface_failure.policy all_nics&amp;nbsp;&amp;nbsp; (same value in local+partner recommended)&lt;BR /&gt;cf.takeover.on_panic&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; on&lt;BR /&gt;cf.takeover.on_reboot&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; off&lt;BR /&gt;cf.takeover.on_short_uptime&amp;nbsp; on&lt;BR /&gt;cf.takeover.use_mcrc_file&amp;nbsp;&amp;nbsp;&amp;nbsp; off&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; (value might be overwritten in takeover)&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Jun 2025 13:29:39 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Hardware/HA-Broken-want-to-confirm-steps/m-p/141624#M8942</guid>
      <dc:creator>Digriz60</dc:creator>
      <dc:date>2025-06-04T13:29:39Z</dc:date>
    </item>
    <item>
      <title>Re: HA Broken, want to confirm steps</title>
      <link>https://community.netapp.com/t5/ONTAP-Hardware/HA-Broken-want-to-confirm-steps/m-p/141632#M8943</link>
      <description>&lt;P&gt;Have you already tried "cf disable" on controller A yet?&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;There should be an FRU map on the top of the PCM (the slide out module). Otherwise this photo shows the insides - the NVRAM battery is the large black plastic unit at the bottom of the motherboard, while the CMOS one is the coin cell on the right bottom of the board&lt;/P&gt;
&lt;P&gt;&lt;IMG src="https://www.storagereview.com/images/StorageReview-NetApp-FAS2240-2-Controller-10GbE-8Gb-FC.jpg" border="0" /&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 23 Jul 2018 03:18:24 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Hardware/HA-Broken-want-to-confirm-steps/m-p/141632#M8943</guid>
      <dc:creator>AlexDawson</dc:creator>
      <dc:date>2018-07-23T03:18:24Z</dc:date>
    </item>
    <item>
      <title>Re: HA Broken, want to confirm steps</title>
      <link>https://community.netapp.com/t5/ONTAP-Hardware/HA-Broken-want-to-confirm-steps/m-p/141636#M8944</link>
      <description>&lt;P&gt;Thanks for the information and illustration! I just tried disable/enable and receive the same error:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;la-fas01-a&amp;gt; cf status&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;Indeterminate state.&amp;nbsp; Mode is HA, FRU value is non-HA.&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I was just hoping there would have been a way to issue a command from the SP, but I'm sure the way things are engineered, the system has to be off, just like changing a bios setting. And I strongly assumed that if this is the procedure I was given, that's the only way to accomplish this. And, to be honest, even if someone came up with a hack, I'd still probably stick with the official plan. Just wanted to see what was out there, you never know!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Again, thank you!&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Steve&lt;/P&gt;</description>
      <pubDate>Mon, 23 Jul 2018 05:56:03 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Hardware/HA-Broken-want-to-confirm-steps/m-p/141636#M8944</guid>
      <dc:creator>Digriz60</dc:creator>
      <dc:date>2018-07-23T05:56:03Z</dc:date>
    </item>
    <item>
      <title>Re: HA Broken, want to confirm steps</title>
      <link>https://community.netapp.com/t5/ONTAP-Hardware/HA-Broken-want-to-confirm-steps/m-p/141637#M8945</link>
      <description>&lt;P&gt;Yes - there is a degree of BIOS assisted memory partitioning performed in HA mode so it needs to be set before ONTAP boots. I've reviewed available internal documentation and there does not appear to be a workaround.&lt;/P&gt;</description>
      <pubDate>Mon, 23 Jul 2018 06:15:01 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Hardware/HA-Broken-want-to-confirm-steps/m-p/141637#M8945</guid>
      <dc:creator>AlexDawson</dc:creator>
      <dc:date>2018-07-23T06:15:01Z</dc:date>
    </item>
    <item>
      <title>Re: HA Broken, want to confirm steps</title>
      <link>https://community.netapp.com/t5/ONTAP-Hardware/HA-Broken-want-to-confirm-steps/m-p/141639#M8946</link>
      <description>&lt;P&gt;System cannot be "off" - HA mode is configured in maintenance mode boot. It still means outage for the controller in question.&lt;/P&gt;</description>
      <pubDate>Mon, 23 Jul 2018 06:23:34 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Hardware/HA-Broken-want-to-confirm-steps/m-p/141639#M8946</guid>
      <dc:creator>aborzenkov</dc:creator>
      <dc:date>2018-07-23T06:23:34Z</dc:date>
    </item>
  </channel>
</rss>

