<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Linux DB service was down, after takeover operated. in ONTAP Discussions</title>
    <link>https://community.netapp.com/t5/ONTAP-Discussions/Linux-DB-service-was-down-after-takeover-operated/m-p/99298#M20184</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;One of my customer who has been using NetApp Storage LUN with various OS (Windows, Solaris, AIX, Linux etc..).&lt;/P&gt;&lt;P&gt;At that time, NetApp recommaned to upgrade their Ontap OS 8.1.4P2 -&amp;gt; 8.1.4P6.&lt;/P&gt;&lt;P&gt;So, I decided to upgrade both controller using non-disruptive opertation.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Controller 1 is for SAN / Controller 2 is for NAS Service. Following our customer's opinion, we divided the controller for uses.&lt;/P&gt;&lt;P&gt;Here is the problem.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Controller 2 which has been using NAS service took over Controller1 (SAN), FCP Services was down and up for few second during taking over.&lt;/P&gt;&lt;P&gt;That time, I/O error was ouccred on the DB server side, which has been using NetApp LUN. cuz DB process was down too.&lt;/P&gt;&lt;P&gt;More specific when server detected a FC disconnection between server and storage lun, DB dumpted large dump log, made full status to server volume.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Long story short, Detecting FC disconnection -&amp;gt; server dumpted large log to volume -&amp;gt; volume was full status -&amp;gt; DB Process was down.&lt;/P&gt;&lt;P&gt;NetApp support gave me a solution to change a LUN timeout value on both server side, default (30 second) &amp;nbsp;to 120 second.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Even though &amp;nbsp;the value of LUN connection time out was default (30 second), why the LUN was disconnected? &amp;nbsp;FCP service was down for just few second.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;[root@redhat-cn ~]# cat /sys/block/sdX/device/timeout&lt;/P&gt;&lt;P&gt;[root@redhat-cn ~]# echo 120 &amp;gt; /sys/block/sdX/device/timeout // does this value make health check interval between two device?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I don't get any idea of this issue....&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Plz give me some help&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 05 Jun 2025 05:17:06 GMT</pubDate>
    <dc:creator>PATRICK_SEO</dc:creator>
    <dc:date>2025-06-05T05:17:06Z</dc:date>
    <item>
      <title>Linux DB service was down, after takeover operated.</title>
      <link>https://community.netapp.com/t5/ONTAP-Discussions/Linux-DB-service-was-down-after-takeover-operated/m-p/99298#M20184</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;One of my customer who has been using NetApp Storage LUN with various OS (Windows, Solaris, AIX, Linux etc..).&lt;/P&gt;&lt;P&gt;At that time, NetApp recommaned to upgrade their Ontap OS 8.1.4P2 -&amp;gt; 8.1.4P6.&lt;/P&gt;&lt;P&gt;So, I decided to upgrade both controller using non-disruptive opertation.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Controller 1 is for SAN / Controller 2 is for NAS Service. Following our customer's opinion, we divided the controller for uses.&lt;/P&gt;&lt;P&gt;Here is the problem.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Controller 2 which has been using NAS service took over Controller1 (SAN), FCP Services was down and up for few second during taking over.&lt;/P&gt;&lt;P&gt;That time, I/O error was ouccred on the DB server side, which has been using NetApp LUN. cuz DB process was down too.&lt;/P&gt;&lt;P&gt;More specific when server detected a FC disconnection between server and storage lun, DB dumpted large dump log, made full status to server volume.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Long story short, Detecting FC disconnection -&amp;gt; server dumpted large log to volume -&amp;gt; volume was full status -&amp;gt; DB Process was down.&lt;/P&gt;&lt;P&gt;NetApp support gave me a solution to change a LUN timeout value on both server side, default (30 second) &amp;nbsp;to 120 second.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Even though &amp;nbsp;the value of LUN connection time out was default (30 second), why the LUN was disconnected? &amp;nbsp;FCP service was down for just few second.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;[root@redhat-cn ~]# cat /sys/block/sdX/device/timeout&lt;/P&gt;&lt;P&gt;[root@redhat-cn ~]# echo 120 &amp;gt; /sys/block/sdX/device/timeout // does this value make health check interval between two device?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I don't get any idea of this issue....&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Plz give me some help&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 05 Jun 2025 05:17:06 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Discussions/Linux-DB-service-was-down-after-takeover-operated/m-p/99298#M20184</guid>
      <dc:creator>PATRICK_SEO</dc:creator>
      <dc:date>2025-06-05T05:17:06Z</dc:date>
    </item>
  </channel>
</rss>

