<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic "Disk maint start" fails with "disk maint: Maximum number of disks testing ...." in ONTAP Hardware</title>
    <link>https://community.netapp.com/t5/ONTAP-Hardware/quot-Disk-maint-start-quot-fails-with-quot-disk-maint-Maximum-number-of-disks/m-p/68762#M6458</link>
    <description>&lt;P&gt;I have a disk that is periodically throwing not ready errors and threw a SAS bus error yesterday. The filer has not failed the disk yet, but it throws a clump of not ready errors every few hours. I can live with occasional not ready errors but not a SAS error (it also triggered an autosupport):&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN style="font-family: courier new,courier;"&gt;2&amp;gt;Mar 13 16:02:16&amp;nbsp; [esd-filer-1b:callhome.hm.sas.alert.major:CRITICAL]: Call home for SAS&amp;nbsp; Connectivity Monitor: DualPathToDiskShelf_Alert[50:05:0c:c1:02:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I tried "disk maint start", but nothing happens:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;gt; disk maint start -d 1c.02.17&lt;BR /&gt;*** You are about to mark the following file system disk(s) for copy,&amp;nbsp; ***&lt;BR /&gt;*** which will eventually result in them being removed from service&amp;nbsp;&amp;nbsp;&amp;nbsp; ***&lt;BR /&gt;&amp;nbsp; Disk /aggr1/plex0/rg2/1c.02.17&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; RAID Disk Device&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; HA&amp;nbsp; SHELF BAY CHAN Pool Type&amp;nbsp; RPM&amp;nbsp; Used (MB/blks)&amp;nbsp;&amp;nbsp;&amp;nbsp; Phys (MB/blks)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; --------- ------&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ------------- ---- ---- ---- ----- --------------&amp;nbsp;&amp;nbsp;&amp;nbsp; --------------&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; data&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1c.02.17&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1c&amp;nbsp;&amp;nbsp;&amp;nbsp; 2&amp;nbsp;&amp;nbsp; 17&amp;nbsp; SA:B&amp;nbsp;&amp;nbsp; -&amp;nbsp; BSAS&amp;nbsp; 7200 423111/866531584&amp;nbsp; 423946/868242816&lt;BR /&gt;***&lt;BR /&gt;Do you want to continue? y&lt;BR /&gt;disk maint: Maximum number of disks testing 1c.02.17&lt;/P&gt;
&lt;P&gt;&amp;gt; disk maint status&lt;/P&gt;
&lt;P&gt;[nothing]&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have 5 spares and my options appear to be set correctly:&lt;/P&gt;
&lt;TABLE&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD&gt;disk.maint_center.allowed_entries&lt;/TD&gt;
&lt;TD&gt;1&lt;/TD&gt;
&lt;TD&gt;(value might be overwritten in takeover)&lt;/TD&gt;
&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;disk.maint_center.enable&lt;/TD&gt;
&lt;TD&gt;on&lt;/TD&gt;
&lt;TD&gt;(value might be overwritten in takeover)&lt;/TD&gt;
&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;disk.maint_center.max_disks&lt;/TD&gt;
&lt;TD&gt;84&lt;/TD&gt;
&lt;TD&gt;(value might be overwritten in takeover)&lt;/TD&gt;
&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;disk.maint_center.rec_allowed_entries&lt;/TD&gt;
&lt;TD&gt;5&lt;/TD&gt;
&lt;TD&gt;(value might be overwritten in takeover)&lt;/TD&gt;
&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;disk.maint_center.spares_check&lt;/TD&gt;
&lt;TD&gt;on&lt;/TD&gt;
&lt;TD&gt;(value might be overwritten in takeover)&lt;/TD&gt;
&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;disk.recovery_needed.count&lt;/TD&gt;
&lt;TD&gt;5&lt;/TD&gt;
&lt;TD&gt;(value might be overwritten in takeover)&lt;/TD&gt;
&lt;TD&gt;
&lt;P&gt;(I don't know what this is but I think it's a cluster param)&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Meanwhile, I will try to fail the disk over to a spare and swap it out. It would still be nice to run a maintenance test on the disk, which includes a power cycle and might either mark the disk as truly bad or clear the problem (or maybe crash the SAS bus, who knows...). I don't have any remote hands to physically pull and reseat it, or I'd just do that.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Does anyone know what the "disk maint: Maximum number of disks testing" message means?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks, w&lt;/P&gt;</description>
    <pubDate>Thu, 05 Jun 2025 05:40:35 GMT</pubDate>
    <dc:creator>WSANDERSATFLEXERA</dc:creator>
    <dc:date>2025-06-05T05:40:35Z</dc:date>
    <item>
      <title>"Disk maint start" fails with "disk maint: Maximum number of disks testing ...."</title>
      <link>https://community.netapp.com/t5/ONTAP-Hardware/quot-Disk-maint-start-quot-fails-with-quot-disk-maint-Maximum-number-of-disks/m-p/68762#M6458</link>
      <description>&lt;P&gt;I have a disk that is periodically throwing not ready errors and threw a SAS bus error yesterday. The filer has not failed the disk yet, but it throws a clump of not ready errors every few hours. I can live with occasional not ready errors but not a SAS error (it also triggered an autosupport):&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN style="font-family: courier new,courier;"&gt;2&amp;gt;Mar 13 16:02:16&amp;nbsp; [esd-filer-1b:callhome.hm.sas.alert.major:CRITICAL]: Call home for SAS&amp;nbsp; Connectivity Monitor: DualPathToDiskShelf_Alert[50:05:0c:c1:02:&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I tried "disk maint start", but nothing happens:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;gt; disk maint start -d 1c.02.17&lt;BR /&gt;*** You are about to mark the following file system disk(s) for copy,&amp;nbsp; ***&lt;BR /&gt;*** which will eventually result in them being removed from service&amp;nbsp;&amp;nbsp;&amp;nbsp; ***&lt;BR /&gt;&amp;nbsp; Disk /aggr1/plex0/rg2/1c.02.17&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; RAID Disk Device&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; HA&amp;nbsp; SHELF BAY CHAN Pool Type&amp;nbsp; RPM&amp;nbsp; Used (MB/blks)&amp;nbsp;&amp;nbsp;&amp;nbsp; Phys (MB/blks)&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; --------- ------&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ------------- ---- ---- ---- ----- --------------&amp;nbsp;&amp;nbsp;&amp;nbsp; --------------&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; data&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1c.02.17&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1c&amp;nbsp;&amp;nbsp;&amp;nbsp; 2&amp;nbsp;&amp;nbsp; 17&amp;nbsp; SA:B&amp;nbsp;&amp;nbsp; -&amp;nbsp; BSAS&amp;nbsp; 7200 423111/866531584&amp;nbsp; 423946/868242816&lt;BR /&gt;***&lt;BR /&gt;Do you want to continue? y&lt;BR /&gt;disk maint: Maximum number of disks testing 1c.02.17&lt;/P&gt;
&lt;P&gt;&amp;gt; disk maint status&lt;/P&gt;
&lt;P&gt;[nothing]&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I have 5 spares and my options appear to be set correctly:&lt;/P&gt;
&lt;TABLE&gt;
&lt;TBODY&gt;
&lt;TR&gt;
&lt;TD&gt;disk.maint_center.allowed_entries&lt;/TD&gt;
&lt;TD&gt;1&lt;/TD&gt;
&lt;TD&gt;(value might be overwritten in takeover)&lt;/TD&gt;
&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;disk.maint_center.enable&lt;/TD&gt;
&lt;TD&gt;on&lt;/TD&gt;
&lt;TD&gt;(value might be overwritten in takeover)&lt;/TD&gt;
&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;disk.maint_center.max_disks&lt;/TD&gt;
&lt;TD&gt;84&lt;/TD&gt;
&lt;TD&gt;(value might be overwritten in takeover)&lt;/TD&gt;
&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;disk.maint_center.rec_allowed_entries&lt;/TD&gt;
&lt;TD&gt;5&lt;/TD&gt;
&lt;TD&gt;(value might be overwritten in takeover)&lt;/TD&gt;
&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;disk.maint_center.spares_check&lt;/TD&gt;
&lt;TD&gt;on&lt;/TD&gt;
&lt;TD&gt;(value might be overwritten in takeover)&lt;/TD&gt;
&lt;TD&gt;&amp;nbsp;&lt;/TD&gt;
&lt;/TR&gt;
&lt;TR&gt;
&lt;TD&gt;disk.recovery_needed.count&lt;/TD&gt;
&lt;TD&gt;5&lt;/TD&gt;
&lt;TD&gt;(value might be overwritten in takeover)&lt;/TD&gt;
&lt;TD&gt;
&lt;P&gt;(I don't know what this is but I think it's a cluster param)&lt;/P&gt;
&lt;/TD&gt;
&lt;/TR&gt;
&lt;/TBODY&gt;
&lt;/TABLE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Meanwhile, I will try to fail the disk over to a spare and swap it out. It would still be nice to run a maintenance test on the disk, which includes a power cycle and might either mark the disk as truly bad or clear the problem (or maybe crash the SAS bus, who knows...). I don't have any remote hands to physically pull and reseat it, or I'd just do that.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Does anyone know what the "disk maint: Maximum number of disks testing" message means?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks, w&lt;/P&gt;</description>
      <pubDate>Thu, 05 Jun 2025 05:40:35 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Hardware/quot-Disk-maint-start-quot-fails-with-quot-disk-maint-Maximum-number-of-disks/m-p/68762#M6458</guid>
      <dc:creator>WSANDERSATFLEXERA</dc:creator>
      <dc:date>2025-06-05T05:40:35Z</dc:date>
    </item>
    <item>
      <title>Re: "Disk maint start" fails with "disk maint: Maximum number of disks testing ...."</title>
      <link>https://community.netapp.com/t5/ONTAP-Hardware/quot-Disk-maint-start-quot-fails-with-quot-disk-maint-Maximum-number-of-disks/m-p/68766#M6459</link>
      <description>&lt;P&gt;To follow up: I swapped the disk with a spare using "disk replace". When I then tried to zero the flaky disk (now a spare), the NetApp failed it.&lt;/P&gt;&lt;P&gt;So zeroing the disk can serve the same purpose as running maintenance checks, provided the read errors are persistent enough.&lt;/P&gt;</description>
      <pubDate>Mon, 17 Mar 2014 20:41:54 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Hardware/quot-Disk-maint-start-quot-fails-with-quot-disk-maint-Maximum-number-of-disks/m-p/68766#M6459</guid>
      <dc:creator>WSANDERSATFLEXERA</dc:creator>
      <dc:date>2014-03-17T20:41:54Z</dc:date>
    </item>
  </channel>
</rss>

