<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic De-duping existing 6.5TB NAS volume - Should I do entire scan or just new writes ? in ONTAP Discussions</title>
    <link>https://community.netapp.com/t5/ONTAP-Discussions/De-duping-existing-6-5TB-NAS-volume-Should-I-do-entire-scan-or-just-new-writes/m-p/31417#M7444</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;I would appreciate any suggestions on this.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;&lt;STRONG style="text-decoration: underline;"&gt;Scenario:&lt;/STRONG&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;&lt;STRONG style="text-decoration: underline;"&gt;FAS3270-8.1.2&lt;BR /&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;1. Planning to dedupe an existing 6.5TB (nearly full) NAS volume [1.5TB trapped in snapshots], which has never been enabled for de-dupe.&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;2. 
This volume is snapmirrored to the DR filer on an hourly basis.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;I am estimating at least 30% savings on this dataset [6.5TB - 1.5TB = 5TB; 5TB x 30% = 1.5TB].&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;So I am hoping to save around 1.5TB. Considering this change in data volume size:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;1. What would the snapmirror impact be when I kick off the snapmirror process after the dedupe? [Assuming snapmirror will be on hold until the entire dedupe finishes]&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;2. This is a key volume (NAS share) for the organization. Is it worth doing an entire scan [de-dupe]? I am concerned about the [read] performance impact that it may have post de-dupe. Is it worth it, or should I just de-dupe the new writes that come in?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;I have read almost all the threads on performance-related issues, and they say there is no read performance impact; only new writes see a 7% impact.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;It would be great to hear of some use cases wherein a large (NAS) volume was de-duped from scratch. 
Did it impact read performance post-dedupe?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;Thanks,&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;-Ashwin&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 05 Jun 2025 05:39:20 GMT</pubDate>
    <dc:creator>ASHWINPAWARTESL</dc:creator>
    <dc:date>2025-06-05T05:39:20Z</dc:date>
    <item>
      <title>De-duping existing 6.5TB NAS volume - Should I do entire scan or just new writes ?</title>
      <link>https://community.netapp.com/t5/ONTAP-Discussions/De-duping-existing-6-5TB-NAS-volume-Should-I-do-entire-scan-or-just-new-writes/m-p/31417#M7444</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;I would appreciate any suggestions on this.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;&lt;STRONG style="text-decoration: underline;"&gt;Scenario:&lt;/STRONG&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;&lt;STRONG style="text-decoration: underline;"&gt;FAS3270-8.1.2&lt;BR /&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;1. Planning to dedupe an existing 6.5TB (nearly full) NAS volume [1.5TB trapped in snapshots], which has never been enabled for de-dupe.&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;2. 
This volume is snapmirrored to the DR filer on an hourly basis.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;I am estimating at least 30% savings on this dataset [6.5TB - 1.5TB = 5TB; 5TB x 30% = 1.5TB].&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;So I am hoping to save around 1.5TB. Considering this change in data volume size:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;1. What would the snapmirror impact be when I kick off the snapmirror process after the dedupe? [Assuming snapmirror will be on hold until the entire dedupe finishes]&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;2. This is a key volume (NAS share) for the organization. Is it worth doing an entire scan [de-dupe]? I am concerned about the [read] performance impact that it may have post de-dupe. Is it worth it, or should I just de-dupe the new writes that come in?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;I have read almost all the threads on performance-related issues, and they say there is no read performance impact; only new writes see a 7% impact.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;It would be great to hear of some use cases wherein a large (NAS) volume was de-duped from scratch. 
Did it impact read performance post-dedupe?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;Thanks,&lt;/P&gt;&lt;P style="background-color: #eef4f9; font-size: 12px; color: #454545; font-family: Arial, Helvetica, Verdana, sans-serif;"&gt;-Ashwin&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 05 Jun 2025 05:39:20 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Discussions/De-duping-existing-6-5TB-NAS-volume-Should-I-do-entire-scan-or-just-new-writes/m-p/31417#M7444</guid>
      <dc:creator>ASHWINPAWARTESL</dc:creator>
      <dc:date>2025-06-05T05:39:20Z</dc:date>
    </item>
    <item>
      <title>Re: De-duping existing 6.5TB NAS volume - Should I do entire scan or just new writes ?</title>
      <link>https://community.netapp.com/t5/ONTAP-Discussions/De-duping-existing-6-5TB-NAS-volume-Should-I-do-entire-scan-or-just-new-writes/m-p/31420#M7446</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi Ashwin&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P style="font-size: 12px; font-family: Arial, Helvetica, Verdana, sans-serif; color: #454545; background-color: #eef4f9;"&gt;1. What would the snapmirror impact be when I kick off the snapmirror process after the dedupe? [Assuming snapmirror will be on hold until the entire dedupe finishes]&lt;/P&gt;&lt;P style="font-family: Arial, Helvetica, Verdana, sans-serif; color: #454545; background-color: #ffffff;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Snapmirror will see this as changed data and will have an update equal to your dedupe savings.&amp;nbsp; You will also see your snapshot usage grow by the same amount as you dedupe, until those snapshots roll off.&lt;/P&gt;&lt;P style="font-size: 12px; font-family: Arial, Helvetica, Verdana, sans-serif; color: #454545; background-color: #eef4f9;"&gt;2. This is a key volume (NAS share) for the organization. Is it worth doing an entire scan [de-dupe]? I am concerned about the [read] performance impact that it may have post de-dupe. Is it worth it, 
or should I just de-dupe the new writes that come in?&lt;/P&gt;&lt;P style="font-family: Arial, Helvetica, Verdana, sans-serif; color: #454545; background-color: #ffffff;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Read performance won't be impacted, as reads go through cache, so any deduped data will be read into cache and served out from there.&amp;nbsp; And if a block isn't in cache, it won't take any longer to load a deduped block than an inflated block.&amp;nbsp; We have 14TB volumes deduped to just over 1TB with no noticeable difference.&lt;/P&gt;&lt;P style="font-size: 12px; font-family: Arial, Helvetica, Verdana, sans-serif; color: #454545; background-color: #eef4f9;"&gt;I have read almost all the threads on performance-related issues, and they say there is no read performance impact; only new writes see a 7% impact.&lt;/P&gt;&lt;P style="font-family: Arial, Helvetica, Verdana, sans-serif; color: #454545; background-color: #ffffff;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; I cannot speak to this, as we dedupe on a schedule, so there is no write impact any customer has been able to identify.&amp;nbsp; They also do not see any impact while the scheduled dedupe is running.&lt;/P&gt;&lt;P style="font-size: 12px; font-family: Arial, Helvetica, Verdana, sans-serif; color: #454545; background-color: #eef4f9;"&gt;It would be great to hear of some use cases wherein a large (NAS) volume was de-duped from scratch. Did it impact read performance post-dedupe?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; As I stated above, we have turned dedupe on for multiple 14TB volumes with a nearly 14:1 dedupe ratio in almost all cases on NFS data (all volumes contained extremely similar data) without the customer ever seeing any impact. 
I have also deduped 4TB of running ESX datastore data with no impact, and a 12TB volume that didn't dedupe very well at all, only about 5%, and again the customer never knew it had happened.&amp;nbsp; My most recent was nearly 9TB, deduped down to 5TB, and it was great that the customer actually asked me when I was going to run it... three days after the process had finished, dedupe was running on schedule, and the change had been closed out.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;- Scott&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Mon, 31 Mar 2014 20:01:14 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Discussions/De-duping-existing-6-5TB-NAS-volume-Should-I-do-entire-scan-or-just-new-writes/m-p/31420#M7446</guid>
      <dc:creator>cscott</dc:creator>
      <dc:date>2014-03-31T20:01:14Z</dc:date>
    </item>
    <item>
      <title>Re: De-duping existing 6.5TB NAS volume - Should I do entire scan or just new writes ?</title>
      <link>https://community.netapp.com/t5/ONTAP-Discussions/De-duping-existing-6-5TB-NAS-volume-Should-I-do-entire-scan-or-just-new-writes/m-p/31429#M7451</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;This is exactly what I needed to know. &lt;SPAN style="font-size: 10pt; line-height: 1.5em;"&gt;Thank you, Scott. I really appreciate it. [Glad to see that 14:1 ratio]&lt;/SPAN&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Wed, 02 Apr 2014 13:01:59 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Discussions/De-duping-existing-6-5TB-NAS-volume-Should-I-do-entire-scan-or-just-new-writes/m-p/31429#M7451</guid>
      <dc:creator>ASHWINPAWARTESL</dc:creator>
      <dc:date>2014-04-02T13:01:59Z</dc:date>
    </item>
  </channel>
</rss>

