<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Deduplication and 7zip files in ONTAP Discussions</title>
    <link>https://community.netapp.com/t5/ONTAP-Discussions/Deduplication-and-7zip-files/m-p/71365#M16633</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi Eugene,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks for the reply. My DB dumps are actually all from the same DB, so I assumed there would be a lot of duplicates. My guess is that the 7zip compression algorithm made each file unique... I don't know, but it is good to be aware of this.&amp;nbsp; &lt;SPAN __jive_emoticon_name="happy" __jive_macro_name="emoticon" class="jive_macro jive_emote" src="https://community.netapp.com/5.0.1/images/emoticons/happy.gif"&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;TingWei&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Thu, 17 Apr 2014 02:57:41 GMT</pubDate>
    <dc:creator>TINGWEI_LIM</dc:creator>
    <dc:date>2014-04-17T02:57:41Z</dc:date>
    <item>
      <title>Deduplication and 7zip files</title>
      <link>https://community.netapp.com/t5/ONTAP-Discussions/Deduplication-and-7zip-files/m-p/71351#M16631</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi guys,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have a volume filled with lots of 7zip files containing MSSQL dumps. The volume holds about 2000+ 7zip files, each a daily MSSQL dump of 500MB-1.5GB.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Initially I thought deduplication was going to save a lot, but it turned out I was wrong; the savings weren't much. Does that happen with most zip files? Has anyone experienced this before?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;sis status -l output&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;State:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Enabled&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Compression:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Disabled&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Inline Compression:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Disabled&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Status:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Idle&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Progress:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Idle for 08:19:57&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Type:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Regular&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Schedule:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; sun-sat@7&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Minimum Blocks Shared:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Blocks Skipped Sharing:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Last Operation State:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Success&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Last Successful Operation Begin: Tue Apr 15 07:00:00 MYT 2014&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Last Successful Operation End:&amp;nbsp;&amp;nbsp; Tue Apr 15 07:02:48 MYT 2014&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Last Operation Begin:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Tue Apr 15 07:00:00 MYT 2014&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Last Operation End:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Tue Apr 15 07:02:48 MYT 2014&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Last Operation Size:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 6978 MB&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Last Operation Error:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; -&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Change Log Usage:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0%&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Logical Data:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1345 GB/49 TB (3%)&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Queued Job:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; -&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;Stale Fingerprints:&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0%&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;df -sh output&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Filesystem&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; used&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; saved&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; %saved&lt;/P&gt;&lt;P&gt;/vol/myvolume/&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 1341GB&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 3772MB&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; 0%&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 05 Jun 2025 05:38:08 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Discussions/Deduplication-and-7zip-files/m-p/71351#M16631</guid>
      <dc:creator>TINGWEI_LIM</dc:creator>
      <dc:date>2025-06-05T05:38:08Z</dc:date>
    </item>
    <item>
      <title>Re: Deduplication and 7zip files</title>
      <link>https://community.netapp.com/t5/ONTAP-Discussions/Deduplication-and-7zip-files/m-p/71356#M16632</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;TingWei -&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Dedupe works on WAFL 4K blocks. There are probably not many duplicate blocks in your DB dumps, and it is even less likely that there would be duplicate blocks once the dumps are zipped.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If there were lots of duplicate files that had been zipped up, then dedupe would do great things.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I hope this response has been helpful to you.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;At your service,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Eugene E. Kashpureff&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Senior Consultant, K&amp;amp;H Research &lt;/SPAN&gt;&lt;A class="jive-link-external-small" href="http://www.khresear.ch/" target="_blank"&gt;http://www.khresear.ch/&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Senior Instructor, Unitek Education &lt;/SPAN&gt;&lt;A class="jive-link-external-small" href="http://www.unitek.com/training/netapp/" target="_blank"&gt;http://www.unitek.com/training/netapp/&lt;/A&gt;&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Tue, 15 Apr 2014 07:32:39 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Discussions/Deduplication-and-7zip-files/m-p/71356#M16632</guid>
      <dc:creator>ekashpureff</dc:creator>
      <dc:date>2014-04-15T07:32:39Z</dc:date>
    </item>
    <item>
      <title>Re: Deduplication and 7zip files</title>
      <link>https://community.netapp.com/t5/ONTAP-Discussions/Deduplication-and-7zip-files/m-p/71365#M16633</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi Eugene,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks for the reply. My DB dumps are actually all from the same DB, so I assumed there would be a lot of duplicates. My guess is that the 7zip compression algorithm made each file unique... I don't know, but it is good to be aware of this.&amp;nbsp; &lt;SPAN __jive_emoticon_name="happy" __jive_macro_name="emoticon" class="jive_macro jive_emote" src="https://community.netapp.com/5.0.1/images/emoticons/happy.gif"&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;TingWei&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 17 Apr 2014 02:57:41 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Discussions/Deduplication-and-7zip-files/m-p/71365#M16633</guid>
      <dc:creator>TINGWEI_LIM</dc:creator>
      <dc:date>2014-04-17T02:57:41Z</dc:date>
    </item>
    <item>
      <title>Re: Deduplication and 7zip files</title>
      <link>https://community.netapp.com/t5/ONTAP-Discussions/Deduplication-and-7zip-files/m-p/71370#M16634</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I just saw this and maybe it's already too late, but maybe it will help someone else.&lt;/P&gt;&lt;P&gt;Most compression algorithms work in such a way that even very small changes in the uncompressed file cause a "cascade" of differences throughout the compressed file. So even though the uncompressed source files might be 99% identical, the compressed files are totally different.&lt;/P&gt;&lt;P&gt;There is an option in newer versions of gzip called --rsyncable (&lt;A href="http://superuser.com/questions/636881/what-are-good-compression-algorithms-for-delta-synchronization" title="http://superuser.com/questions/636881/what-are-good-compression-algorithms-for-delta-synchronization" target="_blank"&gt;http://superuser.com/questions/636881/what-are-good-compression-algorithms-for-delta-synchronization&lt;/A&gt;) which slightly increases the size of the compressed archive, but also frequently resynchronizes the compressed output with the uncompressed input. With it, the compressed output stays much more similar even when the uncompressed input has small changes.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Give it a try and let us know if it helps.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Michael&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
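      <!--
A rough sketch of the effect described in this reply, not something taken from the thread. It assumes block-level dedupe can be approximated by hashing aligned 4 KiB blocks, uses Python's standard lzma module as a stand-in for 7-Zip (whose 7z format typically relies on the LZMA family), and builds two synthetic "daily dumps" that differ only in a few rows changed in place. The make_dump helper and its row format are hypothetical; real dumps with inserted rows would also shift block alignment in the raw files.

```python
import hashlib
import lzma
import random

BLOCK = 4096  # 4 KiB, the WAFL block size mentioned earlier in the thread


def block_hashes(data: bytes) -> list:
    """Fingerprint every aligned 4 KiB block (a crude model of block-level dedupe)."""
    return [hashlib.sha256(data[i:i + BLOCK]).digest()
            for i in range(0, len(data), BLOCK)]


def shared_fraction(old: bytes, new: bytes) -> float:
    """Fraction of the blocks in `new` that already exist somewhere in `old`."""
    known = set(block_hashes(old))
    fresh = block_hashes(new)
    return sum(h in known for h in fresh) / len(fresh)


def make_dump(changed_every: int = 0, rows: int = 20000) -> bytes:
    """Hypothetical fixed-width dump; every `changed_every`-th row is altered in place."""
    rng = random.Random(0)  # same seed, so both days start from identical rows
    lines = []
    for i in range(rows):
        value = rng.randrange(10**12)
        if changed_every and i % changed_every == 0:
            value = (value + 1) % 10**12  # same width, so block alignment is kept
        lines.append(f"{i:08d},{value:012d}\n")
    return "".join(lines).encode("ascii")


def compare() -> tuple:
    """Return (raw, compressed) shared-block fractions for two synthetic days."""
    day1 = make_dump()
    day2 = make_dump(changed_every=5000)
    raw = shared_fraction(day1, day2)
    # FORMAT_ALONE is the legacy single-stream .lzma container, assumed here for simplicity.
    packed = shared_fraction(lzma.compress(day1, format=lzma.FORMAT_ALONE),
                             lzma.compress(day2, format=lzma.FORMAT_ALONE))
    return raw, packed


if __name__ == "__main__":
    raw, packed = compare()
    print(f"raw dumps:        {raw:.0%} of 4 KiB blocks already present")
    print(f"compressed dumps: {packed:.0%} of 4 KiB blocks already present")
```

Under these assumptions, nearly all raw blocks of the second dump should already exist in the first, while the compressed copies should share essentially none, which is consistent with the near-zero savings reported at the top of the thread.
      -->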
      <pubDate>Tue, 13 May 2014 14:50:31 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Discussions/Deduplication-and-7zip-files/m-p/71370#M16634</guid>
      <dc:creator>MMUELLER_HC</dc:creator>
      <dc:date>2014-05-13T14:50:31Z</dc:date>
    </item>
    <item>
      <title>Re: Deduplication and 7zip files</title>
      <link>https://community.netapp.com/t5/ONTAP-Discussions/Deduplication-and-7zip-files/m-p/71375#M16635</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hey Michael,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks for the info! I will check with my client to see whether they are willing to use the --rsyncable option.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 30 May 2014 08:15:29 GMT</pubDate>
      <guid>https://community.netapp.com/t5/ONTAP-Discussions/Deduplication-and-7zip-files/m-p/71375#M16635</guid>
      <dc:creator>TINGWEI_LIM</dc:creator>
      <dc:date>2014-05-30T08:15:29Z</dc:date>
    </item>
  </channel>
</rss>

