<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Snapprotect - Snapshot Catalogs in Data Protection</title>
    <link>https://community.netapp.com/t5/Data-Protection/Snapprotect-Snapshot-Catalogs/m-p/69492#M4831</link>
    <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Some more info (answering some of my own questions):&lt;/P&gt;&lt;P&gt;I started testing with a smaller volume (~800,000 files) and watched the CVNasFileScan.log file while a few jobs ran.&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Ran a "Full" backup with Cataloging on. It seemed to take about 5 minutes to catalog things (seems faster than the other volume, but I'll ignore that for now). I watched the index process in the log.&lt;/LI&gt;&lt;LI&gt;Next ran an "Incremental" backup with Cataloging on. The log noted it was doing a SnapDiff (presumably comparing to the last full). It found no changed files, reported indexing as successful, and took very little time. OK, so it looks like this does what I thought.&lt;/LI&gt;&lt;LI&gt;Next ran another "Full", as I presumed this would do another full catalog. I was surprised to see that the indexer did another SnapDiff (comparing to ???), found no changed files, reported indexing as successful, and took very little time.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;So maybe I'm overthinking this... if I get through one catalog of a volume with a large number of files, presumably each later catalog run will only process changed files. That makes me happy, but I'm still confused.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
    <pubDate>Fri, 06 Jun 2014 17:06:09 GMT</pubDate>
    <dc:creator>MURRAYCH1</dc:creator>
    <dc:date>2014-06-06T17:06:09Z</dc:date>
    <item>
      <title>Snapprotect - Snapshot Catalogs</title>
      <link>https://community.netapp.com/t5/Data-Protection/Snapprotect-Snapshot-Catalogs/m-p/69487#M4830</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Hey All,&lt;/P&gt;&lt;P&gt;I'm working on a new SnapProtect 10 SP6 setup and battling a lack of information.&lt;/P&gt;&lt;P&gt;Here is the scenario: I have an office and a datacenter about 500 km apart, connected by a 100 Mbit WAN link. I am trying to build a protection scenario where a CIFS volume in the office gets hourly / daily / weekly snapshots, and a copy is SnapVaulted to the datacenter weekly. So far I have that working. Eventually I might add a cut to tape from the datacenter as an additional layer, but for the moment it's not in play. The primary SnapProtect server / Media Agent is in the datacenter.&lt;/P&gt;&lt;P&gt;My question has to do with catalogs. The volume in play has over 7 million files on it. If I configure the jobs to perform cataloging, the indexing process takes many, many hours to complete. When the goal is hourly snapshots, this is obviously unworkable.&lt;/P&gt;&lt;P&gt;At first I wondered if I even needed catalogs, but it seems that without them you lose a lot of the search capabilities when it comes to restores. If I know the filenames and their locations I can obviously just mount a snap to restore, but we frequently get much fuzzier restore requests.&lt;/P&gt;&lt;P&gt;So I need to figure out a way to improve the performance of cataloging. Here's what I've tried:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Added a Media Agent in the office and configured things so the indexing happens there (taking the WAN link out of the equation). This roughly doubled performance, but it's still taking hours.&lt;/LI&gt;&lt;LI&gt;Configured jobs so cataloging happens only on the weekly snaps, and not on the dailies and hourlies. This at least lets the hourlies complete. It's an OK solution, but I suspect I'll only have search capabilities for the weekly backups, which isn't all that great.&lt;/LI&gt;&lt;LI&gt;I've read that if you configure the hourly / daily jobs as Incremental and the weeklies as Full, cataloging will somehow only process the changed data (and therefore complete much faster). I'm going to try this but have no data yet. I'm completely perplexed by how a backup can be incremental when a snapshot is always more or less a "full" copy...&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;I'm wondering if anyone has any tips, best practices, or advice with regard to configuring SnapProtect jobs for cataloging. One of the volumes I'm eventually going to need to add here has 26 million files in it (almost 4 times as many as the one I'm struggling with now!)&lt;/P&gt;&lt;P&gt;Any help would be appreciated.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Thu, 05 Jun 2025 05:34:48 GMT</pubDate>
      <guid>https://community.netapp.com/t5/Data-Protection/Snapprotect-Snapshot-Catalogs/m-p/69487#M4830</guid>
      <dc:creator>MURRAYCH1</dc:creator>
      <dc:date>2025-06-05T05:34:48Z</dc:date>
    </item>
    <item>
      <title>Re: Snapprotect - Snapshot Catalogs</title>
      <link>https://community.netapp.com/t5/Data-Protection/Snapprotect-Snapshot-Catalogs/m-p/69492#M4831</link>
      <description>&lt;HTML&gt;&lt;HEAD&gt;&lt;/HEAD&gt;&lt;BODY&gt;&lt;P&gt;Some more info (answering some of my own questions):&lt;/P&gt;&lt;P&gt;I started testing with a smaller volume (~800,000 files) and watched the CVNasFileScan.log file while a few jobs ran.&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Ran a "Full" backup with Cataloging on. It seemed to take about 5 minutes to catalog things (seems faster than the other volume, but I'll ignore that for now). I watched the index process in the log.&lt;/LI&gt;&lt;LI&gt;Next ran an "Incremental" backup with Cataloging on. The log noted it was doing a SnapDiff (presumably comparing to the last full). It found no changed files, reported indexing as successful, and took very little time. OK, so it looks like this does what I thought.&lt;/LI&gt;&lt;LI&gt;Next ran another "Full", as I presumed this would do another full catalog. I was surprised to see that the indexer did another SnapDiff (comparing to ???), found no changed files, reported indexing as successful, and took very little time.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;So maybe I'm overthinking this... if I get through one catalog of a volume with a large number of files, presumably each later catalog run will only process changed files. That makes me happy, but I'm still confused.&lt;/P&gt;&lt;/BODY&gt;&lt;/HTML&gt;</description>
      <pubDate>Fri, 06 Jun 2014 17:06:09 GMT</pubDate>
      <guid>https://community.netapp.com/t5/Data-Protection/Snapprotect-Snapshot-Catalogs/m-p/69492#M4831</guid>
      <dc:creator>MURRAYCH1</dc:creator>
      <dc:date>2014-06-06T17:06:09Z</dc:date>
    </item>
  </channel>
</rss>

