2014-11-11 04:40 AM
We used rsync to copy a volume containing MANY, MANY files (28,819,704) from an old EMC Celerra array to a NetApp FAS8060.
On the source, I have about 2 TB of data, which deduplication reduces to 1.4 TB.
On the destination, I'm already at 1.9 TB and rsync still hasn't finished, even though deduplication and compression report about the same percentage savings as on the EMC.
How is this possible? Is there some basic difference in block size?
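For anyone who wants to sanity-check these numbers, here is a rough back-of-the-envelope sketch in Python. It only uses the figures quoted above. The decimal-terabyte unit, the helper name `estimated_slack_bytes`, and the 4 KiB / 8 KiB block sizes are assumptions for illustration, not confirmed values for either array.

```python
# Figures quoted in the post. Decimal terabytes are an assumption.
TB = 10**12
file_count = 28_819_704
logical_bytes = 2 * TB          # data before deduplication on the source
stored_bytes_source = 1.4 * TB  # data after deduplication on the source

# Fraction of space saved by deduplication on the source.
savings_ratio = 1 - stored_bytes_source / logical_bytes

# Average file size, to see how small the files are.
avg_file_bytes = logical_bytes / file_count


def estimated_slack_bytes(n_files: int, block_size: int) -> int:
    """Rough estimate of space lost to partially filled last blocks.

    Assumes (hypothetically) that each file wastes half a block on average.
    """
    return n_files * block_size // 2


if __name__ == "__main__":
    print(f"source savings: {savings_ratio:.0%}")
    print(f"average file size: {avg_file_bytes / 1000:.1f} KB")
    # Illustrative block sizes only, not the arrays' actual values.
    for block_size in (4096, 8192):
        slack_gb = estimated_slack_bytes(file_count, block_size) / 10**9
        print(f"block size {block_size}: ~{slack_gb:.0f} GB of slack")
```

If the half-a-block assumption is anywhere near right, slack from block rounding alone looks too small to account for the gap, so it is worth checking whether the savings are actually being applied to the newly copied data.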
2014-11-12 02:59 AM
File-based copying programs are not necessarily deduplication-aware. You may have to run dedupe once after the copy is complete, unless you have enabled automatic schedules.
From TR-3958 (http://www.netapp.com/us/media/tr-3958.pdf):
Compression and deduplication share the same scheduler and can be scheduled to run in one of five ways:
Inline (compression only)
Scheduled on specific days and at specific times
Manually, by using the command line
Automatically, when 20% new data has been written to the volume
SnapVault software based, when used on a SnapVault destination
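To make the "20% new data" option concrete, here is a minimal Python sketch of a threshold check. The function name `auto_dedupe_due`, its parameters, and the choice to measure new data against the volume's used space are all assumptions for illustration; the excerpt above does not say exactly how the array measures the 20%.

```python
def auto_dedupe_due(new_bytes_since_last_run: int,
                    volume_used_bytes: int,
                    threshold: float = 0.20) -> bool:
    """Return True once newly written data reaches the threshold fraction.

    Hypothetical model: compares data written since the last run against
    the space currently used in the volume. The real trigger may differ.
    """
    if volume_used_bytes <= 0:
        return False
    return new_bytes_since_last_run / volume_used_bytes >= threshold


if __name__ == "__main__":
    GB = 10**9
    # 150 GB written into a volume holding 1000 GB: below 20%, no run yet.
    print(auto_dedupe_due(150 * GB, 1000 * GB))
    # 250 GB written into the same volume: above 20%, a run would be due.
    print(auto_dedupe_due(250 * GB, 1000 * GB))
```

Under this model, data landing in a volume during a long rsync would sit unprocessed until a run is triggered or started by hand, which fits the advice above to run dedupe once the copy is complete.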