Tools for de-duplicating file system data
Duperemove is a simple tool for finding duplicated extents and submitting them for deduplication. When given a list of files, it hashes their contents block by block and compares those hashes to each other, finding and categorizing blocks that match. When given the -d option, duperemove submits those extents for deduplication using the Linux kernel extent-same ioctl.
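The block-by-block comparison described above can be illustrated with a minimal sketch. This is not duperemove's implementation: the function name `find_duplicate_blocks`, the 128 KiB block size, and the use of SHA-256 are all assumptions chosen for illustration, and the sketch only reports matching blocks without asking the kernel to deduplicate anything.

```python
import hashlib
from collections import defaultdict

BLOCK_SIZE = 128 * 1024  # assumed block size for this sketch only


def find_duplicate_blocks(paths, block_size=BLOCK_SIZE):
    """Map each block digest to the (path, offset) locations that share it."""
    locations = defaultdict(list)
    for path in paths:
        with open(path, "rb") as f:
            offset = 0
            while True:
                block = f.read(block_size)
                if not block:
                    break
                # Hash each block independently so identical blocks collide.
                digest = hashlib.sha256(block).hexdigest()
                locations[digest].append((path, offset))
                offset += len(block)
    # Keep only digests that were seen at more than one location.
    return {d: locs for d, locs in locations.items() if len(locs) > 1}
```

Calling `find_duplicate_blocks(["a.img", "b.img"])` (hypothetical file names) would return a dictionary whose values list the file offsets holding identical data, which is the kind of grouping a deduplication step could then act on.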
Duperemove can store the hashes it computes in a hash file. If given an existing hash file, duperemove will only compute hashes for those files which have changed since the last run. Thus you can run duperemove repeatedly on your data as it changes, without having to re-checksum unchanged data.
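The idea of skipping unchanged files on later runs can be sketched as a small cache. This is a hedged illustration, not duperemove's hash file format or its change-detection logic: the `update_hash_cache` helper, the in-memory dictionary, and the choice of file size plus modification time as the change signal are all assumptions.

```python
import hashlib
import os


def file_digest(path):
    """Return the SHA-256 hex digest of a file, read in 1 MiB chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()


def update_hash_cache(paths, cache):
    """Re-hash only files whose size or mtime differs from the cached entry.

    `cache` maps path -> {"signature": (size, mtime_ns), "digest": str}
    and is updated in place. Returns the list of paths that were re-hashed.
    """
    rehashed = []
    for path in paths:
        st = os.stat(path)
        signature = (st.st_size, st.st_mtime_ns)
        entry = cache.get(path)
        if entry is None or entry["signature"] != signature:
            cache[path] = {"signature": signature, "digest": file_digest(path)}
            rehashed.append(path)
    return rehashed
```

On a first call every file is hashed; on a second call with the same `cache`, only files that changed in the meantime are read again.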
Duperemove can also take input from the fdupes program.
- Website: https://github.com/markfasheh/duperemove
- Licenses: GPL 2
- Package source: gnu/packages/disk.scm
- Builds: See build status
- Issues: See known issues
Install duperemove 0.11.3 as follows:
guix install duperemove@0.11.3
Or install the latest version:
guix install duperemove
You can also install packages in augmented, pure or containerized environments for development or simply to try them out without polluting your user profile. See the guix shell documentation for more information.