Experiences with zfs deduplication?

needanke@feddit.org · 1 month ago

Experiences with zfs deduplication?

pimeys@lemmy.nauk.io · 1 month ago

You should maybe read about the use cases for deduplication before using it. Here’s one recent article:

https://despairlabs.com/blog/posts/2024-10-27-openzfs-dedup-is-good-dont-use-it/

If you mostly store legit Blu-ray rips, the answer is probably no, you should not use zfs deduplication.

friend_of_satan@lemmy.world · edit-2 1 month ago

I was also going to link this. I started using zfs 10-ish years ago and used dedup when it came out, and it was really not worth it except for archiving a bunch of stuff I knew had gigs of duplicate data. Performance was so poor.

Cousin Mose@lemmy.hogru.ch · 1 month ago

I’m in almost the exact same situation as OP, 8 TB of raw Blu-ray dumps except I’m on XFS. I ran duperemove and freed ~200 GB.

needanke@feddit.org · 1 month ago

I think I was a bit unclear on that, I meant uncompressed rips as in I ripped the relevant media to unkompressed mkvs, I didn’t save the entire disk dump. I also have mostly such rips, but also a bit of media from other sourches ™ which is already compressed. So I suspect my results would be even worse.

Cousin Mose@lemmy.hogru.ch · 1 month ago

I agree. Most of my duplicates came from the raw disc files. I too dump some content to MKV (mainly TV episodes) but those files likely have much less duplication, though I do recall some of the duplicates coming from The Office in MKV.

(I do wonder if those The Office duplicates were something like the opening title, or scenes from the episode showing clips from previous episodes because it seems highly unlikely that the raw video streams were similar.)