426
Compression! (lemmy.dbzer0.com)
top 29 comments
sorted by: hot top controversial new old
[-] xoggy@programming.dev 74 points 3 weeks ago

Turns out we're already doing a lot of compression at the file and filesystem level.

[-] Opisek@piefed.blahaj.zone 33 points 3 weeks ago

Not necessarily. For example, you can't really compress encrypted files. You can certainly try but the result will likely be what the meme portrays.

[-] Randelung@lemmy.world 32 points 3 weeks ago

Turns out pseudo random byte streams don't really repeat that often.

[-] SkyeStarfall 4 points 3 weeks ago

We just need to develop an algorithm to compress random byte streams, easy

[-] WolfLink@sh.itjust.works 7 points 3 weeks ago

Media files are always thoroughly compressed (except in certain settings like professional video and audio work).

[-] rainwall@piefed.social 4 points 3 weeks ago

Media files can can benefit from a codec change. Going from h264 to h265/hevc can net a 30-50% reduction in size for almost no quality loss.

The only trade off is increased cpu usage if the client doesnt have hardware h265 support and the time to do the transcoding.

[-] Opisek@piefed.blahaj.zone 1 points 3 weeks ago

I don't see how that relates to encryption.

[-] WolfLink@sh.itjust.works 2 points 3 weeks ago

It’s more relevant to the previous comment as an example of how we are doing a lot of compression at the filesystem level.

The files that are typically largest are already quite thoroughly compressed.

[-] MrLLM@ani.social 20 points 3 weeks ago* (last edited 3 weeks ago)

Plot twist: Your file manager reports directories in GB and files in GB converted to GiB with GB as units

~/j~

[-] addie@feddit.uk 19 points 3 weeks ago

Well, yeah. The real advantage is only having a single file to transfer, makes eg. SFTP a lot less annoying at the command line.

Lossless compression works by storing redundant information more efficiently. If you've got 50 GB in a directory, it's going to be mostly pictures and videos, because that would be an incredible amount of text or source code. Those are already stored with lossy compression, so there's just not much more you can squeeze out.

I suppose you might have 50 GB of logs, especially if you've a logserver for your network? But most modern logging stores in a binary format, since it's quicker to search and manipulate, and doesn't use up such a crazy amount of disk space.

[-] webhead@sh.itjust.works 10 points 3 weeks ago

I actually just switched a backup script of mine from using a tar.gz to just a regular tar file. It's a little bigger but overall, the process is so much faster I don't even care about the tiny extra bit of compression (100gb vs 120gb transferred over a 1gbit connection). The entire reason I do this is, like you said, transferring files over the Internet is a billion times faster as one file, BUT you don't need the gzip step just for that

[-] bjoern_tantau@swg-empire.de 2 points 3 weeks ago

You probably know of this already, but you might consider using Borg Backup instead. It only sends the changed bits which is even faster.

[-] webhead@sh.itjust.works 2 points 3 weeks ago

I'll have to take a look. I was trying to keep it simple but I'm not opposed to giving that tool a shot. Thanks. :)

[-] aev_software@programming.dev 4 points 3 weeks ago

Meh, that barely fits an empty MS Word doc...

/s

[-] Honytawk@feddit.nl 1 points 3 weeks ago

Can we transcribe it to a QR code?

[-] dontsayaword@piefed.social 17 points 3 weeks ago
[-] menas@lemmy.wtf 16 points 3 weeks ago

haha. I spent so much time looking for an efficient compression algorithm for audio and movies ... until I finally understood there are already compressed X) However it allow me to discover zst, wich is uncredible with text :

 35M test.log
801K test.7z
  32 test.log.7z
  46 test.log.tar.bz2
  45 test.log.tar.gz
 108 test.log.tar.xz
  22 test.log.tar.zst
2,1M test.tar.bz2
8,1M test.tar.gz
724K test.tar.xz
1,1M test.tar.zst
8,1M test.zip
[-] UnrepentantAlgebra@lemmy.world 10 points 3 weeks ago

Am I reading that correctly that test.log.7z compressed 35 mB of text into 32 bytes?

[-] rbn@sopuli.xyz 7 points 3 weeks ago

And .tar.zst into 22!?

What was that log file? Millions of identical characters?

[-] Tangent5280@lemmy.world 1 points 3 weeks ago

Email written only with the letter 'P'

[-] menas@lemmy.wtf 2 points 3 weeks ago

I shall have check. No for some reason some archives failed. Not 7z, but it size is 800kB :

35M test.log
804K test.7z    
8,1M test.tar.gz
8,1M test.zip
724K test.tar.xz
2,1M test.tar.bz2
1,1M test.tar.zst
[-] sobchak@programming.dev 8 points 3 weeks ago

Yeah, I found zst recently as well. Quite a bit faster than xz/7z which is what I previously used (or gz when I just needed something fast).

[-] your_good_buddy@lemmy.world 14 points 3 weeks ago

"Look what they need to mimic a fraction of our compression ratio." - tar cJf said to tar cjf

[-] Scubus@sh.itjust.works 11 points 3 weeks ago

I dont remember him crushing a tank, is this an edit or did they release another watchmen movie?

[-] SacralPlexus@lemmy.world 16 points 3 weeks ago

It’s from the 2009 movie, during the montage where he is on Mars and is recounting to himself how he got his powers and his relationship with Janey. Timestamp is 1:10:08.

[-] Scubus@sh.itjust.works 6 points 3 weeks ago

Ah, thanks. Figured it was just an edit of the climax of the movie. You know the one

[-] golden_zealot@lemmy.ml 4 points 3 weeks ago

I believe it was a short clip shown when they went into Dr. Manhattans back story.

[-] Randelung@lemmy.world 10 points 3 weeks ago

compress ze file

[-] Evotech@lemmy.world 8 points 3 weeks ago

Me compressing binary data

this post was submitted on 12 Nov 2025
426 points (100.0% liked)

iiiiiiitttttttttttt

1381 readers
138 users here now

you know the computer thing is it plugged in?

A community for memes and posts about tech and IT related rage.

founded 7 months ago
MODERATORS