918
submitted 2 days ago* (last edited 2 days ago) by misk@sopuli.xyz to c/technology@lemmy.world

edit: adjusted title slightly

all 40 comments
sorted by: hot top controversial new old
[-] dread@lemmy.world 23 points 1 day ago* (last edited 1 day ago)

What's frustrating is that the ones who claimed to have done this are self-proclaimed "hacktivists". You're stupid if you think the Internet Archive is the enemy in this day and age.

[-] Snapz@lemmy.world 43 points 1 day ago

Capitalism hates a memory. Hates/fears anything it can't update, whitewash or otherwise directly control or obscure after the fact.

If humanity had any hope, we'd surround this thing with torches to defend it tooth and nail.

[-] BitsAndBites@lemmy.world 9 points 1 day ago

Thanks, I just used their PayPal link to send my support and light my torch!

https://archive.org/donate/

[-] Lojcs@lemm.ee 130 points 2 days ago

...Google started adding links to archived websites in the Wayback Machine

They better be compensating it..

[-] lud@lemm.ee 26 points 1 day ago

I don't know if there is compensation but the internet archive says it's a collaboration and they seem to be happy about it.

https://blog.archive.org/2024/09/11/new-feature-alert-access-archived-webpages-directly-through-google-search/

[-] FlyingSquid@lemmy.world 48 points 1 day ago

I really hope the rest of the archive comes back soon. I was in the middle of a book and it was a book I hadn't read since I was a kid.

Yeah, I could pay for it or wait for it to come via interlibrary loan (it's not exactly a well-known book), but I really didn't need a physical copy. And it isn't even all that long.

Sigh.

[-] empireOfLove2@lemmy.dbzer0.com 26 points 1 day ago* (last edited 1 day ago)

Damn it'd be a shame if someone DM'ed me the name of the book and I had to go looking to see if there's an epub/pdf version available for download in certain places. A real shame indeed.

[-] FlyingSquid@lemmy.world 13 points 1 day ago

I don't care saying what book it is right here, because I've looked for both and came up wanting. It's not available normally as an ebook for purchase, so I have my doubts.

https://www.goodreads.com/book/show/997118.Doktor_Bey_s_handbooks_of_strange_sex

Basically, the IA had it because they scan in masses of texts without even caring what they are. As long as they get a copy and it isn't in the archive yet, they'll scan it in.

FWIW, it's pretty amusing.

[-] empireOfLove2@lemmy.dbzer0.com 17 points 1 day ago

Oh thats a super off the wall book. It barely exists anywhere let alone an ebook. I stand corrected and humbled.

[-] FlyingSquid@lemmy.world 12 points 1 day ago

It was found for me by someone else! I am amazed.

[-] empireOfLove2@lemmy.dbzer0.com 13 points 1 day ago

Damn! And I thought I knew all the weird nooks to find books online.... I have much to learn

[-] fossilesque@mander.xyz 9 points 1 day ago

I got you fam, dm you a link in 1 sec.

[-] FlyingSquid@lemmy.world 5 points 1 day ago

Wow! Thanks! I looked and looked!

[-] fossilesque@mander.xyz 12 points 1 day ago

Anna's Archive, just author's name search. :)

[-] Rai@lemmy.dbzer0.com 2 points 1 day ago

What a dope site!

[-] FlyingSquid@lemmy.world 2 points 1 day ago

Oh nice! I've never heard of that before. Bookmarked. Thanks again!

[-] LaunchesKayaks@lemmy.world 1 points 1 day ago

Can I get the link too? The book looks interesting

[-] small44@lemmy.world 9 points 1 day ago

That's why I download everything

[-] FlyingSquid@lemmy.world 5 points 1 day ago

Downloading books you have to borrow from the IA is not easy these days.

[-] Appoxo@lemmy.dbzer0.com 3 points 1 day ago

Other sides

[-] abofim@discuss.tchncs.de 64 points 2 days ago

op forgot to mention that it is a "provisional, read-only manner,” according to founder Brewster Kahle.

[-] argh_another_username@lemmy.ca 28 points 2 days ago

Ok, serious question. Why is it normally read/write? I’ve always treated it as being read only.

[-] TheLugal@lemmy.world 65 points 2 days ago

To you as a user it's readonly. To the thousands that submits urls for archival it is readwrite.

[-] antonim@lemmy.dbzer0.com 15 points 1 day ago

You can (well, could) put in any live URL there and IA would take a snapshot of the current page on your request. They also actively crawl the web and take new snapshots on their own. All of that counts as 'writing' to the database.

[-] SkaveRat@discuss.tchncs.de 6 points 1 day ago

Not just websites. Basically any digital media. From PDFs, book scans, manuals, floppy disks, CDs, basically anything even remotely worth archiving

[-] antonim@lemmy.dbzer0.com 2 points 1 day ago

Yep, but I didn't mention that because it's not a part of the "Wayback Machine", it's just the general "Internet Archive" business of archiving media, which is for now still completely unavailable. (I've uploaded dozens of public-domain books there myself, and I'm really missing it...)

[-] Corno@lemm.ee 10 points 1 day ago

Glad to see it's recovering. I hope the whole archive can come back up soon!

[-] leanleft@lemmy.ml 16 points 2 days ago

currently* back only as readonly

[-] argh_another_username@lemmy.ca 8 points 2 days ago

Ok, serious question. Why is it normally read/write? I’ve always treated it as being read only.

[-] altima_neo@lemmy.zip 22 points 2 days ago* (last edited 2 days ago)

I mean how else would they archive web sites or content?

[-] argh_another_username@lemmy.ca 8 points 1 day ago

I’ve always thought they were a crawler.

[-] kautau@lemmy.world 8 points 1 day ago

The Wayback machine is a crawler, which is big part of what they do but not everything. The Wayback machine crawls its own pages, but you can also submit URLs to be crawled.

The other part of what they do is hosting a significant number of digital archives of media that is no longer sold / in print / distributed. Much of that content is user uploaded. Like “oh hey I found this old clip art cd from the early 90s. I don’t really have a use for it, but if this doesn’t get uploaded somewhere it’s probably going to be lost to time. I’ll submit it to the internet archives.”

[-] pmc 2 points 1 day ago

They do some crawling themselves, but Archive Team (a third party group) does a lot of web archiving as well.

[-] BossDj@lemm.ee 8 points 2 days ago
[-] misk@sopuli.xyz 15 points 2 days ago* (last edited 2 days ago)

IA hosts TONS of user uploaded content. They’re not uploading those Gameboy ROMs themselves.

[-] v_krishna@lemmy.ml 6 points 1 day ago

Live music archive is still down for example 😞

[-] pmc 2 points 1 day ago

My most frequent use case of the IA in general is the Cover Art Archive, and I frequently upload cover art for albums to the CAA via MusicBrainz. That's how I discovered the IA was down, when an upload failed.

[-] LainTrain@lemmy.dbzer0.com 3 points 2 days ago
this post was submitted on 14 Oct 2024
918 points (100.0% liked)

Technology

58691 readers
3316 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS