485
save this rule (lemmy.dbzer0.com)
submitted 1 year ago by antonim@lemmy.dbzer0.com to c/196
you are viewing a single comment's thread
view the rest of the comments
[-] LoamImprovement@beehaw.org 8 points 1 year ago

Hold up, does someone know how to save an entire site? I would really like to get the 5e wikidot archived in case Hasbro or whoever wants to shut it down for good.

[-] Kolanaki@yiffit.net 7 points 1 year ago* (last edited 1 year ago)

Probably a browser extension these days. I had one back in the late 90's or early 2000's that would simply download the page you were on, as well as every page, image, audio file, etc. on every recursive link on that page.

This was back when most websites had a table of contents link somewhere, though. There are plenty of sites now that don't link to every page contained on the domain and are only accessible if you manually enter the URL or use dynamically created pages that only exist upon request.

[-] anton 5 points 1 year ago

It won't save everything, but if a script follows every link recursively, most content should be reached that way. That's kind of what Google does but for one site instead of the internet.

If there is a search function try very simple queries.

The alternative of brute forcing links would be unfeasible, even if you are not rate limited by the site, due to the exponential complexity.

If you want to do something please look into api/scraping etikette like exponential back off.

[-] jherazob@beehaw.org 5 points 1 year ago

There's software that browses to the homepage of a site and starts traversing it all, saving it all in the process

[-] zzz@feddit.de 3 points 1 year ago

Link? And where can I upload a PDF* of the site to share with you? tmpfiles.org’s short duration probably won’t cut it…

*Although I’m certain The Saver™️ would only do full webarchive zips, for us casuals, the PDF export shall do (and be easier in day to day use)

[-] LoamImprovement@beehaw.org 2 points 1 year ago

http://dnd5e.wikidot.com/

Honestly it's not the information so much as the way it's organized that I'd like to save. It is the best resource for putting together characters, currently.

this post was submitted on 13 Dec 2023
485 points (100.0% liked)

196

17535 readers
587 users here now

Be sure to follow the rule before you head out.


Rule: You must post before you leave.



Other rules

Behavior rules:

Posting rules:

NSFW: NSFW content is permitted but it must be tagged and have content warnings. Anything that doesn't adhere to this will be removed. Content warnings should be added like: [penis], [explicit description of sex]. Non-sexualized breasts of any gender are not considered inappropriate and therefore do not need to be blurred/tagged.

If you have any questions, feel free to contact us on our matrix channel or email.

Other 196's:

founded 2 years ago
MODERATORS