163 sats \ 7 replies \ @aljaz 8 Dec 2023 \ parent \ on: I grinded ~5000 hrs of Bitcoin content to build WhyBitcoinOnly.com. AMA AMA
Link rot is a big problem, i have a list of bitcoin resources and in a year or two many links die. Pdf/web archive should probably be the default links for sites like this
Yeah, maybe add "(archive)" link after each link?
reply
How would you go about doing this? I've never made a fully backed up archive like you're alluding to, I assumed it'd be extremely storage-intensive to have an offline version made of every single content piece.
reply
Some of it might be as easy as linking to archive.org, though not all of the sites might be on there. There has got to be an program you could run that would create your own archive of the websites. I'll look into it and see if I find anything.
reply
Interesting. If you could I'd really appreciate it! Will be quite busy with family obligations in coming weeks so can't dive into much myself, but definitely all ears to anything you learn!
reply
wayback machine has "save now" function where it will take a snapshot of the website if the website is not forbidding crawlers - that would be the easiest solution but i'm sure some websites will have crawlers disabled in robots.txt
reply
Does it have a bulk capability where I could tell it to save every link on the page (in the 4-digits I believe 😅) or would I need to click through every single one manually?
reply
You can use a browser extension from https://archive.is. Probably a lot of those articles are already archived there.
reply