I used to use archive.today to archive news stories. However, between recent trouble accessing the site and the FBI now trying to shut it down, I need to find a new way to archive.

I came across ArchiveBox, but it seems you have to self-host it, and I don’t yet have the tech skills to self-host an application. I also know the Internet Archive has the Wayback Machine, but I have never had good luck using the site.

I am hoping that I can find a site that is similar to archive.today and is not on the FBI’s watchlist.

  • davel@lemmy.ml
    2 days ago

    Why specifically are you using archive.today? To post links that bypass paywalls, or for something else? If it’s for something else, there may be other solutions, like using archive.org or saving the page locally.
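For the "saving the page locally" route, here is a minimal Python sketch using only the standard library. The function names, the `archive` output directory, and the User-Agent string are all hypothetical choices for illustration. It stores only the raw HTML, without images, stylesheets, or scripts, so it is not a replacement for a full-page capture tool.

```python
import re
from datetime import datetime, timezone
from pathlib import Path
from urllib.parse import urlparse
from urllib.request import Request, urlopen


def snapshot_filename(url: str, when: datetime) -> str:
    """Build a filesystem-safe, timestamped file name for a URL.

    The query string is ignored, so only host and path end up in the name.
    """
    parsed = urlparse(url)
    # Collapse anything that is not a letter, digit, dot, or hyphen into "_".
    slug = re.sub(r"[^A-Za-z0-9.-]+", "_", parsed.netloc + parsed.path).strip("_")
    return f"{when.strftime('%Y%m%dT%H%M%SZ')}_{slug or 'page'}.html"


def save_page(url: str, out_dir: str = "archive") -> Path:
    """Download the raw HTML of `url` and write it into `out_dir`."""
    # Some sites reject requests without a User-Agent; this value is a placeholder.
    request = Request(url, headers={"User-Agent": "Mozilla/5.0 (personal archiver sketch)"})
    with urlopen(request, timeout=30) as response:
        body = response.read()
    target = Path(out_dir) / snapshot_filename(url, datetime.now(timezone.utc))
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_bytes(body)
    return target
```

Calling `save_page("https://example.com/news/story-1")` would write a file such as `archive/<timestamp>_example.com_news_story-1.html`. Paywalled or script-heavy pages will likely not save usefully this way.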

  • golden_zealot@lemmy.ml
    1 day ago

    If you have a machine and/or the storage for it, you could deploy a Docker container of Linkwarden and do it yourself for a lot of things.

    It says it’s for “bookmarking”, but in addition to storing the outbound link, it takes backups of pages as text, HTML, and PDF, and can do so recursively with the page’s links. It has a nice interface and makes everything searchable and taggable.

    • starlight@lemmy.ca (OP)
      5 hours ago

      That’s really cool. I didn’t know Linkwarden could do that. I’ll take a closer look at this, thank you!

  • RedStrawberry@lemmy.blahaj.zone
    1 day ago

    As others have said, the SingleFile extension works well. I’ve also found Zotero with its web extension quite good. It’s useful for added organisation/categorisation, especially since I’m already using it for academic work.

    There is also Zimit for use with Kiwix, available both as a command-line version (see GitHub) and as a website if you want something simpler.

    That said, I’ve found the website quite often has long queues, and it may not get a clean backup if the site uses Cloudflare or the like. But it’s useful if I need an offline copy of a website with many pages.

    I recommend having a look at the Archive Team wiki page on software to see if anything fits your needs.

    • starlight@lemmy.ca (OP)
      4 hours ago

      They all look like they can work. Zimit especially looks interesting. I’ll take a look at all of them. Thank you!

  • hexagonwin@lemmy.sdf.org
    2 days ago

    singlefile or webrecorder in chromium-based browsers maybe?

    self-hosting is actually pretty easy :) we’re here to help too.

    for large scale crawling i usually use archiveteam’s grab-site.

  • call_me_xale@lemmy.zip
    2 days ago

    Just learned about Readeck the other day. Self-hosted for now, but it sounds like they’re planning to launch a centrally hosted instance at some point, so maybe keep an eye on that.

  • Enternasyonal@lemmygrad.ml
    2 days ago

    Tbh the Internet Archive’s Wayback Machine is the best option I can think of. It’s easy to use, and the only problems I’ve had were with old archives from the late 90s and early 2000s, which sometimes didn’t load.
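For anyone who finds the Wayback Machine site itself awkward, a rough Python sketch of working with it by URL follows. It assumes the public availability endpoint at `https://archive.org/wayback/available` and the `https://web.archive.org/save/` prefix used by Save Page Now behave as commonly documented. The helper names are hypothetical, and the sample response in the usage note is illustrative rather than real data.

```python
import json
from typing import Optional
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed endpoints; check the Internet Archive's own docs before relying on them.
AVAILABILITY_API = "https://archive.org/wayback/available"
SAVE_PREFIX = "https://web.archive.org/save/"


def availability_url(page_url: str) -> str:
    """URL that asks whether a snapshot of `page_url` already exists."""
    return f"{AVAILABILITY_API}?{urlencode({'url': page_url})}"


def save_url(page_url: str) -> str:
    """URL that, when opened in a browser, requests a fresh capture."""
    return SAVE_PREFIX + page_url


def closest_snapshot(api_response: dict) -> Optional[str]:
    """Pull the closest snapshot URL out of an availability response, if any."""
    closest = api_response.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest.get("url")
    return None


def lookup(page_url: str) -> Optional[str]:
    """Query the availability endpoint over the network (not run on import)."""
    with urlopen(availability_url(page_url), timeout=30) as response:
        return closest_snapshot(json.load(response))
```

In use, `lookup("https://example.com/story")` returns a snapshot link or `None`. If it returns `None`, opening `save_url("https://example.com/story")` in a browser should trigger a capture. Captures can be slow or fail on sites that block crawlers.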