btw the hoarder project is an active victim of a patent troll[0][1]; the official Firefox extension is currently blocked by dmca[2]. any donations might be helpful.
Talking about hoarding, LTO tapes are the king of cheap storage, but if you want to archive significant amounts (hundreds of TB or more), it takes a significant investment to buy a tape library with somewhat recent drive. Too bad there aren't any alternatives - or are there?
Set this up a couple weeks using an proxmox lxc script and have it using ollama to create tags. I hadn’t heard of singlefile before. That seems like an excellent pairing.
That's what single file is for. Hoarder fetches the webpage using it's own browser, single file makes a copy using your browser including any sessions, then sends that to hoarder.
Worth noting that Linkding (what the author migrated from to Hoarder) also now supports page archiving via headless Chrome + SingleFile and also via manual upload: https://linkding.link/archiving/
btw the hoarder project is an active victim of a patent troll[0][1]; the official Firefox extension is currently blocked by dmca[2]. any donations might be helpful.
[0]: https://github.com/hoarder-app/hoarder/commit/b2c795ccb562c0...
[1]: https://www.reddit.com/r/selfhosted/s/CMCPP7cc8i
[2]: https://github.com/hoarder-app/hoarder/issues/899
Talking about hoarding, LTO tapes are the king of cheap storage, but if you want to archive significant amounts (hundreds of TB or more), it takes a significant investment to buy a tape library with somewhat recent drive. Too bad there aren't any alternatives - or are there?
Tapes are really crap for home use though. They're expensive, super noisy. You constantly have to change them during backing up.
What I do now is use a whole box full of older harddrives that I replaced in my NAS. And I basically use them as tapes with a change frame.
Set this up a couple weeks using an proxmox lxc script and have it using ollama to create tags. I hadn’t heard of singlefile before. That seems like an excellent pairing.
Thoughts on this vs something like ArchiveBox?
No really the same goal. In Hoarder, the goal is to tag and make content easily searchable. The cached part is a plus, not the main goal.
Actually, it's good but not an cached archive, its a just a cached zen mode version of the webpage (or full file if it is a PDF, EPUB, ...).
Can Hoarder archive a webpage protected by some kind of auth / login?
That's what single file is for. Hoarder fetches the webpage using it's own browser, single file makes a copy using your browser including any sessions, then sends that to hoarder.
sounds promising! thanks, i’ll look into this.
Worth noting that Linkding (what the author migrated from to Hoarder) also now supports page archiving via headless Chrome + SingleFile and also via manual upload: https://linkding.link/archiving/