btw the hoarder project is an active victim of a patent troll[0][1]; the official Firefox extension is currently blocked by dmca[2]. any donations might be helpful.
Set this up a couple weeks using an proxmox lxc script and have it using ollama to create tags. I hadn’t heard of singlefile before. That seems like an excellent pairing.
Worth noting that Linkding (what the author migrated from to Hoarder) also now supports page archiving via headless Chrome + SingleFile and also via manual upload: https://linkding.link/archiving/
That's what single file is for. Hoarder fetches the webpage using it's own browser, single file makes a copy using your browser including any sessions, then sends that to hoarder.
btw the hoarder project is an active victim of a patent troll[0][1]; the official Firefox extension is currently blocked by dmca[2]. any donations might be helpful.
[0]: https://github.com/hoarder-app/hoarder/commit/b2c795ccb562c0...
[1]: https://www.reddit.com/r/selfhosted/s/CMCPP7cc8i
[2]: https://github.com/hoarder-app/hoarder/issues/899
Set this up a couple weeks using an proxmox lxc script and have it using ollama to create tags. I hadn’t heard of singlefile before. That seems like an excellent pairing.
Worth noting that Linkding (what the author migrated from to Hoarder) also now supports page archiving via headless Chrome + SingleFile and also via manual upload: https://linkding.link/archiving/
Thoughts on this vs something like ArchiveBox?
No really the same goal. In Hoarder, the goal is to tag and make content easily searchable. The cached part is a plus, not the main goal.
Actually, it's good but not an cached archive, its a just a cached zen mode version of the webpage (or full file if it is a PDF, EPUB, ...).
Can Hoarder archive a webpage protected by some kind of auth / login?
That's what single file is for. Hoarder fetches the webpage using it's own browser, single file makes a copy using your browser including any sessions, then sends that to hoarder.