So I'm looking for a good self-hosted document store. Right now I'm just keeping them in my owncloud tree, which is cool for anything I'm actively working on, but for archival I'm not real happy with it as a solution, mostly because it clutters up my sync tree and I simply don't need archives synced between devices.
So far I've been looking at Alfresco CMS, Ambar, and Paperless. I'm definitely open to other suggestions.
I'm looking for a handful of features.
Critical
Organization - a folder structure at minimum
OCR for scans
Searchability - based on both filenames and document contents
Self-hosted
Native Linux server ("native" meaning "built for Linux", not "must be written in a language that compiles to native bytecode")
Important
Organization - ideally, both by date and category, e.g. folders for dates and some sort of a tag system for categories
Either application-level implementation of file access protocols (e.g. FTP, SMB; Alfresco does this) or on-disk storage mirroring the application-level folder layout so I can just throw (e.g.) an FTP daemon and samba on top (like how owncloud/nextcloud stores its files).
A decent web UI, functionally speaking - aesthetics are nice, but I really care more about usability
Docker images
Nice to have
Sample Kubernetes deployment so I don't need to write my own
Integration with scanning software (like Paperless has)
Multitenancy
Lightweight
Thanks in advance for any recommendations
So far I've been looking at Alfresco CMS, Ambar, and Paperless. I'm definitely open to other suggestions.
I'm looking for a handful of features.
Critical
Organization - a folder structure at minimum
OCR for scans
Searchability - based on both filenames and document contents
Self-hosted

Native Linux server ("native" meaning "built for Linux", not "must be written in a language that compiles to native bytecode")
Important
Organization - ideally, both by date and category, e.g. folders for dates and some sort of a tag system for categories
Either application-level implementation of file access protocols (e.g. FTP, SMB; Alfresco does this) or on-disk storage mirroring the application-level folder layout so I can just throw (e.g.) an FTP daemon and samba on top (like how owncloud/nextcloud stores its files).
A decent web UI, functionally speaking - aesthetics are nice, but I really care more about usability
Docker images
Nice to have
Sample Kubernetes deployment so I don't need to write my own

Integration with scanning software (like Paperless has)
Multitenancy
Lightweight
Thanks in advance for any recommendations
Last edited by billfoster on Tue Nov 07, 2023 9:49 am, edited 1 time in total.