Self-Hosting Paperless-GPT: AI-Powered Document Classification

Self-Hosting Paperless-GPT: AI-Powered Document Classification You’ve scanned your documents into Paperless-ngx. Maybe hundreds, maybe thousands. Now comes the tedious part — naming them, tagging them, sorting them into the right categories. Every receipt, invoice, letter, and tax form needs a sensible title and the right tags, or your digital filing cabinet becomes a digital junk drawer. Paperless-GPT solves this by connecting your Paperless-ngx instance to a large language model. Drop a document in, and the AI generates a title, assigns tags, identifies the correspondent, and even extracts custom field data. It can also re-OCR your documents using LLM vision, catching text that traditional OCR engines miss on messy or low-quality scans. ...

March 17, 2026 · 7 min · Self Host Setup

Self-Hosting Paperless-ngx: Go Paperless at Home

Every household drowns in paper. Tax documents, medical records, receipts, warranties, letters — they pile up in drawers and filing cabinets until you need one and can’t find it. Paperless-ngx fixes this permanently. Scan or photograph a document, drop it in a folder, and Paperless automatically OCRs it, extracts the text, tags it, and makes it searchable. Finding any document takes seconds instead of minutes. Why Paperless-ngx? Full-text search across every document you’ve ever scanned Automatic OCR — extracts text from scanned images and PDFs Smart tagging — learns your patterns and auto-categorizes Correspondent detection — knows who sent what Multiple file formats — PDF, PNG, JPEG, TIFF, even Office documents Mobile-friendly web UI for access anywhere on your network Prerequisites Docker and Docker Compose installed At least 2GB RAM (OCR is memory-hungry) Storage space for your documents (plan ~5MB per page average) Docker Compose Setup Create a directory and compose file: ...

February 19, 2026 · 5 min · Self Host Setup