Archiving paper documents as PDF files is a great way to save shelf space and preserve essential records.
However, scanning alone is not enough. You should also process the scans with Optical Character Recognition (OCR). Once OCR has processed a scanned PDF, the file contains an invisible text layer in addition to the scanned image of the document. macOS Spotlight can then index the content, and you can use HoudahSpot to search your document archive.
But what if some of your PDF files lack OCR text?
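As a rough illustration of how you might spot PDF files without a text layer, here is a minimal Python sketch. It is not the HoudahSpot approach this article goes on to describe. It assumes the third-party pypdf package is installed, and the function names and the `min_chars` threshold are hypothetical choices made for this example.

```python
from pathlib import Path
from typing import Callable, List


def extract_text_with_pypdf(path: Path) -> str:
    """Return all text pypdf can extract from the PDF at `path`."""
    # Assumption: the third-party pypdf package is installed (pip install pypdf).
    from pypdf import PdfReader

    reader = PdfReader(path)
    # extract_text() may yield an empty result for image-only pages.
    return "".join(page.extract_text() or "" for page in reader.pages)


def pdfs_lacking_text(
    folder: str,
    extract: Callable[[Path], str] = extract_text_with_pypdf,
    min_chars: int = 20,  # hypothetical threshold; tune for your archive
) -> List[Path]:
    """List PDFs under `folder` whose extracted text is shorter than `min_chars`."""
    missing = []
    # Note: this pattern matches the lowercase ".pdf" extension only.
    for pdf in sorted(Path(folder).rglob("*.pdf")):
        if len(extract(pdf).strip()) < min_chars:
            missing.append(pdf)
    return missing
```

Calling `pdfs_lacking_text("/path/to/archive")` would return the files that probably still need OCR. Treat the result as a hint only: a scan with very little text on it will also be reported, and encrypted or damaged files may raise an error instead of being listed.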