Will a scanner do OCR?

OCR (Optical Character Recognition) is the process of reading the text on a scanned page and converting it into machine-readable characters. The text can then be made searchable inside the document (for example, a searchable PDF) or exported into an editable form such as a Word document.

A document scanner is basically a high-speed camera: it only creates images of the pages. OCR is done by software, so whether you get it depends on the scanning application you are using. Higher-end applications generally do a better job of the OCR. Kofax Express and Canon’s CapturePerfect, for example, offer a scan-to-searchable-PDF facility.

When you are scanning serious volumes of pages, say with a 100 page-per-minute scanner running, you will need to create the OCR’d output as a post-scanning operation. Otherwise the PC will more than likely be unable to OCR pages as fast as they arrive, and the scanner will be held up while the pages are buffered.
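
To put rough numbers on this, here is a minimal Python sketch of the backlog that builds up when OCR runs at the same time as scanning. The OCR rate of 30 pages per minute is purely an assumed figure for illustration, not a measurement of any particular product, and the function names are made up for this example.

```python
def backlog_after(minutes, scan_ppm, ocr_ppm):
    """Pages still waiting for OCR after scanning non-stop for `minutes`."""
    # The backlog grows by the difference between the two rates each minute.
    # If OCR is at least as fast as the scanner, nothing queues up.
    return max(0, (scan_ppm - ocr_ppm) * minutes)


def catch_up_minutes(pages, ocr_ppm):
    """Minutes the PC needs to OCR `pages` once scanning has stopped."""
    return pages / ocr_ppm


if __name__ == "__main__":
    scan_ppm = 100  # scanner speed from the example above
    ocr_ppm = 30    # ASSUMED OCR throughput of the PC, illustration only

    waiting = backlog_after(10, scan_ppm, ocr_ppm)
    print("Pages waiting after 10 minutes:", waiting)
    print("Minutes to clear the backlog:", round(catch_up_minutes(waiting, ocr_ppm), 1))
```

With these assumed rates, ten minutes of scanning leaves 700 pages waiting, which would take the PC a little over 23 minutes to clear. Running the OCR as a separate step afterwards lets the scanner keep working at full speed in the meantime.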