Optical Character Recognition (OCR) for scanned PDFs

Modified on: Wed, 27 Sep, 2023 at 1:26 AM

Navigation
How to Activate Text Recognition for Scanned PDF Files
How to Perform Text Recognition

Restriction: You need to be signed in using an administrator's account in order to activate this function within the settings menu.

User Menu > Settings > Configuration Options > Recognition > Text

The Optical Character Recognition (OCR) function can also read text from scanned PDF documents. OCR is generally used to read text from images, e.g. from scanned documents.

Text recognition for scanned PDFs must be enabled in the settings.

Note: The text recognition function is a feature that runs automatically if the PDF is a text-based document. If aspects of the text were originally in image format, Optical Character Recognition (OCR) will need to be a manual process that can be performed for individual assets or in bulk.

Note: Please contact your Canto Account Manager to enable this feature.

How to activate Text Recognition for scanned PDF files

Select the Setup Optical Character Recognition (OCR) for scanned PDFs checkbox.

How to perform Text Recognition

You can perform text recognition for individual PDFs separately, or for several PDFs at once:

To read text from a single scanned PDF, open the PDF in Preview View and click the (OCR for PDF) icon in the toolbar.

To read text from multiple scanned PDFs, select the desired PDF files and click the (OCR for PDFs) icon in the toolbar.

The recognized text will be saved in the Document Text field in each case.

Navigation

How to activate Text Recognition for scanned PDF files

How to perform Text Recognition

Related Articles