Handling Image-only Pages
Pages without a text layer are a special case for conversion. You can
decide how the program should handle these pages: convert them with
the built-in Optical Character Recognition (OCR), transfer them as
images to the target document or skip them. You can require inspection
of the first pages (up to ten) in files you open. Optionally, you can set
conversion to be stopped, if no text-layer pages are detected.
If you have Nuance® OmniPage®, you can use this to have more control
over the recognition process.
PDF Converter supports over 100 languages, including Danish, Dutch,
English, Finnish, French, German, Italian, Norwegian, Polish,
Portuguese, Spanish and Swedish. The program can convert
multi-lingual documents. A full list of supported languages is provided
in Help. Correct language choice is important for converting
image-only pages and handling non-standard encoding.
A new choice is ‘Auto-detect’, which is designed for unattended
processing where input image-only PDFs might be in different
languages. This examines each PDF and assigns a language to it,
choosing from the 22 languages with dictionary support.
SharePoint and other DMS support
PDF or XPS files can be opened from Microsoft SharePoint through a set
of dialog boxes incorporated in PDF Converter, accessed from the File
menu item Open from SharePoint. Other detected Document
Management Systems (Hummingbird, Worksite, DocsOPEN, LiveLink)
also have a menu item; they can be accessed directly through their own
interfaces. See Help for detail. Converted files can be stored in
SharePoint or other systems with the Store in Document Management
System section in the Output Options panel.