39
Chapter 5 Saving and exporting 66
1.
Choose Save to Files in the Export Results drop-down list. In the dialog box that appears,
select Image under Save as.
2.
Choose a folder location and a file type. Type in a file name.
3.
Select to save the selected zone image(s) only, the current page image, selected page
images or all images in the document. For multiple zones or multiple pages, you can have
all images in a single multi-page image file, providing you set TIFF, MAX, DCX, JB2 or
Image-only PDF or XPS as file type. Otherwise each image is placed in a separate file.
OmniPage adds numerical suffixes to the file name you provide, to generate unique file
names.
4.
Click Options... if you want to specify a saving mode (black-and-white, grayscale, color or
‘As is’), a maximum resolution and other settings. For TIFF files, you specify the
compression method here.
5.
Click OK to save the image(s) as specified. Zones and recognized text are not saved with
the file.
Saving recognition results
You can save recognized pages to disk in a wide variety of file types.
1.
Choose Export Results... in the File menu, or click the Export Results button in the
OmniPage Toolbox with Save to Files selected in the drop-down list.
2.
The Save to Files dialog box appears. Select Text under Save as.
3.
Select a folder location and a file type for your document. Select a page range, file options,
naming options and a formatting level for the document. See “Selecting a formatting
level” further down.
4.
Type in a file name. Click Options... if you want to specify precise settings for the export.
See “Selecting converter options” later in this chapter.
5.
Click OK. The document is saved to disk as specified. If View Result is selected, the
exported file will appear in its target application; that is the one associated with the
selected file type in your Windows system or in the advanced saving options for your
selected file type converter.
30
Chapter 5 Saving and exporting 67
Selecting a formatting level
The formatting level for export is defined at export time, in the saving dialog box (Save to
Files, Copy to Clipboard, Send in Mail or other dialog box). Three of the levels correspond to
the format views of the same name in the Text Editor. However, the level to be applied for
saving is independent of the formatting view displayed in the Text Editor. When exporting to
file or mail, first specify a file type. This determines which formatting levels are available.
The formatting levels are:
Plain Text
This exports plain decolumnized left-aligned text in a single font and font size.
When exporting to Text or Unicode file types, graphics and tables are not
supported. You can export plain text to nearly all file types and target applications;
in these cases graphics, tables and bullets can be retained.
Formatted Text
This exports decolumnized text with font and paragraph styling, along with
graphics and tables. This is available for nearly all file types.
Flowing Page
wherever possible with column and indent settings, not with text boxes or frames.
Text will then flow from one column to the other, which does not happen when
text boxes are used.
True Page
This keeps the original layout of the pages, including columns. This is done with
text, picture and table boxes and frames. This is offered only for target
applications capable of handling these. True Page formatting is the only choice for
XML export and for all PDF export, except to the file type ‘PDF Edited’.
Spreadsheet
This exports recognition results in tabular form, suitable for use in spreadsheet
applications. This places each document page onto a separate worksheet.
When exporting to Microsoft Excel, 'Spreadsheet' is good for saving whole-page
tables. Prefer 'Formatted Text' if your document contains smaller tables: each table will be
30
Chapter 5 Saving and exporting 68
placed on a separate worksheet with non-table parts placed in an index worksheet with
hyperlinks to each relevant worksheet
Selecting converter options
Click the Options... button in a saving dialog box to have precise control over the export. This
brings up a dialog box with the name of the converter associated with the current file type. It
presents a series of options tailored to this file type. First, confirm or change the formatting
level, because this influences which other options are presented. Select options as desired.
Help details how to do this.
To make changes apply to all future export done with the given converter, select the
checkmark Make changes permanent. If this is not selected, changes are applied to the
current export only and are not saved for future use. Export settings can be changed and saved
without a document save – choose Tools/Saving Preferences...
.
Using multiple converters
Multiple converters allow you to export to two or more file types in one export step. Choose
Multiple in the saving dialog box:
To make your own multiple converter, open the Saving Preferences dialog box from the Tools
menu. Choose the heading Multiple converters. Select a converter and click Create from... .
This will make a copy of the selected converter that you can freely modify without
overwriting the original one.
The new converter appears in the list. Select it and click Options... to specify its settings. You
receive a list of all text converters, followed by all image converters. Checkmark the desired
ones. Optionally specify sub-folder paths for each file type.
You can save pages with different formatting levels or file options to the different file types, as
defined in their simple converters. A few saving operations cannot be done with multiple
converters. These are:
Saving OmniPage Documents
OmniPage workflows cannot be saved via multiple converters. Use the File menu or a
workflow with a step Save to OPD.
31
Chapter 5 Saving and exporting 69
Saving to two targets
For instance, you cannot use a multiple converter to save a document to file and also send it in
mail. Use a workflow with two saving steps, or perform two separate saves.
Saving different page ranges
You cannot save different page ranges to different file types, because only one set of selected
pages can exist at saving time. For the same reason, a single workflow cannot be used either.
Perform two separate saves or use two workflows.
Saving to PDF
You have five choices when saving to Portable Document Format (PDF) files. The first four
are presented as Text converters, the last one is listed among the Image converters.
PDF (Normal):
Pages are exported as they appeared in the Text Editor in True Page view. The PDF file can be
viewed and searched in a PDF viewer and edited in a PDF editor.
PDF Edited:
Use this if you have made significant editing changes in the recognition results. You have
three formatting level choices, including True Page. The PDF file can be viewed, searched
and edited.
PDF Searchable Image
The PDF file is viewable only and cannot be modified in a PDF editor. The original images
are exported, but there is a linked text file behind each image, so the text can be searched. A
found word is highlighted in the image.
PDF with image substitutes:
As for PDF (Normal), but words containing reject and suspect characters have image
overlays, so these uncertain words display as they were in the original document. The PDF
file can be viewed, searched and edited.
PDF Image:
The original images are exported. The PDF file is viewable only and cannot be modified in a
PDF editor and text cannot be searched.
Besides the above flavors, you can use other parameters in defining your PDF output by
clicking Options:
31
Chapter 5 Saving and exporting 70
PDF 1.6 or 1.7
Save to PDF version 1.6 or 1.7 for enhanced security, markup and attachment embedding
functionality.
PDF/A
Choose to create PDF/A compliant files to be confident that files display identically
regardless of the computer environment and remain readable even after many years of
technological evolution.
Tagged PDF
Create a tagged PDF file to preserve its structure. This will ensure logical reading order,
correct table structure and more.
PDF MRC
Use this high compression technology for good quality and smaller file size. Available for
color and grayscale PDF Images or PDF Searchable Images.
Linearized PDF
Choose this to create PDF files optimized for fast loading and display when embedded in web
pages.
Password protection
In OmniPage Professional you can set a type and level of encryption and then define an Open
password and/or a Permissions password for PDF files.
A smaller range of choices is available for saving to XPS files.
Converting from PDF
To extract text content from a PDF file, load it into OmniPage, recognize it, and save the
results to a text format.
A variety of outputs is also available from a PDF file shortcut menu: Word, Excel, RTF,
WordPerfect or text. For more options, use the Convert Now Wizard.
eDiscovery Assistant for searchable PDF
Access this Assistant from the Tools menu or from a PDF file’s shortcut menu in Windows
Explorer. The Assistant is specially designed to create searchable PDF files from image-only
PDF files, or files that already contain some text elements or text pages; it does this without
altering or applying an OCR process to existing text. In other words, it limits its processing to
Documents you may be interested
Documents you may be interested